Hi Armin, just some random thoughts on this: - the zip format is not deterministic, so don't compare zip streams (google for "zip stream deterministic", e.g. [1]) - xml is not deterministic, you would need to use a canonical form - look at the xml signature mechanism - we try to preserve the records with the native formats, so they are much more likely to be deterministic ... but it's also not 100% sure ... - I would sign the xml files and extract the hashes from the signature files
Andi [1] https://blog.pivotal.io/labs/labs/barriers-deterministic-reproducible-zip-files -- View this message in context: http://apache-poi.1045710.n5.nabble.com/POI-xlsx-indeterminism-tp5724269p5724270.html Sent from the POI - Dev mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org For additional commands, e-mail: dev-h...@poi.apache.org