Hi Armin,

just some random thoughts on this:
- the zip format is not deterministic, so don't compare zip streams
  (google for "zip stream deterministic", e.g. [1])
- xml is not deterministic, you would need to use a canonical form - look at
the xml signature mechanism
- we try to preserve the records with the native formats, so they are much
more likely to be deterministic
  ... but it's also not 100% sure ...
- I would sign the xml files and extract the hashes from the signature files

Andi


[1]
https://blog.pivotal.io/labs/labs/barriers-deterministic-reproducible-zip-files



--
View this message in context: 
http://apache-poi.1045710.n5.nabble.com/POI-xlsx-indeterminism-tp5724269p5724270.html
Sent from the POI - Dev mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org

Reply via email to