THausherr commented on PR #1605:
URL: https://github.com/apache/tika/pull/1605#issuecomment-1954084138

   Getting closer while inspecting ArParserTest.java: when working on the 
embedded element, all the detectors (called in CompositeDetector) increase the 
internal offset in ArArchiveInputStream. When the actual element is to be read, 
the offset is at the end of the element so it considers it to be empty.
   
   Thus for some reason, the detectors read from the actual entry instead of 
from some buffer. I'm getting around this by calling 
`TikaInputStream.getPath()` which I already mentioned yesterday, and I'm now 
investigating the failed tests. Those in TruncatedOOXMLTest are harmless, 
looking closer at one of them showed that a useless metadata entry is now 
missing, probably because the exception happens earlier.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@tika.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to