THausherr commented on PR #1976: URL: https://github.com/apache/tika/pull/1976#issuecomment-2407325929
I see you removed the "colon isn't reliable" code part. Did you test what happens with the file mentioned in that code segment (242970.txt)? IMHO the colon should still be "discriminated" against if other delimiters have the same confidence. I'm currently trying to build a modified version and will then run a regression test on the CSV fileset (this is faster than running the build itself).
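To illustrate the tie-breaking idea, here is a minimal sketch. It is not the code from the PR or from Tika; the class `DelimiterTieBreak`, the method `pickDelimiter`, and the confidence map are hypothetical names used only for illustration. It assumes each candidate delimiter already has a confidence score and shows one possible way to let a non-colon candidate win when it ties with the colon.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class DelimiterTieBreak {

    // Hypothetical helper, not part of Tika: returns the candidate with the
    // highest confidence. On an exact tie, a non-colon candidate replaces ':'.
    static char pickDelimiter(Map<Character, Double> confidences) {
        Character best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (Map.Entry<Character, Double> e : confidences.entrySet()) {
            char candidate = e.getKey();
            double score = e.getValue();
            boolean higher = score > bestScore;
            // The colon only loses when another candidate matches its score exactly.
            boolean tieAgainstColon =
                    best != null && score == bestScore && best == ':' && candidate != ':';
            if (best == null || higher || tieAgainstColon) {
                best = candidate;
                bestScore = score;
            }
        }
        if (best == null) {
            throw new IllegalArgumentException("no candidate delimiters");
        }
        return best;
    }

    public static void main(String[] args) {
        Map<Character, Double> tied = new LinkedHashMap<>();
        tied.put(':', 0.8);
        tied.put(',', 0.8);
        System.out.println(pickDelimiter(tied));
    }
}
```

With this approach the colon still wins whenever its confidence is strictly higher than every other candidate, so only the ambiguous case changes.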