revisiting a prior, never resolved issue
tika 2.4.1 'Text extraction failed' errors when dovecot+fts 2.3.19.1
passes embedded *.eml (message/rfc822) files ;
org.apache.tika.parser.mail.RFC822Parser or dovecot ?
https://dovecot.org/list/dovecot/2022-August/125085.html
i now run
revisiting a prior, never resolved issue
tika 2.4.1 'Text extraction failed' errors when dovecot+fts 2.3.19.1
passes embedded *.eml (message/rfc822) files ;
org.apache.tika.parser.mail.RFC822Parser or dovecot ?
https://dovecot.org/list/dovecot/2022-August/125085.html
i now run
soon-to-be-next-release of Tika,
i saw that was coming
you can turn off throwing zero-byte file exceptions via the config
can you point to the config toggle, or docs, in https://github.com/apache/tika ?
The exceptions should be harmless and you can safely ignore them.
including the SEVER
I think with the soon-to-be-next-release of Tika, you can turn off throwing
zero-byte file exceptions via the config. The exceptions should be harmless
and you can safely ignore them.
Just upgraded to tika 2.9.0.
Testing, as below, same error thrown.
not certain of the correct config here :-