dovecot passes zero byte input stream when passing email with .eml attachment to apache tika parser, causes 'SEVERE' error

2023-08-12 Thread pgnd via dovecot
revisiting a prior, never-resolved issue: tika 2.4.1 'Text extraction failed' errors when dovecot+fts 2.3.19.1 passes embedded *.eml (message/rfc822) files; org.apache.tika.parser.mail.RFC822Parser or dovecot? https://dovecot.org/list/dovecot/2022-August/125085.html i now run

[bug] dovecot passes zero byte input stream when passing email with .eml attachment to apache tika parser, causes 'SEVERE' error

2023-08-17 Thread pgnd via dovecot
revisiting a prior, never-resolved issue: tika 2.4.1 'Text extraction failed' errors when dovecot+fts 2.3.19.1 passes embedded *.eml (message/rfc822) files; org.apache.tika.parser.mail.RFC822Parser or dovecot? https://dovecot.org/list/dovecot/2022-August/125085.html i now run

Re: [bug] dovecot passes zero byte input stream when passing email with .eml attachment to apache tika parser, causes 'SEVERE' error

2023-08-18 Thread pgnd via dovecot
> soon-to-be-next-release of Tika,
i saw that was coming
> you can turn off throwing zero-byte file exceptions via the config
can you point to the config toggle, or docs, in https://github.com/apache/tika ?
> The exceptions should be harmless and you can safely ignore them.
including the SEVER

Re: [bug] dovecot passes zero byte input stream when passing email with .eml attachment to apache tika parser, causes 'SEVERE' error

2023-08-28 Thread pgnd via dovecot
> I think with the soon-to-be-next-release of Tika, you can turn off throwing zero-byte file exceptions via the config. The exceptions should be harmless and you can safely ignore them.
Just upgraded to tika 2.9.0. Testing, as below, same error thrown. Not certain of the correct config here :-