[jira] [Closed] (TIKA-3305) How do you handle PDFs with custom encoding?

2021-02-24 Thread Nicholas DiPiazza (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas DiPiazza closed TIKA-3305. --- Resolution: Won't Fix > How do you handle PDFs with custom encoding? > ---

[jira] [Commented] (TIKA-3305) How do you handle PDFs with custom encoding?

2021-02-24 Thread Nicholas DiPiazza (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17289960#comment-17289960 ] Nicholas DiPiazza commented on TIKA-3305: - ok thanks! just making sure. > How do

[GitHub] [tika] peterkronenberg opened a new pull request #405: Correct debugging output

2021-02-24 Thread GitBox
peterkronenberg opened a new pull request #405: URL: https://github.com/apache/tika/pull/405 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[jira] [Commented] (TIKA-3290) Extension reading it as eml instead of txt

2021-02-24 Thread Vamsi Molli (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290038#comment-17290038 ] Vamsi Molli commented on TIKA-3290: --- [~lfcnassif] [~nick] Any update on the above questi

[jira] [Commented] (TIKA-3290) Extension reading it as eml instead of txt

2021-02-24 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290041#comment-17290041 ] Nick Burch commented on TIKA-3290: -- [~Vamsi452] You do appear to have mistake a free open

[jira] [Commented] (TIKA-3290) Extension reading it as eml instead of txt

2021-02-24 Thread Vamsi Molli (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290062#comment-17290062 ] Vamsi Molli commented on TIKA-3290: --- [~nick] Sure! > Extension reading it as eml instea

[jira] [Comment Edited] (TIKA-3290) Extension reading it as eml instead of txt

2021-02-24 Thread Vamsi Molli (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17289658#comment-17289658 ] Vamsi Molli edited comment on TIKA-3290 at 2/24/21, 5:14 PM: -

[jira] [Comment Edited] (TIKA-3290) Extension reading it as eml instead of txt

2021-02-24 Thread Vamsi Molli (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17288896#comment-17288896 ] Vamsi Molli edited comment on TIKA-3290 at 2/24/21, 5:16 PM: -

[GitHub] [tika] tballison merged pull request #405: Correct debugging output

2021-02-24 Thread GitBox
tballison merged pull request #405: URL: https://github.com/apache/tika/pull/405 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[jira] [Commented] (TIKA-3305) How do you handle PDFs with custom encoding?

2021-02-24 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290151#comment-17290151 ] Tim Allison commented on TIKA-3305: --- As [~tilman] says, there's no fixing this. OCR is

[jira] [Commented] (TIKA-3305) How do you handle PDFs with custom encoding?

2021-02-24 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290153#comment-17290153 ] Tim Allison commented on TIKA-3305: --- At some point, I'd like to push tika-eval's junk de

[jira] [Commented] (TIKA-3290) Extension reading it as eml instead of txt

2021-02-24 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290163#comment-17290163 ] Tim Allison commented on TIKA-3290: --- Sorry, I haven't been tracking this closely. What,

[jira] [Commented] (TIKA-3303) Broken link to Getting Started page on https://tika.apache.org/

2021-02-24 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290208#comment-17290208 ] Tim Allison commented on TIKA-3303: --- I _think_ I've fixed the website...at least the maj

[jira] [Commented] (TIKA-3303) Broken link to Getting Started page on https://tika.apache.org/

2021-02-24 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290210#comment-17290210 ] Tilman Hausherr commented on TIKA-3303: --- I retested, IMHO there are only two links t

[jira] [Commented] (TIKA-3290) Extension reading it as eml instead of txt

2021-02-24 Thread Vamsi Molli (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290214#comment-17290214 ] Vamsi Molli commented on TIKA-3290: --- [~tallison] With the previous version 1.24.1 the at

[jira] [Commented] (TIKA-3303) Broken link to Getting Started page on https://tika.apache.org/

2021-02-24 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290277#comment-17290277 ] Tim Allison commented on TIKA-3303: --- Thank you, [~tilman] .  Those two should now be fix

[jira] [Commented] (TIKA-3290) Extension reading it as eml instead of txt

2021-02-24 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290481#comment-17290481 ] Tim Allison commented on TIKA-3290: --- We're hoping to release 1.26 fairly soon. My feeli