[ https://issues.apache.org/jira/browse/TIKA-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14628890#comment-14628890 ]
Tilman Hausherr commented on TIKA-1588: --------------------------------------- The weird thing is that I can't find any differences with ExtractText and default settings. "respondæ" appears in both extractions. "æ" is an arrow in the PDF. > Upgrade to PDFBox 1.8.10 when available > --------------------------------------- > > Key: TIKA-1588 > URL: https://issues.apache.org/jira/browse/TIKA-1588 > Project: Tika > Issue Type: Improvement > Components: parser > Reporter: Tim Allison > Assignee: Tim Allison > Priority: Minor > Attachments: reports_1_8_9_vs_1_8_10.zip > > > Let's use this ticket to discuss/prepare for the release and integration of > PDFBox 1.8.10 when it is available. -- This message was sent by Atlassian JIRA (v6.3.4#6332)