[ https://issues.apache.org/jira/browse/TIKA-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17922763#comment-17922763 ]
Tim Allison commented on TIKA-4375: ----------------------------------- Maybe a tika-eval issue? We should be tokenizing on non-breaking space and "thin space"? > Regression tests for 2.9.3 release > ---------------------------------- > > Key: TIKA-4375 > URL: https://issues.apache.org/jira/browse/TIKA-4375 > Project: Tika > Issue Type: Task > Reporter: Tim Allison > Priority: Major > Attachments: LTWA2JGVJGJ5RVKHTUX6SDS4NTL5UJVQ-p139.pdf, > tika-2.9.2-v-tika-2.9.3-reports.tgz > > -- This message was sent by Atlassian Jira (v8.20.10#820010)