[ https://issues.apache.org/jira/browse/TIKA-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992739#comment-15992739 ]
Tilman Hausherr edited comment on TIKA-2342 at 1/2/25 8:07 AM: --------------------------------------------------------------- I've traced it down to PDFBox issue: PDFBOX-3774 Thank you Tim. was (Author: ninoskopac): I've traced it down to PDFBox issue: https://issues.apache.org/jira/browse/PDFBOX-3774 Thank you Tim. > Broken words > ------------ > > Key: TIKA-2342 > URL: https://issues.apache.org/jira/browse/TIKA-2342 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.14 > Environment: Tika app and Tika server > Reporter: Nino Skopac > Assignee: Tilman Hausherr > Priority: Major > Fix For: 3.0.1, 4.0.0 > > > Original PDF text: "Each certified or noncertified member" > Tika extracted text: "Each certifi ed or noncertifi ed member" -- This message was sent by Atlassian Jira (v8.20.10#820010)