[ 
https://issues.apache.org/jira/browse/TIKA-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15992739#comment-15992739
 ] 

Tilman Hausherr edited comment on TIKA-2342 at 1/2/25 8:07 AM:
---------------------------------------------------------------

I've traced it down to PDFBox issue: PDFBOX-3774

Thank you Tim.


was (Author: ninoskopac):
I've traced it down to PDFBox issue: 
https://issues.apache.org/jira/browse/PDFBOX-3774

Thank you Tim.

> Broken words
> ------------
>
>                 Key: TIKA-2342
>                 URL: https://issues.apache.org/jira/browse/TIKA-2342
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.14
>         Environment: Tika app and Tika server
>            Reporter: Nino Skopac
>            Assignee: Tilman Hausherr
>            Priority: Major
>             Fix For: 3.0.1, 4.0.0
>
>
> Original PDF text: "Each certified or noncertified member"
> Tika extracted text: "Each certifi ed or noncertifi ed member"



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to