[ 
https://issues.apache.org/jira/browse/TIKA-2581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16371999#comment-16371999
 ] 

ASF GitHub Bot commented on TIKA-2581:
--------------------------------------

ewanmellor opened a new pull request #221: Fix for TIKA-2581 contributed by 
ewanmellor.
URL: https://github.com/apache/tika/pull/221
 
 
   TesseractOCRParserTest.testOCROutputsHOCR fails with Tesseract 4.0.
   
   With 3.x, the output is <span>Happy</span> but with 4.0 the output is
   <span><strong>Happy</strong></span>.  Both these seem reasonable to me,
   so update the test to accept either of them.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> testOCROutputsHOCR fails with Tesseract 4.0
> -------------------------------------------
>
>                 Key: TIKA-2581
>                 URL: https://issues.apache.org/jira/browse/TIKA-2581
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.17
>            Reporter: Ewan Mellor
>            Priority: Minor
>
> TesseractOCRParserTest.testOCROutputsHOCR fails with Tesseract 4.0.
> With 3.x, the output is <span>Happy</span> but with 4.0 the output is 
> <span><strong>Happy</strong></span>.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to