[ 
https://issues.apache.org/jira/browse/TIKA-4043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17731718#comment-17731718
 ] 

Hudson commented on TIKA-4043:
------------------------------

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1110 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk11/1110/])
TIKA-4043 -- fix build issues related to timezone differences and variations of 
output from Tesseract (#1187) (github: 
[https://github.com/apache/tika/commit/32b8cbcf9e900690e67c832b635f36fc91d2330f])
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/test/java/org/apache/tika/parser/microsoft/rtf/RTFParserTest.java
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-ocr-module/src/test/java/org/apache/tika/parser/ocr/TesseractOCRParserTest.java


> Fix build for variations in tesseract and timezone info in RTFs
> ---------------------------------------------------------------
>
>                 Key: TIKA-4043
>                 URL: https://issues.apache.org/jira/browse/TIKA-4043
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>             Fix For: 2.8.1
>
>
> From [~grossws]:
> > * OCR (tesseract) multipage test is still the same, it extracts "Page?2" 
> > instead of "Page 2" on my laptop;
> > * RTFParserTest testMetaDataCounts fails because of a different time zone: 
> > the RTF format itself stores only local date/time in its metadata, so my 
> > local time falls on a different side of midnight (known issue, requires some 
> > changes in metadata to handle correctly). Building with TZ=UTC works 
> > fine.
> We should fix these.
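
The time zone failure described above can be reproduced outside of Tika. The following is a minimal sketch, not the change made in the linked commit: the class name, the helper utcDate, the sample timestamp, and the UTC+3 offset are all hypothetical and chosen only for illustration. It shows how a zone-less local timestamp, such as the one RTF keeps in its metadata, maps to different UTC calendar dates depending on the zone of the machine that interprets it.

```java
import java.time.LocalDate;
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.ZoneOffset;

public class RtfLocalTimeDemo {

    // Interpret a zone-less timestamp as wall-clock time in the given zone
    // and return the calendar date of that same instant in UTC.
    static LocalDate utcDate(LocalDateTime local, ZoneId zone) {
        return local.atZone(zone)
                .withZoneSameInstant(ZoneOffset.UTC)
                .toLocalDate();
    }

    public static void main(String[] args) {
        // Hypothetical creation time, shortly after local midnight.
        LocalDateTime created = LocalDateTime.of(2023, 6, 12, 1, 30);

        // Build machine running with TZ=UTC: the date is unchanged.
        System.out.println(utcDate(created, ZoneOffset.UTC));

        // Build machine three hours ahead of UTC (hypothetical offset):
        // the same wall-clock value is still the previous day in UTC.
        System.out.println(utcDate(created, ZoneOffset.ofHours(3)));
    }
}
```

Under these assumptions, a test that hard-codes the expected date passes with TZ=UTC and fails on the second machine, which matches the behavior reported in the issue.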



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
