[
https://issues.apache.org/jira/browse/TIKA-4043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17731656#comment-17731656
]
ASF GitHub Bot commented on TIKA-4043:
--------------------------------------
tballison merged PR #1187:
URL: https://github.com/apache/tika/pull/1187
> Fix build for variations in tesseract and timezone info in RTFs
> ---------------------------------------------------------------
>
> Key: TIKA-4043
> URL: https://issues.apache.org/jira/browse/TIKA-4043
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
>
> From [~grossws]:
> > * OCR (tesseract) multipage test is still the same, it extracts "Page?2"
> > instead of "Page 2" on my laptop;
> > * RTFParserTest testMetaDataCounts fails because of a different time zone,
> > since the RTF format itself has only local date/time in meta and I fall on a
> > different side of midnight with my local time (known issue, requires some
> > changes in metadata to handle correctly). Building with TZ=UTC works
> > fine.
> We should fix these.
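
For illustration only, here is a minimal sketch of why the RTF metadata test can be time zone dependent. It is not the change made in PR #1187. The class name, the helper utcDateOf, the sample timestamp, and the +03:00 offset are all hypothetical. It assumes the parser reads a zone-less local timestamp and interprets it in the build machine's default zone.

```java
import java.time.LocalDate;
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.ZoneOffset;

public class RtfLocalTimeSketch {

    // Interpret a zone-less timestamp in the given zone and
    // return the calendar date of that instant in UTC.
    static LocalDate utcDateOf(LocalDateTime local, ZoneId zone) {
        return local.atZone(zone)
                .withZoneSameInstant(ZoneOffset.UTC)
                .toLocalDate();
    }

    public static void main(String[] args) {
        // Hypothetical creation time as stored in an RTF file (no zone information).
        LocalDateTime creation = LocalDateTime.of(2023, 6, 12, 1, 30);

        // On a machine running with TZ=UTC the date is unchanged.
        System.out.println(utcDateOf(creation, ZoneOffset.UTC));

        // On a machine at UTC+3 the same local value maps to the previous UTC day.
        System.out.println(utcDateOf(creation, ZoneOffset.ofHours(3)));
    }
}
```

With these sample values the two calls return different calendar dates, so an assertion on a fixed date string would pass in one zone and fail in the other.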
--
This message was sent by Atlassian Jira
(v8.20.10#820010)