[
https://issues.apache.org/jira/browse/TIKA-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18077160#comment-18077160
]
Tim Allison commented on TIKA-4683:
-----------------------------------
Y, I noticed the tika-eval continuing problem. Thank you!
I fixed epubs. I did a deep dive into ooxml diffs, and those actually all look
ok.
The charset detection feels like a wash with some better and others not.
I'm going to rollback to 3.x's charset detection chain and rerun.
> Prep for 4.0.0-ALPHA release
> ----------------------------
>
> Key: TIKA-4683
> URL: https://issues.apache.org/jira/browse/TIKA-4683
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
> Attachments: reports-20260429.tar.gz, reports-4.0.0-20260411.tgz,
> reports.tar.gz
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)