[
https://issues.apache.org/jira/browse/TIKA-3798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17557407#comment-17557407
]
Tim Allison commented on TIKA-3798:
-----------------------------------
In Tika 2.4.0, we were using junrar 7.5.1.
https://github.com/junrar/junrar/issues/73 shows infinite loops before 7.5.1
https://github.com/junrar/junrar/issues/81 is still open and has follow up
infinite loops from fuzzing.
In short, this is somewhat of a known issue that hasn't been solved yet even in
7.5.2 (I'm guessing, I'll test later today).
> Tika hangs up with some RAR archives
> ------------------------------------
>
> Key: TIKA-3798
> URL: https://issues.apache.org/jira/browse/TIKA-3798
> Project: Tika
> Issue Type: Bug
> Environment: Windows, Tika 2.4.0
> Reporter: Mikhail Gushinets
> Priority: Major
> Attachments: MicrosoftTeams-image.png, rar-files.csv.gz
>
>
> Passing to Tika rar archive might lead to hanging up.
> When trying to unrar this file manually I get this message: "Checksum is not
> calculated right of file as there might be a change of the metadata"
> I understand that the probably reason is some kind of file corruption here
> but it would be nice if Tika would just throw an exception in such case
> rather than hanging up forever.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)