Tim Allison created TIKA-4650:
---------------------------------
Summary: Improve zip parsing in 4.x
Key: TIKA-4650
URL: https://issues.apache.org/jira/browse/TIKA-4650
Project: Tika
Issue Type: Task
Reporter: Tim Allison
Zip parsing has a number of quirks that require special processing. Over time,
the handling for those quirks has accreted in the PackageParser. Further, the
zip detector and the zip parser are not well coordinated, and there are some
areas where we could streamline the detect+parse steps.
Let's create a standalone zip parser and improve the coordination between
detection and parsing for zip files.
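
As a rough illustration of what tighter detect+parse coordination could look
like, here is a minimal sketch that uses only java.util.zip from the JDK. It is
not Tika's detector or parser API. The class and method names
(ZipDetectThenParse, looksLikeZip, listEntries, detectAndParse, sampleZip) are
hypothetical, and the sketch assumes Java 11+ for readNBytes. The point is that
detection peeks at the signature through mark/reset, so the parse step can
consume the same stream without reopening it.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical sketch: NOT Tika code, only JDK classes are used.
public class ZipDetectThenParse {

    // Detect step: peek at the first 4 bytes and rewind.
    // Only the local file header signature "PK\003\004" is checked, so an
    // empty archive (which starts with "PK\005\006") is not recognized here.
    static boolean looksLikeZip(InputStream in) throws IOException {
        in.mark(4);
        try {
            byte[] sig = in.readNBytes(4);
            return sig.length == 4
                    && sig[0] == 'P' && sig[1] == 'K'
                    && sig[2] == 3 && sig[3] == 4;
        } finally {
            in.reset(); // leave the stream positioned at byte 0 for the parser
        }
    }

    // Parse step: stream through the entries and collect their names.
    static List<String> listEntries(InputStream in) throws IOException {
        List<String> names = new ArrayList<>();
        ZipInputStream zis = new ZipInputStream(in);
        ZipEntry entry;
        while ((entry = zis.getNextEntry()) != null) {
            names.add(entry.getName());
        }
        return names;
    }

    // Coordinated detect+parse over a single stream.
    static List<String> detectAndParse(InputStream raw) throws IOException {
        InputStream in = raw.markSupported() ? raw : new BufferedInputStream(raw);
        if (!looksLikeZip(in)) {
            return List.of(); // not a zip by this (simplified) check
        }
        return listEntries(in);
    }

    // Builds a small in-memory zip so the sketch is self-contained.
    static byte[] sampleZip() throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (ZipOutputStream zos = new ZipOutputStream(bos)) {
            for (String name : new String[] {"a.txt", "b.txt"}) {
                zos.putNextEntry(new ZipEntry(name));
                zos.write(name.getBytes(StandardCharsets.UTF_8));
                zos.closeEntry();
            }
        }
        return bos.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        System.out.println(detectAndParse(new ByteArrayInputStream(sampleZip())));
    }
}
```

A real implementation would have to cover the quirks mentioned above, which
this sketch deliberately ignores.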