Tim Allison created TIKA-4650:
---------------------------------

             Summary: Improve zip parsing in 4.x
                 Key: TIKA-4650
                 URL: https://issues.apache.org/jira/browse/TIKA-4650
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison


Zip parsing has a number of quirks that require special processing. Over time 
those have accreted in the PackageParser. Further, there's not great 
coordination between the zip detector and the zip parser...there are some areas 
where we could streamline the detect+parse steps.

Let's create a standalone zip parser and improve the coordination between 
detection and parsing for zip files.




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to