Tim Allison created TIKA-4673:
---------------------------------

             Summary: Add a parser that's a hook for Jina Reader in 4.x
                 Key: TIKA-4673
                 URL: https://issues.apache.org/jira/browse/TIKA-4673
             Project: Tika
          Issue Type: New Feature
            Reporter: Tim Allison


After adding the modern embedding and ocr options, we may want to add a parser 
that hooks Jina Reader for html and PDF cleaning in 4.x



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to