This is an automated email from the ASF dual-hosted git repository.

tballison pushed a change to branch TIKA-4728-js-in-pdf
in repository https://gitbox.apache.org/repos/asf/tika.git


    from 155d2f6806 TIKA-4728 - add strict validation as an option
     add 32d5f9e01d TIKA-4728 - further tag fixes

No new revisions were added by this update.

Summary of changes:
 .../microsoft/ooxml/AbstractOOXMLExtractor.java    | 12 ++++
 .../microsoft/ooxml/OOXMLTikaBodyPartHandler.java  | 38 +++++++++++
 .../ooxml/OOXMLWordAndPowerPointTextHandler.java   |  7 ++
 .../ooxml/SXSLFPowerPointExtractorDecorator.java   | 16 ++++-
 .../ooxml/SXWPFWordExtractorDecorator.java         |  6 ++
 .../ooxml/XSSFExcelExtractorDecorator.java         | 36 ++++++++++
 .../org/apache/tika/parser/epub/EpubParser.java    | 55 +++++++++++++---
 .../tika/parser/odf/OpenDocumentBodyHandler.java   | 77 ++++++++++++++++++++++
 .../apache/tika/parser/pdf/AbstractPDF2XHTML.java  |  4 +-
 9 files changed, 239 insertions(+), 12 deletions(-)
