[ 
https://issues.apache.org/jira/browse/TIKA-4410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955255#comment-17955255
 ] 

Hudson commented on TIKA-4410:
------------------------------

UNSTABLE: Integrated in Jenkins build Tika ยป tika-main-jdk17 #739 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk17/739/])
TIKA-4410 (#2226) -- improve feature extraction from xlsx (github: 
[https://github.com/apache/tika/commit/31c1a08ad1d08fdc088dc5cbb28f18363414543e])
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/AbstractPOIFSExtractor.java
* (edit) tika-core/src/test/java/org/apache/tika/pipes/PipesClientTest.java
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/java/org/apache/tika/parser/microsoft/ooxml/TruncatedOOXMLTest.java
* (edit) tika-core/src/main/java/org/apache/tika/metadata/Office.java
* (add) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/CommentPersonHandler.java
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/test/java/org/apache/tika/parser/microsoft/POIContainerExtractionTest.java
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/test/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParserTest.java
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/OPCPackageWrapper.java
* (edit) 
tika-core/src/main/java/org/apache/tika/metadata/writefilter/StandardWriteFilter.java
* (edit) 
tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFBExcelExtractorDecorator.java
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFExcelExtractorDecorator.java


> Improve feature extraction from xlsx
> ------------------------------------
>
>                 Key: TIKA-4410
>                 URL: https://issues.apache.org/jira/browse/TIKA-4410
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 4.0.0
>
>
> There are a number of features that we're not currently extracting from xlsx 
> files that would be useful for forensics, digipres and others.
> These include:
> * hidden sheets
> * very hidden sheets
> * comments
> * threaded comments
> * other things



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to