Tim Allison created TIKA-3047:
-
Summary: Upgrade to POI 4.1.2
Key: TIKA-3047
URL: https://issues.apache.org/jira/browse/TIKA-3047
Project: Tika
Issue Type: Task
Reporter: Tim Allison
All,
I recently downloaded attachments from the following bug trackers:
COMPRESS, TIKA, PDFBox, POI, Open Office, Libre Office and ghostscript:
http://162.242.228.174/docs/bugtrackers/
I then unpackaged/uncompressed all of the package/compressed files so:
COMPRESS-115-1.zip is the second fil
[
https://issues.apache.org/jira/browse/TIKA-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3046:
--
Description: Add format detection for .cdr, .bau, .sob, .oxt, .odp, .odb.
In unpacking attachments to Li
Tim Allison created TIKA-3046:
-
Summary: Add detection of some open office related formats
Key: TIKA-3046
URL: https://issues.apache.org/jira/browse/TIKA-3046
Project: Tika
Issue Type: Task
Hi,
Just following up on this. Do you know why my code isn’t working?
Thank you,
Max
> On Feb 10, 2020, at 2:04 PM, Max Franklin wrote:
>
> Hello,
>
>
> I'm sorry for the inconvenience, but I've been using Tika as part of a
> Python code to extract text from PDFs and convert it into a TXT f
Tim Allison created TIKA-3045:
-
Summary: Allow users to run custom parsing of xfa and xmp
Key: TIKA-3045
URL: https://issues.apache.org/jira/browse/TIKA-3045
Project: Tika
Issue Type: Task
[
https://issues.apache.org/jira/browse/TIKA-3043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17036912#comment-17036912
]
CHARUSHEELA BOPARDIKAR commented on TIKA-3043:
--
Tried adding this exclusion i