Arvind Jagtap created TIKA-3488:
---
Summary: Security issue XXE in TIKA due to JDOM
Key: TIKA-3488
URL: https://issues.apache.org/jira/browse/TIKA-3488
Project: Tika
Issue Type: Bug
Com
Sebastian Nagel created TIKA-3489:
-
Summary: Robots.txt files frequently identified as message/rfc822
Key: TIKA-3489
URL: https://issues.apache.org/jira/browse/TIKA-3489
Project: Tika
Issue T
[
https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated TIKA-3489:
--
Affects Version/s: 2.0.0
> Robots.txt files frequently identified as message/rfc822
> --
[
https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384905#comment-17384905
]
Tim Allison commented on TIKA-3489:
---
Should we try to detect robots.txt files as their o
[
https://issues.apache.org/jira/browse/TIKA-3153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384913#comment-17384913
]
Sebastian Nagel commented on TIKA-3153:
---
Wasn't this already resolved in 1.25?
{nof
[
https://issues.apache.org/jira/browse/TIKA-2443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384915#comment-17384915
]
Sebastian Nagel commented on TIKA-2443:
---
Looks like this was already resolved in 1.2
[
https://issues.apache.org/jira/browse/TIKA-2443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-2443.
---
Fix Version/s: 1.25
Resolution: Fixed
Thank you [~snagel]!
> Plain text file identified as rfc
[
https://issues.apache.org/jira/browse/TIKA-3153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-3153.
---
Fix Version/s: 1.25
Resolution: Fixed
> Text File identified as message/rfc822
> --
[
https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384931#comment-17384931
]
Tim Allison commented on TIKA-3489:
---
[~nick], any recommendations? {{text/x-robots}} sub
[
https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384966#comment-17384966
]
Tim Allison commented on TIKA-3489:
---
I added mime detection for robots.txt in {{main}} w
https://stackoverflow.com/questions/68402058/tika-isnt-reading-pdf-properly
Not sure there's much we should do on the Tika side.
How hard would it be to add an "extract only text that is on the page" feature?
Best,
Tim
[
https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384992#comment-17384992
]
Sebastian Nagel commented on TIKA-3489:
---
The [robots.txt RFC draft|https://datatrack
Maybe this could be done with the ExtractTextByArea example. However
IIRC the coordinates are awt-like (y 0 on top) coordinates, so the PDF
coordinates should somehow be mapped to this.
Tilman
Am 21.07.2021 um 18:21 schrieb Tim Allison:
https://stackoverflow.com/questions/68402058/tika-isnt-r
[
https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17385061#comment-17385061
]
Hudson commented on TIKA-3489:
--
FAILURE: Integrated in Jenkins build Tika ยป tika-main-jdk8 #2
Tim Allison created TIKA-3490:
-
Summary: Fix serialization in opensearch emitter for embedded
documents
Key: TIKA-3490
URL: https://issues.apache.org/jira/browse/TIKA-3490
Project: Tika
Issue Ty
[
https://issues.apache.org/jira/browse/TIKA-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3490:
--
Description: Serialization isn't working for embedded documents in the
OpenSearch emitter. This fix is
[
https://issues.apache.org/jira/browse/TIKA-3483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3483:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Implement a network policy for Helm Char
[
https://issues.apache.org/jira/browse/TIKA-3454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3454:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Facilitate configuration of translation
[
https://issues.apache.org/jira/browse/TIKA-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3452:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> java.nio.file.FileSystemException Read-o
[
https://issues.apache.org/jira/browse/TIKA-3400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3400:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Use equals for Object and String Compari
[
https://issues.apache.org/jira/browse/TIKA-3404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3404:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Rearchitect GoogleTranslator to use
> h
[
https://issues.apache.org/jira/browse/TIKA-3003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3003:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Remove unused dependencies
> ---
[
https://issues.apache.org/jira/browse/TIKA-3348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3348:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Improve the workflow for extracting and
[
https://issues.apache.org/jira/browse/TIKA-3420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3420:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Set tesseract ocr langauges as docker bu
[
https://issues.apache.org/jira/browse/TIKA-2945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2945:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> AutoDetectParser should skip the content
[
https://issues.apache.org/jira/browse/TIKA-3368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3368:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Add Bill of Materials (BOM) artifact (Ti
[
https://issues.apache.org/jira/browse/TIKA-2758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2758:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Possible error charset detection
> -
[
https://issues.apache.org/jira/browse/TIKA-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3367:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Add Bill of Materials (BOM) artifact
> -
[
https://issues.apache.org/jira/browse/TIKA-2796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2796:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Update GoogleTranslator to use google-cl
[
https://issues.apache.org/jira/browse/TIKA-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3270:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Render non-text in PDFs for OCR
> --
[
https://issues.apache.org/jira/browse/TIKA-3314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3314:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Treat soft hyphens like hyphens
> --
[
https://issues.apache.org/jira/browse/TIKA-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2623:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> get embedded resources in PDF/doc files
[
https://issues.apache.org/jira/browse/TIKA-2794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2794:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Tika extracts text from pdf on MacBook,
[
https://issues.apache.org/jira/browse/TIKA-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2346:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Allow Office format parsers to exclude p
[
https://issues.apache.org/jira/browse/TIKA-2946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2946:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Review how TikaConfig can avoid parsing
[
https://issues.apache.org/jira/browse/TIKA-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2701:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Text is not extracted properly from WMF
[
https://issues.apache.org/jira/browse/TIKA-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2711:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> When parsing a UNIX text file apostrophe
[
https://issues.apache.org/jira/browse/TIKA-2720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2720:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> A parser to output universal sentence en
[
https://issues.apache.org/jira/browse/TIKA-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2492:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Remove pdfdebugger from tika
> -
[
https://issues.apache.org/jira/browse/TIKA-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2346:
--
Fix Version/s: 2.0.1
> Allow Office format parsers to exclude parsing shapes
> -
[
https://issues.apache.org/jira/browse/TIKA-2596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2596:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Make PDF2XHTML and AbstractPDF2XHTML pub
[
https://issues.apache.org/jira/browse/TIKA-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2565:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Upgrade edu.ucar dependencies to 4.6.11
[
https://issues.apache.org/jira/browse/TIKA-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2312:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> [Mp3Parser] expose fields form ID3TagsAn
[
https://issues.apache.org/jira/browse/TIKA-2558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2558:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Add a new pid api to Tika
>
[
https://issues.apache.org/jira/browse/TIKA-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2071:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Tika 2.0 - DefaultParser and CompositeParser
[
https://issues.apache.org/jira/browse/TIKA-2340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2340:
--
Fix Version/s: 2.0.1
> Add explicit deps to tika-parsers which are currently used from transitive
> sco
[
https://issues.apache.org/jira/browse/TIKA-2639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2639:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Update freedesktop.org shared-mime-info-
[
https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1988:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Age Detection Tika Recogniser
>
[
https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1988:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Age Detection Tika Recogniser
> -
[
https://issues.apache.org/jira/browse/TIKA-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2312:
--
Fix Version/s: 2.0.1
> [Mp3Parser] expose fields form ID3TagsAndAudio
> ---
[
https://issues.apache.org/jira/browse/TIKA-2542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2542:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Support in tika-server for getting plain
[
https://issues.apache.org/jira/browse/TIKA-1829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1829:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> org.apache.tika.parser.ocr.TesseractOCRParser
[
https://issues.apache.org/jira/browse/TIKA-1697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1697:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Parser Implementation for AkomaNtoso Leg
[
https://issues.apache.org/jira/browse/TIKA-1953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1953:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> tika-server NullPointerException while proces
[
https://issues.apache.org/jira/browse/TIKA-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2369:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Define a clean Recogniser interface: for
[
https://issues.apache.org/jira/browse/TIKA-3104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-3104:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Detection of memgraph files exported fro
[
https://issues.apache.org/jira/browse/TIKA-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1724:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Create parser for .obo file format.
> ---
[
https://issues.apache.org/jira/browse/TIKA-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2369:
--
Fix Version/s: 2.0.1
> Define a clean Recogniser interface: for objects from binary data; and for
> tex
[
https://issues.apache.org/jira/browse/TIKA-1688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1688:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Tika Version in Metadata
> --
[
https://issues.apache.org/jira/browse/TIKA-1808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1808:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Head section closed too eager
> -
[
https://issues.apache.org/jira/browse/TIKA-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1709:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Tika Server doesn't handle multi-part at
[
https://issues.apache.org/jira/browse/TIKA-1840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1840:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> No way to link slide notes to slide in PPT ou
[
https://issues.apache.org/jira/browse/TIKA-1808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1808:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Head section closed too eager
>
[
https://issues.apache.org/jira/browse/TIKA-1829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1829:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> org.apache.tika.parser.ocr.TesseractOCRP
[
https://issues.apache.org/jira/browse/TIKA-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1709:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Tika Server doesn't handle multi-part attachm
[
https://issues.apache.org/jira/browse/TIKA-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2071:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Tika 2.0 - DefaultParser and CompositePa
[
https://issues.apache.org/jira/browse/TIKA-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1724:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Create parser for .obo file format.
> --
[
https://issues.apache.org/jira/browse/TIKA-1953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1953:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> tika-server NullPointerException while p
[
https://issues.apache.org/jira/browse/TIKA-1840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1840:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> No way to link slide notes to slide in P
[
https://issues.apache.org/jira/browse/TIKA-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1705:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Update ASM dependency to 5.0.4
>
[
https://issues.apache.org/jira/browse/TIKA-1395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1395:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Create embedded image extraction example
> --
[
https://issues.apache.org/jira/browse/TIKA-2340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2340:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Add explicit deps to tika-parsers which
[
https://issues.apache.org/jira/browse/TIKA-1454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1454:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Extracting as HTML loses links in xlsx, ppt,
[
https://issues.apache.org/jira/browse/TIKA-1640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1640:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Make ExternalParser support aliases for
[
https://issues.apache.org/jira/browse/TIKA-1609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1609:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Leverage Google's LibPhonenumber for enh
[
https://issues.apache.org/jira/browse/TIKA-1688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1688:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Tika Version in Metadata
> -
[
https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1607:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Introduce new arbitrary object key/value
[
https://issues.apache.org/jira/browse/TIKA-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1505:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> chmparser breaks down when extracting fr
[
https://issues.apache.org/jira/browse/TIKA-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1738:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> ForkClient does not always delete tempor
[
https://issues.apache.org/jira/browse/TIKA-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1390:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Create tika-example module
> ---
[
https://issues.apache.org/jira/browse/TIKA-1456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1456:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Visual Sentiment API parser
> --
[
https://issues.apache.org/jira/browse/TIKA-1598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1598:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Parser Implementation for Streaming Video
> -
[
https://issues.apache.org/jira/browse/TIKA-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1674:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Add example to show how to extract embedded f
[
https://issues.apache.org/jira/browse/TIKA-1417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1417:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Create Extract Embedded Images from PDFs
[
https://issues.apache.org/jira/browse/TIKA-1697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1697:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Parser Implementation for AkomaNtoso Legal XM
[
https://issues.apache.org/jira/browse/TIKA-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1465:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Implement extraction of non-global varia
[
https://issues.apache.org/jira/browse/TIKA-1609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1609:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Leverage Google's LibPhonenumber for enhanced
[
https://issues.apache.org/jira/browse/TIKA-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1276:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Missing embedded dependencies in tika-bundle
[
https://issues.apache.org/jira/browse/TIKA-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1952:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Access Date is getting modified while ca
[
https://issues.apache.org/jira/browse/TIKA-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1738:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> ForkClient does not always delete temporary b
[
https://issues.apache.org/jira/browse/TIKA-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1366:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Update some of Tika Server services to s
[
https://issues.apache.org/jira/browse/TIKA-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1674:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Add example to show how to extract embed
[
https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1607:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Introduce new arbitrary object key/values dat
[
https://issues.apache.org/jira/browse/TIKA-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1705:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Update ASM dependency to 5.0.4
> ---
[
https://issues.apache.org/jira/browse/TIKA-1640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1640:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Make ExternalParser support aliases for key n
[
https://issues.apache.org/jira/browse/TIKA-1328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1328:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Translate Metadata and Content
>
[
https://issues.apache.org/jira/browse/TIKA-1616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1616:
--
Fix Version/s: (was: 2.0.0)
2.0.0-BETA
> Tika Parser for GIBS Metadata
>
[
https://issues.apache.org/jira/browse/TIKA-1417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1417:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> Create Extract Embedded Images from PDFs Exam
[
https://issues.apache.org/jira/browse/TIKA-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1800:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> MediaType#parse does not decode escaped speci
[
https://issues.apache.org/jira/browse/TIKA-1577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1577:
--
Fix Version/s: (was: 2.0.0)
2.0.1
> NetCDF Data Extraction
>
1 - 100 of 171 matches
Mail list logo