[jira] [Commented] (TIKA-4459) protected ODF encryption detection fail

2025-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18011290#comment-18011290 ] ASF GitHub Bot commented on TIKA-4459: -- tballison merged PR #2291: URL: https://githu

[jira] [Commented] (TIKA-4459) protected ODF encryption detection fail

2025-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1805#comment-1805 ] ASF GitHub Bot commented on TIKA-4459: -- THausherr commented on PR #2291: URL: https:/

[jira] [Commented] (TIKA-4459) protected ODF encryption detection fail

2025-07-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18011007#comment-18011007 ] ASF GitHub Bot commented on TIKA-4459: -- tballison opened a new pull request, #2291: U

[jira] [Commented] (TIKA-1997) Problem in Tika().detect for xml file signed in CADES

2025-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18007605#comment-18007605 ] ASF GitHub Bot commented on TIKA-1997: -- rob975 commented on PR #2281: URL: https://gi

[jira] [Commented] (TIKA-4457) Typo in cad parser module pom for Automatic-Module

2025-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18005235#comment-18005235 ] ASF GitHub Bot commented on TIKA-4457: -- tballison commented on PR #2280: URL: https:/

[jira] [Commented] (TIKA-4457) Typo in cad parser module pom for Automatic-Module

2025-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18005234#comment-18005234 ] ASF GitHub Bot commented on TIKA-4457: -- tballison merged PR #2280: URL: https://githu

[jira] [Commented] (TIKA-1997) Problem in Tika().detect for xml file signed in CADES

2025-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18005228#comment-18005228 ] ASF GitHub Bot commented on TIKA-1997: -- tballison opened a new pull request, #2281: U

[jira] [Commented] (TIKA-4457) Typo in cad parser module pom for Automatic-Module

2025-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18005198#comment-18005198 ] ASF GitHub Bot commented on TIKA-4457: -- jdeolive opened a new pull request, #2280: UR

[jira] [Commented] (TIKA-4453) ForkParser fails on documents with more than 100 embedded documents

2025-07-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18004546#comment-18004546 ] ASF GitHub Bot commented on TIKA-4453: -- tballison opened a new pull request, #2278: U

[jira] [Commented] (TIKA-4453) ForkParser fails on documents with more than 100 embedded documents

2025-07-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18004418#comment-18004418 ] ASF GitHub Bot commented on TIKA-4453: -- tballison merged PR #2277: URL: https://githu

[jira] [Commented] (TIKA-4453) ForkParser fails on documents with more than 100 embedded documents

2025-07-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18004411#comment-18004411 ] ASF GitHub Bot commented on TIKA-4453: -- tballison opened a new pull request, #2277: U

[jira] [Commented] (TIKA-4333) Remove tika-batch from 4.x/main

2025-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18004179#comment-18004179 ] ASF GitHub Bot commented on TIKA-4333: -- tballison merged PR #2276: URL: https://githu

[jira] [Commented] (TIKA-4333) Remove tika-batch from 4.x/main

2025-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18004172#comment-18004172 ] ASF GitHub Bot commented on TIKA-4333: -- tballison opened a new pull request, #2276: U

[jira] [Commented] (TIKA-4451) Remove XMLErrorLogUpdater from tika-eval in 4.x

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003909#comment-18003909 ] ASF GitHub Bot commented on TIKA-4451: -- tballison merged PR #2275: URL: https://githu

[jira] [Commented] (TIKA-4452) Remove FileProfiler from tika-eval in 4.x

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003908#comment-18003908 ] ASF GitHub Bot commented on TIKA-4452: -- tballison merged PR #2274: URL: https://githu

[jira] [Commented] (TIKA-4450) Remove tika-batch from ExtractComparer

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003904#comment-18003904 ] ASF GitHub Bot commented on TIKA-4450: -- tballison merged PR #2273: URL: https://githu

[jira] [Commented] (TIKA-4451) Remove XMLErrorLogUpdater from tika-eval in 4.x

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003899#comment-18003899 ] ASF GitHub Bot commented on TIKA-4451: -- tballison opened a new pull request, #2275: U

[jira] [Commented] (TIKA-4452) Remove FileProfiler from tika-eval in 4.x

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003897#comment-18003897 ] ASF GitHub Bot commented on TIKA-4452: -- tballison opened a new pull request, #2274: U

[jira] [Commented] (TIKA-4450) Remove tika-batch from ExtractComparer

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003896#comment-18003896 ] ASF GitHub Bot commented on TIKA-4450: -- tballison opened a new pull request, #2273: U

[jira] [Commented] (TIKA-4342) Remove tika-batch from tika-eval's FileProfiler

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003879#comment-18003879 ] ASF GitHub Bot commented on TIKA-4342: -- tballison merged PR #2272: URL: https://githu

[jira] [Commented] (TIKA-4342) Remove tika-batch from tika-eval's FileProfiler

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003878#comment-18003878 ] ASF GitHub Bot commented on TIKA-4342: -- tballison commented on PR #2272: URL: https:/

[jira] [Commented] (TIKA-4342) Remove tika-batch from tika-eval's FileProfiler

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003875#comment-18003875 ] ASF GitHub Bot commented on TIKA-4342: -- tballison opened a new pull request, #2272: U

[jira] [Commented] (TIKA-4449) Improve xmp metadata key precision for PDFs

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18003744#comment-18003744 ] ASF GitHub Bot commented on TIKA-4449: -- tballison merged PR #2266: URL: https://githu

[jira] [Commented] (TIKA-4449) Improve xmp metadata key precision for PDFs

2025-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17987922#comment-17987922 ] ASF GitHub Bot commented on TIKA-4449: -- tballison closed pull request #2265: TIKA-444

[jira] [Commented] (TIKA-4437) Extract more info from doc/docx

2025-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17987926#comment-17987926 ] ASF GitHub Bot commented on TIKA-4437: -- tballison merged PR #2262: URL: https://githu

[jira] [Commented] (TIKA-4449) Improve xmp metadata key precision for PDFs

2025-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17987923#comment-17987923 ] ASF GitHub Bot commented on TIKA-4449: -- tballison opened a new pull request, #2266: U

[jira] [Commented] (TIKA-4444) PDFParser shows wrong data in xmp "dc:subject" tag

2025-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17987914#comment-17987914 ] ASF GitHub Bot commented on TIKA-4444: -- tballison opened a new pull request, #2265: U

[jira] [Commented] (TIKA-4441) InputStream is consumed by Tika.detect for certain files

2025-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17986224#comment-17986224 ] ASF GitHub Bot commented on TIKA-4441: -- tballison merged PR #2261: URL: https://githu

[jira] [Commented] (TIKA-4441) InputStream is consumed by Tika.detect for certain files

2025-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17986216#comment-17986216 ] ASF GitHub Bot commented on TIKA-4441: -- tballison opened a new pull request, #2261: U

[jira] [Commented] (TIKA-4436) Fix some potential resource leak

2025-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17958362#comment-17958362 ] ASF GitHub Bot commented on TIKA-4436: -- THausherr merged PR #2250: URL: https://githu

[jira] [Commented] (TIKA-4436) Fix some potential resource leak

2025-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17957627#comment-17957627 ] ASF GitHub Bot commented on TIKA-4436: -- THausherr commented on PR #2250: URL: https:/

[jira] [Commented] (TIKA-4424) Regression in zip-based detection with an InputStream in 3.2.0

2025-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17957006#comment-17957006 ] ASF GitHub Bot commented on TIKA-4424: -- tballison merged PR #2249: URL: https://githu

[jira] [Commented] (TIKA-4424) Regression in zip-based detection with an InputStream in 3.2.0

2025-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17957002#comment-17957002 ] ASF GitHub Bot commented on TIKA-4424: -- tballison opened a new pull request, #2249: U

[jira] [Commented] (TIKA-4434) Extract more info from ppt and pptx

2025-06-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17956450#comment-17956450 ] ASF GitHub Bot commented on TIKA-4434: -- tballison merged PR #2243: URL: https://githu

[jira] [Commented] (TIKA-4434) Extract more info from ppt and pptx

2025-06-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17956443#comment-17956443 ] ASF GitHub Bot commented on TIKA-4434: -- tballison commented on PR #2243: URL: https:/

[jira] [Commented] (TIKA-4434) Extract more info from ppt and pptx

2025-06-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17956442#comment-17956442 ] ASF GitHub Bot commented on TIKA-4434: -- tballison opened a new pull request, #2243: U

[jira] [Commented] (TIKA-4433) Improve handling of null values in StandardWriteFilter

2025-06-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17956371#comment-17956371 ] ASF GitHub Bot commented on TIKA-4433: -- tballison merged PR #2242: URL: https://githu

[jira] [Commented] (TIKA-4433) Improve handling of null values in StandardWriteFilter

2025-06-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17956367#comment-17956367 ] ASF GitHub Bot commented on TIKA-4433: -- tballison opened a new pull request, #2242: U

[jira] [Commented] (TIKA-4432) Issue with EMF Parser Merging Header, Page Number, and First Content Word During Extraction

2025-06-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17956092#comment-17956092 ] ASF GitHub Bot commented on TIKA-4432: -- aashishtudu closed pull request #2241: [TIKA-

[jira] [Commented] (TIKA-4432) Issue with EMF Parser Merging Header, Page Number, and First Content Word During Extraction

2025-06-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17956084#comment-17956084 ] ASF GitHub Bot commented on TIKA-4432: -- aashishtudu opened a new pull request, #2241:

[jira] [Commented] (TIKA-4430) Extract more info from xls files

2025-06-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955895#comment-17955895 ] ASF GitHub Bot commented on TIKA-4430: -- tballison merged PR #2240: URL: https://githu

[jira] [Commented] (TIKA-4427) Memory Leak when parsing a large (110K+) number of documents

2025-06-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955889#comment-17955889 ] ASF GitHub Bot commented on TIKA-4427: -- tballison merged PR #2239: URL: https://githu

[jira] [Commented] (TIKA-4430) Extract more info from xls files

2025-06-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955886#comment-17955886 ] ASF GitHub Bot commented on TIKA-4430: -- tballison opened a new pull request, #2240: U

[jira] [Commented] (TIKA-4427) Memory Leak when parsing a large (110K+) number of documents

2025-06-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955664#comment-17955664 ] ASF GitHub Bot commented on TIKA-4427: -- tballison opened a new pull request, #2239: U

[jira] [Commented] (TIKA-4410) Improve feature extraction from xlsx

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955342#comment-17955342 ] ASF GitHub Bot commented on TIKA-4410: -- tballison merged PR #2229: URL: https://githu

[jira] [Commented] (TIKA-4410) Improve feature extraction from xlsx

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955338#comment-17955338 ] ASF GitHub Bot commented on TIKA-4410: -- tballison opened a new pull request, #2229: U

[jira] [Commented] (TIKA-4357) Ensure namespace prefixes in metadata keys in 4.x

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955226#comment-17955226 ] ASF GitHub Bot commented on TIKA-4357: -- tballison merged PR #2228: URL: https://githu

[jira] [Commented] (TIKA-4426) Add "img:" prefix to unknown image metadata keys

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955225#comment-17955225 ] ASF GitHub Bot commented on TIKA-4426: -- tballison merged PR #2227: URL: https://githu

[jira] [Commented] (TIKA-4410) Improve feature extraction from xlsx

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955224#comment-17955224 ] ASF GitHub Bot commented on TIKA-4410: -- tballison merged PR #2226: URL: https://githu

[jira] [Commented] (TIKA-4357) Ensure namespace prefixes in metadata keys in 4.x

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955223#comment-17955223 ] ASF GitHub Bot commented on TIKA-4357: -- tballison opened a new pull request, #2228: U

[jira] [Commented] (TIKA-4426) Add "img:" prefix to unknown image metadata keys

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955218#comment-17955218 ] ASF GitHub Bot commented on TIKA-4426: -- tballison opened a new pull request, #2227: U

[jira] [Commented] (TIKA-4425) Add gps timestamp as a normalized metadata field

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955206#comment-17955206 ] ASF GitHub Bot commented on TIKA-4425: -- tballison merged PR #2225: URL: https://githu

[jira] [Commented] (TIKA-4410) Improve feature extraction from xlsx

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955205#comment-17955205 ] ASF GitHub Bot commented on TIKA-4410: -- tballison opened a new pull request, #2226: U

[jira] [Commented] (TIKA-4425) Add gps timestamp as a normalized metadata field

2025-05-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955201#comment-17955201 ] ASF GitHub Bot commented on TIKA-4425: -- tballison opened a new pull request, #2225: U

[jira] [Commented] (TIKA-4318) Fix javadoc aggregate in 3.x

2025-05-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17954799#comment-17954799 ] ASF GitHub Bot commented on TIKA-4318: -- tballison commented on PR #: URL: https:/

[jira] [Commented] (TIKA-4318) Fix javadoc aggregate in 3.x

2025-05-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17954792#comment-17954792 ] ASF GitHub Bot commented on TIKA-4318: -- tballison merged PR #: URL: https://githu

[jira] [Commented] (TIKA-4318) Fix javadoc aggregate in 3.x

2025-05-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17954787#comment-17954787 ] ASF GitHub Bot commented on TIKA-4318: -- nddipiazza opened a new pull request, #:

[jira] [Commented] (TIKA-4419) Deal with self-closeable tags handling change in jsoup 1.20.1

2025-05-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17953251#comment-17953251 ] ASF GitHub Bot commented on TIKA-4419: -- tballison merged PR #2217: URL: https://githu

[jira] [Commented] (TIKA-4419) Deal with self-closeable tags handling change in jsoup 1.20.1

2025-05-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17953245#comment-17953245 ] ASF GitHub Bot commented on TIKA-4419: -- tballison closed pull request #2215: TIKA-441

[jira] [Commented] (TIKA-4419) Deal with self-closeable tags handling change in jsoup 1.20.1

2025-05-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17953244#comment-17953244 ] ASF GitHub Bot commented on TIKA-4419: -- tballison opened a new pull request, #2217: U

[jira] [Commented] (TIKA-4419) Deal with self-closeable tags handling change in jsoup 1.20.1

2025-05-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17953237#comment-17953237 ] ASF GitHub Bot commented on TIKA-4419: -- tballison opened a new pull request, #2216: U

[jira] [Commented] (TIKA-4419) Deal with self-closeable tags handling change in jsoup 1.20.1

2025-05-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17953238#comment-17953238 ] ASF GitHub Bot commented on TIKA-4419: -- tballison merged PR #2216: URL: https://githu

[jira] [Commented] (TIKA-4418) Fix title in writeSelectHeadersInBody() for MSG messages

2025-05-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17952647#comment-17952647 ] ASF GitHub Bot commented on TIKA-4418: -- tballison merged PR #2214: URL: https://githu

[jira] [Commented] (TIKA-4419) Try to downgrade jsoup to 1.19.1 for the 3.2.0 release

2025-05-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17952640#comment-17952640 ] ASF GitHub Bot commented on TIKA-4419: -- tballison opened a new pull request, #2215: U

[jira] [Commented] (TIKA-4418) Fix title in writeSelectHeadersInBody() for MSG messages

2025-05-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17952631#comment-17952631 ] ASF GitHub Bot commented on TIKA-4418: -- tballison opened a new pull request, #2214: U

[jira] [Commented] (TIKA-4417) Dependency convergence error with jackcess-encrypt

2025-05-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17952431#comment-17952431 ] ASF GitHub Bot commented on TIKA-4417: -- THausherr merged PR #2206: URL: https://githu

[jira] [Commented] (TIKA-4417) Dependency convergence error with jackcess-encrypt

2025-05-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17952418#comment-17952418 ] ASF GitHub Bot commented on TIKA-4417: -- dafriz opened a new pull request, #2206: URL:

[jira] [Commented] (TIKA-4414) tika-eval-core's jar should not include dependencies

2025-05-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950555#comment-17950555 ] ASF GitHub Bot commented on TIKA-4414: -- tballison merged PR #2198: URL: https://githu

[jira] [Commented] (TIKA-4413) Update tika-eval-app's xlsx writing to Zip64Mode.AlwaysWithCompatibility

2025-05-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950290#comment-17950290 ] ASF GitHub Bot commented on TIKA-4413: -- tballison merged PR #2196: URL: https://githu

[jira] [Commented] (TIKA-4374) Add attachment "file name" to mime_diffs_A_to_B_details.xlsx in tika-eval

2025-05-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950292#comment-17950292 ] ASF GitHub Bot commented on TIKA-4374: -- tballison merged PR #2197: URL: https://githu

[jira] [Commented] (TIKA-4413) Update tika-eval-app's xlsx writing to Zip64Mode.AlwaysWithCompatibility

2025-05-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950276#comment-17950276 ] ASF GitHub Bot commented on TIKA-4413: -- tballison opened a new pull request, #2196: U

[jira] [Commented] (TIKA-4414) tika-eval-core's jar should not include dependencies

2025-05-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950285#comment-17950285 ] ASF GitHub Bot commented on TIKA-4414: -- tballison opened a new pull request, #2198: U

[jira] [Commented] (TIKA-4391) Detect inline images in msg files

2025-05-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950278#comment-17950278 ] ASF GitHub Bot commented on TIKA-4391: -- tballison merged PR #2195: URL: https://githu

[jira] [Commented] (TIKA-4374) Add attachment "file name" to mime_diffs_A_to_B_details.xlsx in tika-eval

2025-05-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950277#comment-17950277 ] ASF GitHub Bot commented on TIKA-4374: -- tballison opened a new pull request, #2197: U

[jira] [Commented] (TIKA-4400) Consider simplifying the build with a sandbox profile

2025-05-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950128#comment-17950128 ] ASF GitHub Bot commented on TIKA-4400: -- tballison merged PR #2183: URL: https://githu

[jira] [Commented] (TIKA-4391) Detect inline images in msg files

2025-05-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950127#comment-17950127 ] ASF GitHub Bot commented on TIKA-4391: -- tballison opened a new pull request, #2195: U

[jira] [Commented] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'

2025-04-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17943764#comment-17943764 ] ASF GitHub Bot commented on TIKA-4406: -- THausherr merged PR #26: URL: https://github.

[jira] [Commented] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'

2025-04-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17943763#comment-17943763 ] ASF GitHub Bot commented on TIKA-4406: -- THausherr opened a new pull request, #26: URL

[jira] [Commented] (TIKA-4399) RUnpackExtractor -- improve stream wrapping

2025-04-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17941956#comment-17941956 ] ASF GitHub Bot commented on TIKA-4399: -- tballison opened a new pull request, #2182: U

[jira] [Commented] (TIKA-4401) Catch jempbox's NumberFormatException

2025-04-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17943194#comment-17943194 ] ASF GitHub Bot commented on TIKA-4401: -- tballison merged PR #2184: URL: https://githu

[jira] [Commented] (TIKA-4395) cannot get any slide content for pptx file

2025-04-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17943198#comment-17943198 ] ASF GitHub Bot commented on TIKA-4395: -- tballison merged PR #2185: URL: https://githu

[jira] [Commented] (TIKA-4395) cannot get any slide content for pptx file

2025-04-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17943189#comment-17943189 ] ASF GitHub Bot commented on TIKA-4395: -- tballison opened a new pull request, #2185: U

[jira] [Commented] (TIKA-4401) Catch jempbox's NumberFormatException

2025-04-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17941993#comment-17941993 ] ASF GitHub Bot commented on TIKA-4401: -- tballison opened a new pull request, #2184: U

[jira] [Commented] (TIKA-4400) Consider simplifying the build with a sandbox profile

2025-04-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17941981#comment-17941981 ] ASF GitHub Bot commented on TIKA-4400: -- tballison opened a new pull request, #2183: U

[jira] [Commented] (TIKA-4399) RUnpackExtractor -- improve stream wrapping

2025-04-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17941976#comment-17941976 ] ASF GitHub Bot commented on TIKA-4399: -- tballison merged PR #2182: URL: https://githu

[jira] [Commented] (TIKA-4393) Thread-safety issue in TikaToXMP.getConverterMap()

2025-03-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17935310#comment-17935310 ] ASF GitHub Bot commented on TIKA-4393: -- tballison merged PR #2163: URL: https://githu

[jira] [Commented] (TIKA-4393) Thread-safety issue in TikaToXMP.getConverterMap()

2025-03-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17935269#comment-17935269 ] ASF GitHub Bot commented on TIKA-4393: -- tballison opened a new pull request, #2163: U

[jira] [Commented] (TIKA-4389) Cleanups for TIKA-4381

2025-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17930778#comment-17930778 ] ASF GitHub Bot commented on TIKA-4389: -- tballison merged PR #2144: URL: https://githu

[jira] [Commented] (TIKA-4389) Cleanups for TIKA-4381

2025-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17930771#comment-17930771 ] ASF GitHub Bot commented on TIKA-4389: -- tballison opened a new pull request, #2144: U

[jira] [Commented] (TIKA-1997) Problem in Tika().detect for xml file signed in CADES

2025-02-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17930593#comment-17930593 ] ASF GitHub Bot commented on TIKA-1997: -- rob975 closed pull request #267: TIKA-1997 Pr

[jira] [Commented] (TIKA-4387) Improve robustness of file extension parsing

2025-02-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17930457#comment-17930457 ] ASF GitHub Bot commented on TIKA-4387: -- tballison merged PR #2143: URL: https://githu

[jira] [Commented] (TIKA-4387) Improve robustness of file extension parsing

2025-02-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17930450#comment-17930450 ] ASF GitHub Bot commented on TIKA-4387: -- tballison opened a new pull request, #2143: U

[jira] [Commented] (TIKA-4381) Improve extraction of metadata from Appointment/Task msgs

2025-02-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17930399#comment-17930399 ] ASF GitHub Bot commented on TIKA-4381: -- tballison merged PR #2142: URL: https://githu

[jira] [Commented] (TIKA-4381) Improve extraction of metadata from Appointment/Task msgs

2025-02-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17930394#comment-17930394 ] ASF GitHub Bot commented on TIKA-4381: -- tballison opened a new pull request, #2142: U

[jira] [Commented] (TIKA-4303) Unable to extract Chinese content in onenote

2025-02-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17929915#comment-17929915 ] ASF GitHub Bot commented on TIKA-4303: -- nddipiazza commented on PR #2098: URL: https:

[jira] [Commented] (TIKA-4385) GDALParser deadlocks while reading gdalinfo output

2025-02-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17927528#comment-17927528 ] ASF GitHub Bot commented on TIKA-4385: -- tballison commented on PR #2126: URL: https:/

[jira] [Commented] (TIKA-4385) GDALParser deadlocks while reading gdalinfo output

2025-02-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17927527#comment-17927527 ] ASF GitHub Bot commented on TIKA-4385: -- tballison merged PR #2126: URL: https://githu

[jira] [Commented] (TIKA-4385) GDALParser deadlocks while reading gdalinfo output

2025-02-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17927152#comment-17927152 ] ASF GitHub Bot commented on TIKA-4385: -- THausherr commented on PR #2126: URL: https:/

[jira] [Commented] (TIKA-4385) GDALParser deadlocks while reading gdalinfo output

2025-02-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17927142#comment-17927142 ] ASF GitHub Bot commented on TIKA-4385: -- lsliwko commented on PR #2126: URL: https://g

[jira] [Commented] (TIKA-4385) GDALParser deadlocks while reading gdalinfo output

2025-02-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17927080#comment-17927080 ] ASF GitHub Bot commented on TIKA-4385: -- lsliwko commented on PR #2126: URL: https://g
