[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17922453#comment-17922453 ] Tim Allison commented on TIKA-4373: --- jsoup 1.18.3 stops parsing around the following pro

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17922452#comment-17922452 ] Tim Allison commented on TIKA-4373: --- Y. This is a difference in jsoup. I ran the extract

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17922444#comment-17922444 ] Tim Allison commented on TIKA-4373: --- [~tilman] thank you. I'm looking now. I don't think

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-30 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17922306#comment-17922306 ] Tilman Hausherr commented on TIKA-4373: --- I found some huge differences with some HTM

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-29 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1790#comment-1790 ] Tim Allison commented on TIKA-4373: --- I finished the regression tests on 3.0.0 vs 3.1.0-r

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-29 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1791#comment-1791 ] Tim Allison commented on TIKA-4373: --- bq. Y, I need to update tika-eval to include the at

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921797#comment-17921797 ] Tim Allison commented on TIKA-4373: --- I reopened TIKA-4337 for a trivial xps improvement.

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921788#comment-17921788 ] Tim Allison commented on TIKA-4373: --- Y, I need to update tika-eval to include the attach

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921789#comment-17921789 ] Tim Allison commented on TIKA-4373: --- I reopened TIKA-4361.

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-28 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921763#comment-17921763 ] Tilman Hausherr commented on TIKA-4373: --- I found only one json file, which is 2PSMEF

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921761#comment-17921761 ] Tim Allison commented on TIKA-4373: --- Handful of text->json files I reviewed looks like a

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921759#comment-17921759 ] Tim Allison commented on TIKA-4373: --- W00t! And that's why I wanted to look at a few of t

[jira] [Commented] (TIKA-4373) Regression tests for 3.1.0 release

2025-01-28 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921756#comment-17921756 ] Tilman Hausherr commented on TIKA-4373: --- [^S53SZFZ2FBOZIVTX3HVP4D4XKHKPEMQQ.csv] is