[jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165961#comment-17165961 ] Hudson commented on TIKA-3147: -- SUCCESS: Integrated in Jenkins build tika-main-jdk14 #6 (See

[jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165955#comment-17165955 ] Hudson commented on TIKA-3147: -- SUCCESS: Integrated in Jenkins build tika-branch-1x-jdk8 #353

[jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165942#comment-17165942 ] Hudson commented on TIKA-3147: -- SUCCESS: Integrated in Jenkins build tika-main-jdk8 #1835 (Se

[jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165943#comment-17165943 ] Hudson commented on TIKA-3147: -- SUCCESS: Integrated in Jenkins build tika-main-jdk11 #1016 (S

[jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165906#comment-17165906 ] Hudson commented on TIKA-3147: -- UNSTABLE: Integrated in Jenkins build tika-branch-1x-jdk8 #35

[jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165901#comment-17165901 ] Hudson commented on TIKA-3147: -- SUCCESS: Integrated in Jenkins build tika-main-jdk11 #1015 (S

[jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165891#comment-17165891 ] Hudson commented on TIKA-3147: -- SUCCESS: Integrated in Jenkins build tika-main-jdk8 #1834 (Se

[jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165879#comment-17165879 ] Hudson commented on TIKA-3147: -- UNSTABLE: Integrated in Jenkins build tika-main-jdk14 #5 (See

[jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165878#comment-17165878 ] Hudson commented on TIKA-3147: -- UNSTABLE: Integrated in Jenkins build tika-main-jdk8-windows

[jira] [Resolved] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3147. Fix Version/s: 1.25. Resolution: Fixed. > Strip punctuation in lang id component within tika-eval

[jira] [Updated] (TIKA-3147) Strip punctuation in lang id component within tika-eval

2020-07-27 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-3147: -- Summary: Strip punctuation in lang id component within tika-eval (was: String punctuation in lang id co

[jira] [Commented] (TIKA-3148) Remove apache-cxf dependency from tika-parsers

2020-07-27 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165865#comment-17165865 ] Tim Allison commented on TIKA-3148: --- https://github.com/apache/tika/tree/2.x > Remove a

[jira] [Commented] (TIKA-3148) Remove apache-cxf dependency from tika-parsers

2020-07-27 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165863#comment-17165863 ] Tim Allison commented on TIKA-3148: --- The Grobid journal parser requires jaxrs. There ar

[jira] [Commented] (TIKA-3147) String punctuation in lang id component within tika-eval

2020-07-27 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165801#comment-17165801 ] Tim Allison commented on TIKA-3147: --- With that fixed, though {{The quick brown fox &&^&%

[jira] [Commented] (TIKA-3147) String punctuation in lang id component within tika-eval

2020-07-27 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165795#comment-17165795 ] Tim Allison commented on TIKA-3147: --- Bug found...not initializing with normalizers. Yike

[jira] [Commented] (TIKA-3147) String punctuation in lang id component within tika-eval

2020-07-27 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165785#comment-17165785 ] Tim Allison commented on TIKA-3147: --- Turns out that it requires a numeral _and_ the semi

[jira] [Commented] (TIKA-3147) String punctuation in lang id component within tika-eval

2020-07-27 Thread Kenneth William Krugler (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165745#comment-17165745 ] Kenneth William Krugler commented on TIKA-3147: --- Hi [~tallison] - can you in