[ https://issues.apache.org/jira/browse/TIKA-4338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17892853#comment-17892853 ]
Hudson commented on TIKA-4338:
------------------------------

SUCCESS: Integrated in Jenkins build Tika » tika-branch_3x-jdk11 #1855 (See [https://ci-builds.apache.org/job/Tika/job/tika-branch_3x-jdk11/1855/])
Tika-4338 -- remove tagsoup entirely (#2011) (tallison: [https://github.com/apache/tika/commit/2bb46241f13c7d3d63dd36f5bb88e910eee7ff8c])
* (edit) tika-bundles/tika-bundle-standard/pom.xml
* (edit) tika-parent/pom.xml
* (edit) tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-code-module/src/main/java/org/apache/tika/parser/code/SourceCodeParser.java
* (edit) tika-eval/tika-eval-core/src/main/java/org/apache/tika/eval/core/util/ContentTagParser.java
* (edit) tika-eval/tika-eval-app/src/test/java/org/apache/tika/eval/app/SimpleComparerTest.java
* (edit) tika-server/tika-server-eval/pom.xml
* (edit) tika-eval/tika-eval-core/pom.xml
* (edit) tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-code-module/pom.xml
* (edit) tika-bom/pom.xml
* (edit) tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/chm/ChmParser.java
* (edit) tika-eval/tika-eval-app/pom.xml
* (edit) tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-code-module/src/test/java/org/apache/tika/parser/code/SourceCodeParserTest.java
* (edit) tika-core/src/main/java/org/apache/tika/sax/xpath/MatchingContentHandler.java


> Remove use of EOL component TagSoup 1.2.1 from tika-parser-code-module
> ----------------------------------------------------------------------
>
>                 Key: TIKA-4338
>                 URL: https://issues.apache.org/jira/browse/TIKA-4338
>             Project: Tika
>          Issue Type: Bug
>   Affects Versions: 3.0.0
>            Reporter: Sandeep Kulkarni
>            Priority: Major
>             Fix For: 4.0.0, 3.1.0
>
>
> As per the release notes for Tika 3.0.0, TagSoup is mentioned as replaced
> with JSoup. I had requested for its removal earlier in TIKA-4109.
> So I integrated Tika 3.0.0 and found that TagSoup is still shown as one of
> the dependency components of tika-parser-code-module. It seems to have been
> removed only from tika-parser-html-module.
> So is it possible to completely get rid of TagSoup from Tika, as it is EOL?
> tika-parser-code-module has a dependency on
> *org.ccil.cowan.tagsoup:tagsoup:jar:1.2.1.*



--
This message was sent by Atlassian Jira
(v8.20.10#820010)