[ https://issues.apache.org/jira/browse/TIKA-4361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921819#comment-17921819 ]
Hudson commented on TIKA-4361: ------------------------------ SUCCESS: Integrated in Jenkins build Tika ยป tika-main-jdk17 #620 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk17/620/]) TIKA-4361 -- follow on fix (#2108) (github: [https://github.com/apache/tika/commit/05db89db3d24c13ee8794999c18af3a0b9a094e5]) * (edit) tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/rtf/TextExtractor.java > Rare RTF bug handling styles within an href in a malformed file > --------------------------------------------------------------- > > Key: TIKA-4361 > URL: https://issues.apache.org/jira/browse/TIKA-4361 > Project: Tika > Issue Type: Improvement > Reporter: Tim Allison > Assignee: Tim Allison > Priority: Trivial > Fix For: 4.0.0, 3.1.0 > > > If there's a <b> within a hyperlink, and there's an extra closing }, then the > RTF parser closes the <b> without opening it, leading to a SAXException. > In other parts of the code, we check to make sure that we're not in a > hyperlink before we modify the styles. we should do that in this piece of > code as well. -- This message was sent by Atlassian Jira (v8.20.10#820010)