[ 
https://issues.apache.org/jira/browse/TIKA-4361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17921819#comment-17921819
 ] 

Hudson commented on TIKA-4361:
------------------------------

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk17 #620 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk17/620/])
TIKA-4361 -- follow on fix (#2108) (github: 
[https://github.com/apache/tika/commit/05db89db3d24c13ee8794999c18af3a0b9a094e5])
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/rtf/TextExtractor.java


> Rare RTF bug handling styles within an href in a malformed file
> ---------------------------------------------------------------
>
>                 Key: TIKA-4361
>                 URL: https://issues.apache.org/jira/browse/TIKA-4361
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Trivial
>             Fix For: 4.0.0, 3.1.0
>
>
> If there's a <b> within a hyperlink, and the file contains an extra closing },
> then the RTF parser closes the <b> without ever opening it, leading to a
> SAXException.
> In other parts of the code, we check to make sure that we're not in a 
> hyperlink before we modify the styles. We should do that in this piece of 
> code as well.
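
The guard described above can be illustrated with a minimal sketch. This is not the code in TextExtractor.java; the class, method, and field names below (HyperlinkStyleGuardSketch, openBold, closeBold, inHyperlink) are hypothetical and exist only to show the idea of skipping style changes while inside a hyperlink, so that a stray closing } cannot emit an unmatched </b>.

```java
import java.util.ArrayList;
import java.util.List;

/**
 * Hypothetical illustration only; this is NOT Tika's TextExtractor.
 * It records the tags a parser would emit and skips style tags
 * whenever the parser is inside a hyperlink.
 */
public class HyperlinkStyleGuardSketch {
    // Tags that would be passed on to the SAX handler, in order.
    private final List<String> events = new ArrayList<>();
    // Assumed flag: true between the start and end of a hyperlink.
    private boolean inHyperlink = false;

    public void startHyperlink() {
        inHyperlink = true;
        events.add("<a>");
    }

    public void endHyperlink() {
        inHyperlink = false;
        events.add("</a>");
    }

    public void openBold() {
        if (inHyperlink) {
            return; // guard: never modify styles inside a hyperlink
        }
        events.add("<b>");
    }

    public void closeBold() {
        if (inHyperlink) {
            return; // the same guard on the closing path
        }
        events.add("</b>");
    }

    public List<String> events() {
        return events;
    }
}
```

With the guard on both paths, a malformed sequence such as "start hyperlink, bold on, bold off, extra bold off, end hyperlink" produces only the hyperlink tags, so no </b> is emitted without a matching <b>.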



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
