Hi,

While browsing the Tika Jira, I found an open issue about smart tags
in DOCX documents:
https://issues.apache.org/jira/browse/TIKA-526

I tracked it down to org.apache.poi.xwpf.usermodel.XWPFParagraph,
which ignores child elements of type CTSmartTagRun.

I have a patch that recursively collects the text from smart tags,
which fixes the Tika issue.

Smart tags can be nested, as in this excerpt:

<w:smartTag w:uri="urn:schemas-microsoft-com:office:smarttags"
            w:element="PlaceName">
        <w:smartTag w:uri="urn:schemas-microsoft-com:office:smarttags"
                    w:element="place">
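
To illustrate the recursive idea, here is a minimal sketch. It is not the
patch itself: it uses the JDK's DOM parser instead of POI's XMLBeans
classes, and the names SmartTagTextSketch, collectText and appendText are
made up for this example. It assumes the paragraph XML uses the standard
WordprocessingML namespace and only looks at w:r, w:t and w:smartTag.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

// Hypothetical helper, for illustration only; not part of POI.
class SmartTagTextSketch {
    // Standard WordprocessingML main namespace.
    static final String W_NS =
            "http://schemas.openxmlformats.org/wordprocessingml/2006/main";

    // Parses a single <w:p> element and returns the text of its runs,
    // including runs nested inside any depth of <w:smartTag> elements.
    public static String collectText(String paragraphXml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            Element paragraph = factory.newDocumentBuilder()
                    .parse(new InputSource(new StringReader(paragraphXml)))
                    .getDocumentElement();
            StringBuilder text = new StringBuilder();
            appendText(paragraph, text);
            return text.toString();
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse paragraph XML", e);
        }
    }

    // Walks the direct children in document order: runs contribute their
    // <w:t> text, smart tags are descended into recursively.
    private static void appendText(Element parent, StringBuilder text) {
        for (Node child = parent.getFirstChild(); child != null;
                child = child.getNextSibling()) {
            if (!isWordElement(child)) {
                continue;
            }
            String name = child.getLocalName();
            if ("r".equals(name)) {
                for (Node t = child.getFirstChild(); t != null;
                        t = t.getNextSibling()) {
                    if (isWordElement(t) && "t".equals(t.getLocalName())) {
                        text.append(t.getTextContent());
                    }
                }
            } else if ("smartTag".equals(name)) {
                appendText((Element) child, text);
            }
        }
    }

    private static boolean isWordElement(Node node) {
        return node.getNodeType() == Node.ELEMENT_NODE
                && W_NS.equals(node.getNamespaceURI());
    }
}
```

Calling SmartTagTextSketch.collectText(...) on a paragraph whose runs sit
inside nested smart tags returns their text in document order; without
the smartTag branch, that text would be skipped.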

However, I noticed that there are no tests at all for hwpf, which
makes me less than confident in my change :-)

a) Is it correct that there are no test cases here?
C:\repo\poi\src\testcases\org\apache\poi\hwpf
b) Whom can I contact to get feedback on my patch?
c) If it is confirmed that no tests exist, can my patch still get
into beta5?

Regards,
Fabian

--
Fabian Lange | Leiter Competence Center Performance

codecentric AG | Merscheider Straße 1 | 42699 Solingen | Deutschland
tel: +49 (0) 212.23362821 | fax: +49 (0) 212.23362879 | mobil: +49 (0)
160.3673393
www.codecentric.de | blog.codecentric.de | www.meettheexperts.de |
www.more4fi.de

Sitz der Gesellschaft: Düsseldorf | HRB 63043 | Amtsgericht Düsseldorf
Vorstand: Klaus Jäger (Vorsitzender) | Mirko Novakovic . Rainer Vehns
Aufsichtsrat: Patric Fedlmeier (Vorsitzender) . Bernd Klinkmann . Jürgen Schütz
