characterRun.getSymbolChar() returns the same char for different symbols

2017-10-05 Thread Teresa Kim
Dear POI users, I have a doc document that contains an uncommon Greek mu and a registered symbol, and I tried to use the characterRun.getSymbolChar() method to identify these two symbols. I have noticed, however, that characterRun.getSymbolChar() always returns the same character, such that I could not find a

Apache POI doc/docx parser

2019-10-05 Thread Teresa Kim
Hi, I have documents (either 'doc' or 'docx') that contain a special character for 'greater than or equal to', and using the code in 'WordToHtmlConverter', I see those characters converted into '('. I tried the latest Apache POI release, 4.1.0. My Java code is: public class TestWordtoHtmlConve

Re: Apache POI doc/docx parser

2019-10-06 Thread Teresa Kim
? Thanks... Dominik. On Sun, Oct 6, 2019 at 6:48 AM Teresa Kim wrote: Hi, I have documents (either 'doc' or 'docx') that contain a special character for 'greater than or equal to', and using the code in 'WordToHtmlConverter', I see those characters converted into