Hi Chris,
Good to hear from you. I don't know if it would help at all, but I'm
planning to add chart support to Tika soon(ish). I haven't yet opened the
ticket on Tika's JIRA, so please open one there too if that would be of any use
to you.
Best,
Tim
-----Original Message-----
Okay, so I've got an HWPFDocument that I'm converting to an HTML file using
the WordToHtmlConverter class. After processing the document, the resulting
page has some encoding problems.
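For reference, here is a minimal sketch of the serialization step only, which is one place where the output encoding can be set explicitly. It assumes the converter's result is available as an org.w3c.dom.Document (as WordToHtmlConverter.getDocument() should return). The class name, sampleDocument() and toHtml() are hypothetical names for illustration, and the sample DOM tree is just a stand-in for real converter output. It may not be the cause of the gibberish, but it is cheap to rule out.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.Charset;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

final class HtmlEncodingSketch {

    // Hypothetical stand-in for the DOM tree a Word-to-HTML converter would hand back.
    static Document sampleDocument() {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            Element html = doc.createElement("html");
            Element body = doc.createElement("body");
            Element list = doc.createElement("ul");
            Element item = doc.createElement("li");
            item.setTextContent("\u2022 first item"); // non-ASCII bullet character
            list.appendChild(item);
            body.appendChild(list);
            html.appendChild(body);
            doc.appendChild(html);
            return doc;
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    // Serializes the DOM as HTML, stating the output encoding explicitly
    // instead of relying on the platform default.
    static String toHtml(Document doc, String encoding) {
        try {
            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.setOutputProperty(OutputKeys.METHOD, "html");
            transformer.setOutputProperty(OutputKeys.ENCODING, encoding);
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            transformer.transform(new DOMSource(doc), new StreamResult(out));
            // Decode with the same charset that was used for writing.
            return new String(out.toByteArray(), Charset.forName(encoding));
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(toHtml(sampleDocument(), "UTF-8"));
    }
}
```

The point of the sketch is that the bytes are written and read back with the same, explicitly named charset. A mismatch between the two is one common source of garbled characters.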
Some bullet points, for instance, are showing up as gibberish. I've tried encoding
with multiple character sets,