RE: Question about extracting text from charts in XLSX

2017-01-23 Thread Allison, Timothy B.
Hi Chris, Good to hear from you. I don't know if it would help at all, but I'm planning to add chart support to Tika soon(ish). I haven't yet opened the ticket on Tika's JIRA, so please open one there too if that would be of any use to you. Best, Tim

HTML Converter Encoding Issue

2017-01-23 Thread Huda Yousif
Okay, so I've got an HWPFDocument that I'm converting to an HTML file using the WordToHtmlConverter class. After processing the document, the resulting page has some of its encoding messed up. Some bullet points, for instance, show up as gibberish. I've tried encoding with multiple character sets,