On Sun, Dec 02, 2012 at 08:04:50AM -0800, Benito2313 wrote: > I got a HTML output, its getting there. But is it possible to get the hocr to > give an XML output?
What is it that you're trying to do? HTML is an XML dialect, after all (or can be, if XHTML). You should be able to parse it with all XML tools. The only way to get a different XML representation would be to either delve into the API, or convert the hOCR to something more to your liking. But hOCR is *the* XMLish OCR output standard; I don't see why you'd want anything else. Nick -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

