[tesseract-ocr] "Line separator regions" capabilities?

2022-04-27 Thread Brad
For V5.10.0 of Tesseract, one of the changes is:

> Handle image and line separator regions in ALTO, hOCR and text output formats.

I'm curious about what this means. Can Tesseract be used to identify rectangles and such on an image that might surround a text region, and if so, is this what this

Re: [tesseract-ocr] "Line separator regions" capabilities?

2022-04-27 Thread Merlijn B.W. Wajer
Hi,

On 27/04/2022 19:07, Brad wrote:
> For V5.10.0 of Tesseract, one of the changes is:

(correction: version 5.1.0)

>> Handle image and line separator regions in ALTO, hOCR and text output formats.
>
> I'm curious about what this means. Can Tesseract be used to identify rectangles and such on an im

Re: [tesseract-ocr] "Line separator regions" capabilities?

2022-04-27 Thread Brad
Thanks for the information, Merlijn. Will take a look at some of the links you posted.

On Wednesday, April 27, 2022 at 10:18:39 AM UTC-7 Merlijn Wajer wrote:
> Hi,
>
> On 27/04/2022 19:07, Brad wrote:
> > For V5.10.0 of Tesseract, one of the changes is:
>
> (correction: version 5.1.0)
>
> >> Han