Re: [Bug 623438] Re: Font size not correct in merged sandvich PDF

2011-08-10 Thread Igor Filippov
To be fair there are also OCRAD, GOCR, and Tesseract. Igor On Wed, 2011-08-10 at 08:53 +, Martin Wildam wrote: > On Mon, Aug 8, 2011 at 09:40, Jussi Pakkanen > wrote: > > I'd like to remind everyone that Cuneiform is currently unmaintained. > > No-one is working on this or any other bug. >

Re: [Bug 623438] Re: Font size not correct in merged sandvich PDF

2011-08-10 Thread Martin Wildam
On Mon, Aug 8, 2011 at 09:40, Jussi Pakkanen wrote: > I'd like to remind everyone that Cuneiform is currently unmaintained. > No-one is working on this or any other bug. Sad, but I had such an impression already. As far as I can see the one and only OCR option for Linux and Ubuntu that runs stabl

Re: [Bug 623438] Re: Font size not correct in merged sandvich PDF

2010-10-17 Thread Jakub Wilk
>Example: >class='ocr_xword' id='xword_1' title="bbox 0 0 20 20">title="x_bboxes b1x0 b1y0 b1x1 b1y1 b2x0 ...">hello >class='ocr_cinfo' title="x_bboxes b1x0 b1y0 b1x1 b1y1 b2x0 >...">world >(note the whitespace which is not part of any ocr_xword as cuneiform will >produce an incorrect bbox for i

Re: [Bug 623438] Re: Font size not correct in merged sandvich PDF

2010-10-17 Thread Jakub Wilk
>I find the specification somewhat difficult to interpret at times but >it is my understanding that character bbox info goes within the >ocr_line tag element. whether it goes before or after the textual >elements is irrelevant. E.g. > > BYGGNADER > >