[tesseract-ocr] Re: tesseract PDF line offset in 4.0.0 alpha.

2018-02-19 Thread DJArty
What exactly pdf viewer / rendered you use? Did you try another one? > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroup

[tesseract-ocr] Lost spaces in some pdf renderers

2018-02-19 Thread DJArty
Attached pdf OCRed by ocrmypdf using tesseract 4.00.00alpha Linux 4.13.0-32-generic #35~16.04.1-Ubuntu SMP x86_64 x86_64 x86_64 GNU/Linux In some pdf viewers (Evince, Chrome, Opera) all ok but in other (Firefox, Alfresco Share, pdfjs) not so good - lost spaces between the words. So text "Test