Re: [tesseract-ocr] Multiple jpg files into 1 editable pdf

2019-05-08 Thread Shree Devi Kumar
See https://github.com/tesseract-ocr/tesseract/wiki/FAQ#how-to-process-multiple-images-in-a-single-run On Thu, May 9, 2019 at 9:34 AM Brian Lallo wrote: > Thanks for reading. > > I will be taking 200 pages and scanning them into a folder on my computer. > > I need to take those 200 pages (that
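As a rough sketch of the single-run approach that FAQ entry covers: tesseract accepts a plain text file listing one image path per line as its input, and the `pdf` config asks for a searchable PDF (image plus text layer, which is not the same thing as a freely editable document). The helper name `build_batch_command` and the list file name `images.txt` are made up for illustration; it only prepares the list file and the command and does not run tesseract itself.

```python
import os


def build_batch_command(image_dir, output_base, list_name="images.txt"):
    """Write a list file of JPEG paths and return (pages, tesseract argv)."""
    # Sort by file name so the page order follows the scan names.
    # Plain string sorting puts "page10" before "page2", so zero-padded
    # names (page_001.jpg, page_002.jpg, ...) are assumed here.
    pages = sorted(
        os.path.join(image_dir, name)
        for name in os.listdir(image_dir)
        if name.lower().endswith((".jpg", ".jpeg"))
    )
    list_path = os.path.join(image_dir, list_name)
    with open(list_path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(pages) + "\n")
    # The trailing "pdf" selects PDF output; tesseract writes <output_base>.pdf.
    return pages, ["tesseract", list_path, output_base, "pdf"]
```

The returned list can be passed to `subprocess.run(cmd, check=True)`, assuming the `tesseract` executable is on the PATH.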

Re: [tesseract-ocr] Multiple jpg files into 1 editable pdf

2019-05-08 Thread farhad khalafi
One option is to first convert the 200 images into a single PDF file and then run the PDF through TesseractStudio.Net, which will OCR all pages and generate a searchable PDF file with various options. Make sure that the scanning is in at least 200 (preferably 300) D

[tesseract-ocr] Multiple jpg files into 1 editable pdf

2019-05-08 Thread Brian Lallo
Thanks for reading. I will be taking 200 pages and scanning them into a folder on my computer. I need to take those 200 pages (that will be saved as .jpg) and turn them into 1 single editable pdf document. Any help appreciated.

Re: [tesseract-ocr] how to

2019-05-08 Thread ניב27 יניב . ס
I used Java on Android and tried Otsu, adaptive threshold, and sauvolaBinarizeTiled, but none of them gave me good results. I also tried OpenCV, with the same poor results. On Wednesday, May 8, 2019 at 16:14:54 UTC+3, zdenop wrote: > > 1. provide your image if you want help > 2. tesseract is not libra

Re: [tesseract-ocr] how to

2019-05-08 Thread Zdenko Podobny
1. Provide your image if you want help. 2. Tesseract is not a library for image enhancement; you need to use another library for that (depending on the programming language you used to create your app). Zdenko On Wed, May 8, 2019 at 15:11, ניב27 יניב.ס wrote: > hello everyone > > i have small app for

[tesseract-ocr] how to

2019-05-08 Thread ניב27 יניב . ס
Hello everyone. I have a small app for OCR. I resize the image, set it to 300 dpi, and set rgb to, but I want to ask how to clean up the image (noise, blur) for tesseract, because I do not get good results.

[tesseract-ocr] Re: I have a problem with the current tesseract

2019-05-08 Thread zdenop
What is the reason for showing us your code? You are running tesseract from the shell, so... If you have a problem with the tesseract output (I guess), you will need traineddata for the MICR font. Zdenko On Wednesday, May 8, 2019 at 8:02:56 UTC+2, Pkumar .. wrote: > > I have a problem with the current tessera

Re: [tesseract-ocr] How to extract text for processing by tesseract v4?

2019-05-08 Thread Lorenzo Bolzani
Hi, you can try a few things, but you need to write a small script (Python, etc.) or use ImageMagick. I suggest first trying with GIMP to find what works best, and then writing the code. You want dark text on a light background. For white text on red: 1. Invert the image. Desaturate. Increase contrast.
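The three operations named for white text on red (invert, desaturate, increase contrast) can be sketched in plain Python on a tiny image held as nested lists of RGB tuples. This is an illustration only: the function names are hypothetical, the luma weights are the common Rec. 601 ones, and "increase contrast" is implemented here as a simple min-max stretch, which is one possible choice rather than the method from the post. A real script would do the same steps with ImageMagick or an imaging library, as suggested above.

```python
def invert(pixels):
    """Invert every RGB channel (white becomes black, red becomes cyan)."""
    return [[(255 - r, 255 - g, 255 - b) for r, g, b in row] for row in pixels]


def desaturate(pixels):
    """Convert RGB to a single gray value using Rec. 601 luma weights."""
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in row]
        for row in pixels
    ]


def stretch_contrast(gray):
    """Rescale gray values so the darkest maps to 0 and the brightest to 255."""
    lo = min(min(row) for row in gray)
    hi = max(max(row) for row in gray)
    if hi == lo:
        # Flat image: nothing to stretch.
        return [row[:] for row in gray]
    scale = 255 / (hi - lo)
    return [[round((value - lo) * scale) for value in row] for row in gray]


RED = (255, 0, 0)
WHITE = (255, 255, 255)

# 2x2 sample: red background with one white "text" pixel.
sample = [[RED, WHITE], [RED, RED]]
result = stretch_contrast(desaturate(invert(sample)))
```

In `result` the former white text pixel ends up at 0 (black) and the former red background at 255 (white), which is the dark-text-on-light-background form the post recommends for OCR.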