[tesseract-ocr] TessNet2 and hOCR/XML output

2018-10-25 Thread Joel Christner
Hi all, Is it possible to get hOCR/XML output directly from TessNet2? From what I see the library will only return List. Or, is there a serialization pattern I should follow? Thanks! -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsu

[tesseract-ocr] Training tesseract 4.0

2018-10-25 Thread Nikhil Kumar
combine_lang_model --input_unicharset /home/nikhil/Desktop/eng.unicharset --script_dir Training_currency/ --output_dir /nikhilanthe \
  --pass_through_recoder \
  --lang yourmodelname
Loaded unicharset of size 112 from file /home/nikhil/Desktop/eng.unicharset
Setting unichar properties
Mi

[tesseract-ocr] Re: Install Tesseract 4 on CentOS and Red Hat [SOLVED!]

2018-10-25 Thread shree
Hi Alex,
Do you have a package for Fedora 28 for tesseract 4?
On Wednesday, April 25, 2018 at 12:47:15 PM UTC-4, Александр Поздняков wrote:
>
> for CentOS
>
>> yum-config-manager --add-repo
>> https://download.opensuse.org/repositories/home:/Alexander_Pozdnyakov/CentOS_7/
>> yum update
>> yum

[tesseract-ocr] tesseract generating box files

2018-10-25 Thread Nikhil Kumar
How can I loop through a thousand documents to generate a box file for each one? Please help me.
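
One possible way to approach the batch question above, offered as a rough sketch and not as an answer from the list: walk a folder and call the tesseract command line once per image. The helper names (box_command, make_box_files), the list of image suffixes, and the default language are assumptions made for illustration. The config names batch.nochop and makebox are the ones used for classic box-file generation; a different config may be more appropriate for LSTM training data, so they are left as a parameter.

```python
import subprocess
from pathlib import Path
from typing import List, Sequence

# Assumed set of image types to process; adjust to match the documents.
IMAGE_SUFFIXES = {".tif", ".tiff", ".png", ".jpg", ".jpeg"}


def box_command(image: Path, lang: str = "eng",
                configs: Sequence[str] = ("batch.nochop", "makebox")) -> List[str]:
    """Build the tesseract call for one image (hypothetical helper)."""
    # tesseract appends the extension itself, so pass the path without a suffix.
    output_base = image.with_suffix("")
    return ["tesseract", str(image), str(output_base), "-l", lang, *configs]


def make_box_files(folder: str, lang: str = "eng",
                   dry_run: bool = False) -> List[List[str]]:
    """Run (or, with dry_run=True, only list) one tesseract call per image."""
    commands = []
    for image in sorted(Path(folder).iterdir()):
        if image.suffix.lower() not in IMAGE_SUFFIXES:
            continue  # skip anything that is not an image
        cmd = box_command(image, lang)
        commands.append(cmd)
        if not dry_run:
            # check=True stops the loop on the first failing image.
            subprocess.run(cmd, check=True)
    return commands
```

Calling make_box_files("scans", dry_run=True) first prints nothing and runs nothing, but returns the commands that would be executed, which makes it easy to check the paths before processing all the documents.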

[tesseract-ocr] Image Pre Processing

2018-10-25 Thread ProgressNotPerfection
Can anyone suggest an image processing technique that could be applied to the following image to clean it prior to sending to Tesseract? [image: Name.png] Ideally (but not necessarily) using OpenCV. Thanks Jim