Hello,

 

I recently emailed several of you and, as you suggested, I am posting my 
questions on this forum. 

I am on a software development team, and one of our projects requires OCR, 
which is why we are using Tesseract. 

 

I have several questions about the future of the Tesseract project. We are 
currently using Tesseract 3.04 and are very interested in the user patterns 
feature, but after a lot of research online we found that it does not work 
properly (see, for example, 
https://github.com/tesseract-ocr/tesseract/issues/960 ). We hoped the 
problem had been fixed in the 4.0.0.alpha version, but that does not seem to 
be the case. We also read that the whitelist feature, which we need, no 
longer works either ( 
https://github.com/tesseract-ocr/tesseract/issues/751 ).

From the GitHub repo, it seems that even though 4.0.0.alpha is old (2016), 
the project is still alive. We would like to know whether the 4.0.0.beta 
that came out 23 days ago fixes these problems. If not, could you tell us 
whether your team plans to fix them? Is 4.0.0.beta stable enough to be used 
in production software? If not, do you think we can expect a stable release 
in the coming months?

 

 

In addition, we are trying to make our system more robust against noise in 
images, and we believe we could achieve that by tuning some of Tesseract's 
parameters, but the list we found ( 
https://github.com/naptha/tesseract.js/blob/master/docs/tesseract_parameters.md 
) is not well documented. Is there more comprehensive documentation for 
them?

 

Thanks,

Best regards,

 

Thomas BRIENNE.
