Hi Many thanks for your answers.
I've checked pending milestones of v 5.0.0 (https://github.com/tesseract-ocr/tesseract/milestone/6), and from the 17 pending ones, I see that some of them come from v4, and others are related to training or languages different from english or spanish, . So the only issues that could really affect my project are these ones: https://github.com/tesseract-ocr/tesseract/issues/3109 https://github.com/tesseract-ocr/tesseract/issues/3473 These two issues are only related to performance. As v5 runs 3 or 4 times faster in my tests, I don't see any reason for not using current beta version in production environments. We also have the experience that the archive.org project has had with v5, which is very positive. I think I can try to convince my IT infraestructure team with these arguments. In any case I think I can also wait for the release of 5.0.0 if it doesn´t take months to be delivered. Regards Juan Carlos El martes, 19 de octubre de 2021 a las 18:37:09 UTC+2, Merlijn Wajer escribió: > Hi, > > On 19/10/2021 16:47, Lorenzo Bolzani wrote: > > Hi Merlijn, > > out of curiosity, did you note an impovement over the previous version? > > Yes. Speed and stability is better, and accuracy is also up (IMHO). See > (for example) this link: > https://github.com/tesseract-ocr/tesseract/pull/3141 > > Regards, > Merlijn > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/526b4c7c-e2f9-46a4-b9e5-de20f6ba9b98n%40googlegroups.com.