Hello, Have you found any information regarding the architecture of v5 or v4? I'm searching as well to understand how it works.
Best regards. Le lundi 30 mai 2022 à 19:55:54 UTC-4, giridhar...@gmail.com a écrit : > I am looking to understand the architecture of OCR pipeline in tesseract > v5.0.1 to know about *the preprocessing that happen before the LSTM > network during inference and training*. > > I could only find these 7 year old documentation notes ( > https://github.com/tesseract-ocr/docs/tree/main/das_tutorial2016) and I > am not sure if they are still accurate. > > 1. Is the information I am looking for present anywhere in the online > documentation (https://tesseract-ocr.github.io/tessdoc/)? > 2. Is there a way to turn off the pagelayout analysis and other > preprocessing before the LSTM modules? > > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/306430fd-f4a2-46ae-b0f1-ec0fa1c229ban%40googlegroups.com.