[tesseract-ocr] Re: TessBaseAPI::ProcessPages can be stopped using ETEXT_DESC ?

2022-06-22 Thread flavi...@gmail.com
I also tried this (inside the thread method): ETEXT_DESC monitor; monitor.cancel = &CMyDocEx::cancel; monitor.cancel_this = this; api->Recognize(&monitor); std::shared_ptr renderer = std::make_shared( CStringA(sSrcFile), api-
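
The callback wiring in the snippet above can be sketched without Tesseract. This is a minimal sketch that assumes the cancel callback has the shape bool(void*, int). MonitorSketch, DocSketch and RecognizeSketch are hypothetical stand-ins, not Tesseract types, and the real CMyDocEx is not shown in the post. The point is that a plain function pointer cannot bind a non-static member function, so the handler is static and the object travels through cancel_this.

```cpp
#include <atomic>

// Stand-in (hypothetical) for the two monitor fields used in the post:
// a C-style callback plus an opaque context pointer.
struct MonitorSketch {
    bool (*cancel)(void* cancel_this, int words) = nullptr;
    void* cancel_this = nullptr;
};

// Hypothetical document class playing the role of CMyDocEx.
class DocSketch {
public:
    // Called from another thread (e.g. the UI) to ask the worker to stop.
    void RequestStop() { stop_.store(true); }

    // Static trampoline: recovers the object from cancel_this and
    // reports whether a stop was requested.
    static bool Cancel(void* cancel_this, int /*words*/) {
        return static_cast<DocSketch*>(cancel_this)->stop_.load();
    }

private:
    std::atomic<bool> stop_{false};
};

// Stand-in for a long-running recognizer that polls the monitor once
// per word. Returns the number of words processed before cancellation.
int RecognizeSketch(const MonitorSketch& monitor, int total_words) {
    int done = 0;
    for (; done < total_words; ++done) {
        if (monitor.cancel != nullptr &&
            monitor.cancel(monitor.cancel_this, done)) {
            break;  // the callback asked us to stop
        }
    }
    return done;
}
```

With monitor.cancel = &DocSketch::Cancel and monitor.cancel_this = &doc, the loop runs to completion until doc.RequestStop() is called, after which the next poll ends it. The flag is std::atomic because the stop request and the polling happen on different threads.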

[tesseract-ocr] Re: TessBaseAPI::ProcessPages can be stopped using ETEXT_DESC ?

2022-06-22 Thread flavi...@gmail.com
I think I've found the answer: *api->ProcessPages* cannot be stopped. I tried the following code: api->SetPageSegMode(tesseract::PSM_AUTO); Pix* image = pixRead(CStringA(sSrcFile)); api->SetImage(image); ETEXT_DESC monitor; monitor.cancel = &CMyDoc

[tesseract-ocr] Creating a new language pack

2022-06-22 Thread TiMauzi
Hello everyone, I am planning to create a language pack for a new language that isn't covered by the existing language packs. I don't need a new font, since my language is Latin-based. Is there a way of training a new model with just a plain training text / a language corpus and usage of existing