I did fine-tuning by adding some words that contained the new characters that I want. Now what I want to know is when we OCRed the document which is not computerized printed but scan image, the accuracy drops. so I thought if we trained the engine even in scan image then the accuracy won't be dropping so much. any suggestion? And if there is any way that we can feed image instead of lan.training_text , please send me the link
-- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJxgoodku4qy6DODGnEckRGVpFD_-bj6GZtEVDQEn%3DJPMpzzFw%40mail.gmail.com.