Hello, I am currently working on this Korean dataset and was having some issues on getting the values all correctly. A few problems are the pictures being slightly wonky as well as it being in Korean.
[image: ApplicationFrameHost_bxb8Ck9yTh.png] I cropped the data as well as made it greyscale to attempt to better the image, but it still looks slightly blurry. I'm not sure if this is the best way and can crop out to a larger image. The current problem is that the performance is not very good. The default settings gives me a jumble. Although I found that psm 4 is the best, it still does not look very good and it seems like tesseract just breaks halfway through. [image: Code_I1PxTycm88.png] How can I improve this? I was thinking of cutting the data into slices to read each, but still I am not sure if I can fix this. Is the image quality just not good enough? Thank you -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3a678433-9334-42e6-9a57-a3f1a5c4cf4dn%40googlegroups.com.