[tesseract-ocr] Improve Current Tesseract Results

Glenn Wed, 13 Jan 2021 05:12:58 -0800

Hello, I am currently working on a Korean dataset and am having trouble 
getting all of the values recognized correctly. Two problems are that the 
pictures are slightly wonky and that the text is in Korean.


[image: ApplicationFrameHost_bxb8Ck9yTh.png]

I cropped the image and converted it to greyscale to try to improve it, but 
it still looks slightly blurry. I'm not sure this is the best approach, and 
I can crop out a larger image if that would help.

The current problem is that the recognition accuracy is not very good. The 
default settings give me a jumble. I found that psm 4 works best, but the 
output still does not look very good, and it seems like tesseract just 
breaks halfway through.
[image: Code_I1PxTycm88.png]
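
For reference, here is a minimal sketch of how the page segmentation modes 
could be compared side by side from Python by calling the tesseract command 
line. It assumes the tesseract executable and the "kor" language data are 
installed; the function names, the file name "page.png", and the choice of 
modes 6 and 11 alongside psm 4 are only illustrative and not taken from my 
actual script.

```python
import shutil
import subprocess


def build_tesseract_cmd(image_path, lang="kor", psm=4):
    """Build the argument list for one tesseract CLI run (hypothetical helper)."""
    # Using "stdout" as the output base makes tesseract print the text
    # instead of writing a .txt file.
    return ["tesseract", image_path, "stdout", "-l", lang, "--psm", str(psm)]


def run_psm_modes(image_path, modes=(4, 6, 11)):
    """Run tesseract once per page segmentation mode and return {psm: text}."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    results = {}
    for psm in modes:
        proc = subprocess.run(
            build_tesseract_cmd(image_path, psm=psm),
            capture_output=True,
            encoding="utf-8",  # Korean output should be decoded as UTF-8
            check=True,
        )
        results[psm] = proc.stdout
    return results
```

Calling run_psm_modes("page.png") would return one text result per mode, 
which makes it easier to see whether any mode gets past the point where the 
output currently breaks off.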
How can I improve this? I was thinking of cutting the image into slices and 
reading each one separately, but I am still not sure that would fix it. Is 
the image quality just not good enough?
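
To make the slicing idea concrete, this is roughly what I have in mind: a 
small sketch that computes horizontal strips with a little overlap so that 
a text line sitting on a boundary is not cut in half. The function name, 
the strip count, and the overlap value are assumptions for illustration. 
The boxes use the (left, upper, right, lower) order that Pillow's 
Image.crop expects.

```python
def strip_boxes(width, height, n_strips, overlap=10):
    """Return (left, upper, right, lower) boxes for horizontal strips.

    Each strip is extended by `overlap` pixels above and below, clamped
    to the image bounds, so text on a boundary appears whole in at least
    one strip.
    """
    if n_strips < 1:
        raise ValueError("n_strips must be at least 1")
    step = height / n_strips
    boxes = []
    for i in range(n_strips):
        upper = max(0, int(round(i * step)) - overlap)
        lower = min(height, int(round((i + 1) * step)) + overlap)
        boxes.append((0, upper, width, lower))
    return boxes
```

Each box could then be cropped out and passed to tesseract on its own, 
possibly with a different psm value than the full page uses.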

Thank you

