[tesseract-ocr] Avestan OCR

Seyedsoroush Hashemi Thu, 25 Jul 2024 06:54:54 -0700

Hey all,
We're considering training a model for Avestan OCR (and probably later a 
model for Pahlavi). Both of these are ancient Iranian languages with 
limited remaining text, which is being digitized by a few projects in 
academia. An OCR model can significantly speed up those projects and enable 
further analysis (e.g., author recognition).


We couldn't find any mention of Avestan in this Google group or in the 
Tesseract documentation. So, could you please answer the following 
questions:
1. Have there been any attempts/progress towards adding Avestan/Pahlavi OCR 
to Tesseract? If so, could you please share the result?
2. Is there anyone who wants to join us in this project?

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/2298e8f7-3504-4906-b64d-eecbd773b6fcn%40googlegroups.com.

[tesseract-ocr] Avestan OCR

Reply via email to