How about regular expressions:

Modern Greek Accented
[\u0370-\u0390 \u03AA-\u03B0 \u03CA-\u03D4]

Polytonic Greek Accented
[\u1F00-\u1FFE]

Hebrew Vowel Points
[\u05BB-\u05B0]

Hebrew Cantillation
[\u0591-\u05AE]

I don't know about Arabic.

Peace,

David

On 8/28/2015 4:21 AM, Peter von Kaehne wrote:
On Fri, 2015-08-28 at 01:27 +0200, Matěj Cepl wrote:
iconv -f utf8 -t us-ascii//translit file.xml \
         |diff -u - file.xml
Thanks Matej,

This would probably work on latin scripts with diacritics, but not on
the scripts I am interested in - Hebrew, Arabic derrived and Greek.

Peter

_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page


---
This email has been checked for viruses by Avast antivirus software.
https://www.avast.com/antivirus


_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to