On 2015-08-28, 08:21 GMT, Peter von Kaehne wrote: > On Fri, 2015-08-28 at 01:27 +0200, Matěj Cepl wrote: >> iconv -f utf8 -t us-ascii//translit file.xml \ >> |diff -u - file.xml > > This would probably work on latin scripts with diacritics, but not on > the scripts I am interested in - Hebrew, Arabic derrived and Greek.
Did you try? I know that iconv has quite extensive number of transliteration rules. Other option would be to use recode (https://packages.debian.org/sid/recode, https://admin.fedoraproject.org/pkgdb/package/recode/ or http://directory.fsf.org/wiki/Recode)? It used to have a huge number of transliteration rules. Best, Matěj -- http://www.ceplovi.cz/matej/, Jabber: mc...@ceplovi.cz GPG Finger: 89EF 4BC6 288A BF43 1BAB 25C3 E09F EF25 D964 84AC For a successful technology, reality must take precedence over public relations, for nature cannot be fooled. -- R. P. Feynman's concluding sentence in his appendix to the Challenger Report _______________________________________________ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page