Re: [sword-devel] Converting RTF \'XX to UTF-8

2008-06-22 Thread Chris Little
Aha! An example goes a long way, so now I understand the real problem. You just need to change the codepage. cp1252 is the Windows equivalent of ISO 8859-1. Since you want Greek, you need the ISO 8859-7 equivalent, which would be cp1253, thus: perl -CO -pe 'use Encode; s/\\\'([0-9a-fA-F]{2})/de
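The effect of the codepage choice can be illustrated with a small Python sketch. This is not the Perl command from the message (which is cut off in this archive preview); it only shows the same idea under stated assumptions. The function name `decode_rtf_hex` and its `codepage` parameter are hypothetical names made up for this illustration.

```python
import re

# An RTF hex escape: a backslash, an apostrophe, then two hex digits (e.g. \'c3).
HEX_ESCAPE = re.compile(r"\\'([0-9a-fA-F]{2})")


def decode_rtf_hex(text, codepage="cp1253"):
    """Replace each \\'XX escape with the character byte XX maps to in `codepage`.

    Hypothetical helper for illustration; bytes that are undefined in the
    chosen codepage raise UnicodeDecodeError.
    """
    def to_char(match):
        return bytes([int(match.group(1), 16)]).decode(codepage)

    return HEX_ESCAPE.sub(to_char, text)


sample = r"\'c3\'e5"
print(decode_rtf_hex(sample, "cp1252"))  # Latin letters: wrong codepage for Greek text
print(decode_rtf_hex(sample, "cp1253"))  # Greek capital gamma followed by small epsilon
```

Python strings are Unicode, so writing the result to a file opened with `encoding="utf-8"` would produce the UTF-8 output the thread is after.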

[sword-devel] Converting RTF \'XX to UTF-8

2008-06-22 Thread Karl Kleinpaste
I've got an RTF document which contains this kind of encoding: \cf2 \'c3\'e5\u769?\'ed\'e5\'f3\'e9\'f2\cf0 That renders the word "Genesis" in Greek, i.e. \'c3 is the capital gamma. I need to find a scriptable way to convert this k
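The sample above mixes two kinds of escapes: \'XX hex bytes and a \u769? Unicode escape (decimal 769 is U+0301, the combining acute accent). A rough Python sketch of decoding both follows. It assumes the bytes are cp1253 and that every \uN escape is followed by a single literal "?" fallback character, as in the sample; real RTF allows other fallbacks. The name `decode_rtf_escapes` is hypothetical, and other control words such as \cf2 are left untouched.

```python
import re
import unicodedata

# Either a hex escape (\'XX) or a Unicode escape (\uN) followed by a "?" fallback.
TOKEN = re.compile(r"\\'([0-9a-fA-F]{2})|\\u(-?\d+)\?")


def decode_rtf_escapes(text, codepage="cp1253"):
    """Decode \\'XX and \\uN? escapes; hypothetical helper for illustration."""
    def to_char(match):
        if match.group(1) is not None:
            # Hex escape: one byte interpreted in the given codepage.
            return bytes([int(match.group(1), 16)]).decode(codepage)
        code = int(match.group(2))
        if code < 0:
            # RTF writes \uN as a signed 16-bit value.
            code += 65536
        return chr(code)

    # NFC folds "epsilon + combining acute" into the single accented letter.
    return unicodedata.normalize("NFC", TOKEN.sub(to_char, text))


word = decode_rtf_escapes(r"\'c3\'e5\u769?\'ed\'e5\'f3\'e9\'f2")
print(word)
print(word.encode("utf-8"))
```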