> You can use regexes to match specific characters. But generally your converter > (e.g: of ahttp://en.wikipedia.org/wiki/Lightweight_markup_language) wil ldo > that for you. Do you need to guess the language of the document? I still don't > understand exactly what you want to do. >
Shlomi, I do agree that the description of the problem was weird. Just to make it clear once again: - Langauge specific characters had to be in a plain text file - Users with no idea about the language type in the UTF-8 hex codes instead of the special characters instead - For example 0xC30x85 instead of the Danish character itself - The advantage of this is that I now can process the file because I know it is ASCII compliant - While reading the file line by line, I process them dynamically like so: s/0x([\d+A-Fa-f]{2})/\\x$1/ge - All I have to do now is interpret its character chr(hex($var)) and ensure that my output streams are UTF-8 compliant -- To unsubscribe, e-mail: beginners-unsubscr...@perl.org For additional commands, e-mail: beginners-h...@perl.org http://learn.perl.org/