> You can use regexes to match specific characters. But generally your converter
> (e.g: of ahttp://en.wikipedia.org/wiki/Lightweight_markup_language) wil ldo
> that for you. Do you need to guess the language of the document? I still don't
> understand exactly what you want to do.
>


Shlomi,

I do agree that the description of the problem was weird. Just to make
it clear once again:

- Langauge specific characters had to be in a plain text file
- Users with no idea about the language type in the UTF-8 hex codes
instead of the special characters instead
- For example 0xC30x85 instead of  the Danish character itself
- The advantage of this is that I now  can process the file because I
know it is ASCII compliant

- While reading the file line by line, I process them dynamically like
so: s/0x([\d+A-Fa-f]{2})/\\x$1/ge
- All I have to  do now is interpret its character chr(hex($var)) and
ensure that my output  streams are UTF-8 compliant




-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/


Reply via email to