Re: unicode "table of character" implementation in python

2006-09-09 Thread Martin v. Löwis
Tim Roberts wrote:
>> 0530..058F; Armenian
>> 0590..05FF; Hebrew
>> ...
>
> This is a fabulously useful list, Martin. Did you get this from a web
> page? Can you tell me where?

It's part of the Unicode Consortium's database (UCD, Unicode Character Database). This specific table is called "cod
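
Block ranges like the ones quoted above (0530..058F for Armenian, 0590..05FF for Hebrew) can be turned into a simple range lookup. What follows is only a minimal sketch in modern Python 3, not code from the thread: the `BLOCKS` table is a small hand-picked excerpt, and the names `BLOCKS` and `block_of` are made up for illustration. A real program would presumably load the full table from the UCD rather than hardcode a few rows.

```python
from bisect import bisect_right

# Hand-picked excerpt of Unicode block ranges (start, end, name).
# Assumption: a real table would be loaded from the UCD, not hardcoded.
BLOCKS = [
    (0x0000, 0x007F, "Basic Latin"),
    (0x0080, 0x00FF, "Latin-1 Supplement"),
    (0x0530, 0x058F, "Armenian"),
    (0x0590, 0x05FF, "Hebrew"),
    (0x3040, 0x309F, "Hiragana"),
    (0x30A0, 0x30FF, "Katakana"),
    (0x4E00, 0x9FFF, "CJK Unified Ideographs"),
    (0xAC00, 0xD7AF, "Hangul Syllables"),
]

# Sorted block start points, used for binary search.
_STARTS = [start for start, _end, _name in BLOCKS]


def block_of(char):
    """Return the block name for a single character, or None if the
    character falls outside every range in this excerpt."""
    code_point = ord(char)
    # Index of the last block whose start is <= code_point.
    index = bisect_right(_STARTS, code_point) - 1
    if index >= 0:
        _start, end, name = BLOCKS[index]
        if code_point <= end:
            return name
    return None
```

With this excerpt, `block_of("א")` gives `"Hebrew"`, while a character in a gap between listed ranges (for example a Greek letter) gives `None`.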

Re: unicode "table of character" implementation in python

2006-08-29 Thread Tim Roberts
"Martin v. Löwis" <[EMAIL PROTECTED]> wrote: > >In any case, somebody pointed you to the Unicode code blocks. I think >these are Asian scripts (I may have missed some): > >0530..058F; Armenian >0590..05FF; Hebrew >... This is a fabulously useful list, Martin. Did you get this from a web page? Ca

Re: unicode "table of character" implementation in python

2006-08-28 Thread Martin v. Löwis
de "table of character"
> implementation in python? I mean, I give a character and python replies
> me with the language in which the character occurs.

This is a bit unspecific, so likely, nothing that already exists will be completely correct for your needs. If you need to escape characte

Re: unicode "table of character" implementation in python

2006-08-22 Thread Nicolas Pontoizeau
2006/8/22, Brian Beck <[EMAIL PROTECTED]>:
> Nicolas, check out the unicodedata module:
> http://docs.python.org/lib/module-unicodedata.html
>
> Find "import unicodedata" on this page for how to use it:
> http://www.amk.ca/python/howto/unicode
>
> I'm not sure if it has built-in support for finding
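
As a rough illustration of the `unicodedata` suggestion, the sketch below uses `unicodedata.name()` to guess whether a character belongs to an Asian script and wraps each run of such characters in a LaTeX macro. This is not code from the thread. It is written for modern Python 3, and the macro name `\asian`, the prefix list, and the helper names `is_asian` and `tag_asian` are all assumptions made for the example. Matching on character-name prefixes is a heuristic: it covers CJK ideographs, kana and Hangul, but it misses other Asian scripts and punctuation such as the ideographic full stop.

```python
import unicodedata
from itertools import groupby

# Character-name prefixes treated as "Asian" here (an assumption;
# extend the tuple to cover other scripts as needed).
ASIAN_PREFIXES = ("CJK", "HIRAGANA", "KATAKANA", "HANGUL")


def is_asian(char):
    """Heuristic: does the Unicode character name start with one of
    the prefixes above? Unnamed characters yield "" and return False."""
    return unicodedata.name(char, "").startswith(ASIAN_PREFIXES)


def tag_asian(text, macro="asian"):
    """Wrap each run of consecutive Asian characters in \\macro{...}.
    The macro name is hypothetical; use whatever the LaTeX setup defines."""
    parts = []
    for asian, run in groupby(text, key=is_asian):
        chunk = "".join(run)
        if asian:
            parts.append("\\" + macro + "{" + chunk + "}")
        else:
            parts.append(chunk)
    return "".join(parts)
```

For example, `tag_asian("Bonjour 日本語 hello")` returns `Bonjour \asian{日本語} hello`. Grouping runs rather than tagging single characters keeps the LaTeX output shorter.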

Re: unicode "table of character" implementation in python

2006-08-22 Thread Brian Beck
Nicolas Pontoizeau wrote:
> I am handling a mixed-language text file encoded in UTF-8. There are
> mainly French, English and Asian languages. I need to detect every
> Asian character in order to enclose it in a special tag for LaTeX.
> Does anybody know if there is a unicode "t

unicode "table of character" implementation in python

2006-08-22 Thread Nicolas Pontoizeau
Hi, I am handling a mixed-language text file encoded in UTF-8. There are mainly French, English and Asian languages. I need to detect every Asian character in order to enclose it in a special tag for LaTeX. Does anybody know if there is a unicode "table of character" implementation in