Frank Niessink wrote:
> ...
> Character Range
> Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | 
> [#x10000-#x10FFFF]"
> 
> - What is the easiest/most pythonic (preferably build-in) way of 
> checking a unicode string for control characters and weeding those 
> characters out?

     drop_controls = [None] * 0x20
     for c in '\t\r\n':
         drop_controls[c] = unichr(c)
     ...
     some_unicode_string = some_unicode_string.translate(drop_controls)

--Scott David Daniels
[EMAIL PROTECTED]
-- 
http://mail.python.org/mailman/listinfo/python-list

Reply via email to