Chris Angelico <ros...@gmail.com> writes: > UTF-8 is highly inefficient for indexing. Given a buffer of (say) a > few thousand bytes, how do you locate the 273rd character?
How often do you need to do that, as opposed to traversing the string by iteration? Anyway, you could use a rope-like implementation, or an index structure over the string. -- http://mail.python.org/mailman/listinfo/python-list