[issue4868] Faster utf-8 decoding

Antoine Pitrou Thu, 08 Jan 2009 07:22:45 -0800

Antoine Pitrou <pit...@free.fr> added the comment:

> Attached patch (utf8decode4.patch) changes this and may enter the fast
> loop on the first character.


Thanks!

> Does this idea apply to the encode function as well?

Probably, although with a smaller gain (a long can hold 1, 2 or 4
Unicode characters depending on the build).
The unrolling part also applies to simple codecs such as latin1.
Unrolling PyUnicode_DecodeLatin1 a bit (4 copies per iteration) makes it
twice as fast on non-tiny strings. I'll experiment with utf16.
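
To illustrate the kind of unrolling meant here, a minimal sketch follows. It
is not the actual PyUnicode_DecodeLatin1 code: the function name
latin1_decode_unrolled and the 16-bit output type are assumptions made for
the example (a narrow build is assumed; a wide build would use a 32-bit
type).

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, not CPython's PyUnicode_DecodeLatin1.
   Each Latin-1 byte maps directly to the code point of the same value,
   so decoding is just a widening copy. */
static void latin1_decode_unrolled(const unsigned char *src, size_t n,
                                   uint16_t *dst)
{
    size_t i = 0;

    /* Unrolled main loop: four copies per iteration, so the loop
       condition is tested once per four characters. */
    for (; i + 4 <= n; i += 4) {
        dst[i]     = src[i];
        dst[i + 1] = src[i + 1];
        dst[i + 2] = src[i + 2];
        dst[i + 3] = src[i + 3];
    }

    /* Tail loop: copy the remaining 0 to 3 bytes one at a time. */
    for (; i < n; i++)
        dst[i] = src[i];
}
```

Whether this gives the same speedup depends on the compiler and build
options; some compilers unroll or vectorize the simple loop on their own.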

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue4868>
_______________________________________
