[issue24870] Optimize coding with surrogateescape and surrogatepass error handlers

Serhiy Storchaka Thu, 24 Sep 2015 06:11:33 -0700

Serhiy Storchaka added the comment:

I worked on UTF-16 and UTF-32 encoders, but now I'm off my developing computer. 
I'll provide updated patch soon.


I think that only "surrogateescape" and "surrogatepass" error handlers have 
need in optimization, because they are used to interpolate with other programs, 
including old Python versions. "strict" stops processing, an optimization is 
not needed here. All other error handlers lose information and can't be used 
per se for transcoding bytes as string or string as bytes. They are used 
together with other slow code (for example for encoding string in XML or HTML 
you first need to escape '&', '<' and quotes). It is easy to add fast handling 
for 'ignore' and 'replace', but these error handlers are used largely for 
produce human-readable output, and adding it can slow down common case (no 
errors). That is why I limit my patch for "surrogateescape" and "surrogatepass" 
only.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue24870>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

[issue24870] Optimize coding with surrogateescape and surrogatepass error handlers

Reply via email to