We have a CMS which is written is based on php & mysql. Recently we received
a request to support multiple languages so that sites in that particular
laguage can be created. I did some search on the google and it seems I have
to build in multibyte support for php and mysql. Mbstring
(http://us3.php.net/mbstring) claims to support multiple languages with a
caution saying it might not work properly.

After further research it seems unicode might be the way to go, since
unicode can represents all characters (in all languages) with integers,
which in turn can be handled in php as it has excellent integer support. But
again since all the data is store in mysql we need unicode support for mysql
too and it has 2 formats (http://www.mysql.com/doc/en/Charset-Unicode.html)
usc-2 (for storing data) and utf-8 (for encoding). Finally, we decide to
stick with utf-8 as it could support more languages than usc-2 and also
since php has utf-8 support.

Our application evolved over time and it was not very long ago that we
introduced a new DB layer. Even though efforts were made to change code to
use this new DB layer, I am afraid there still might be instances where we
use the php mysql functions to access db. We use egexes heavily and do a lot
of text manipulation, so that rules out any chance of being closer to utf-8.
So I am wondering if there are any good approaches to this issue i.e. modify
the existing application code minimally to handle utf-8 (that doesn't sound
right!) and make it all transparent to the presentation layer. Any
suggestions or comments are welcome.

Turi
http://aijalon.net

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to