Edit report at http://bugs.php.net/bug.php?id=53823&edit=1
ID: 53823 User updated by: keith at chaos-realm dot net Reported by: keith at chaos-realm dot net -Summary: preg_replace: * qualifier on unicode replace garbacles the string +Summary: preg_replace: * qualifier on unicode replace garbles the string Status: Open Type: Bug Package: Unicode Engine related Operating System: Linux PHP Version: 5.3SVN-2011-01-23 (snap) Block user comment: N Private report: N New Comment: . Previous Comments: ------------------------------------------------------------------------ [2011-01-23 18:00:57] keith at chaos-realm dot net Description: ------------ When using the following test script to strip out all unicode except for letters the string becomes garbled when the * qualifier is added, the only surviving character that is intact is ú. Also, if you add \pN to the exceptions it additionally preserves the ó. Verified on 5.2,5.3 and 5.3-SNAP. Test script: --------------- echo preg_replace('/[^\pL\pM]*/iu', '', 'áéÃóú'); or echo preg_replace('/[^\pL\pM\pN]*/iu', '', 'áéÃóú'); Expected result: ---------------- áéÃóú Actual result: -------------- ����ú or ���óú (if \pN is added to the exceptions). ------------------------------------------------------------------------ -- Edit this bug report at http://bugs.php.net/bug.php?id=53823&edit=1