Let's try some regex matching in PHP.
,----
| php > echo preg_match('/\w/', 'a');
| 1
`----

ok, so the basic stuff works in English. Let's go on.

,----
| php > echo preg_match("/\w/", 'א');
| 0
| php > echo preg_match('/\w/u', 'א');
| 0
`----

Oops.. maybe some kind of encoding issue? My whole system is UTF8, so this 
should not be a problem I guess..
,----
| php > var_dump('א');
| string(2) "א"
`----

So I revert to the soon-to-be-deprecated mb_ereg:

,----
| php > mb_regex_encoding('UTF-8');
| php > echo mb_ereg('\w', 'א');
| 1
`----

and now it works.. 

maybe I was wrong expecting preg_match would know hebrew alphanumeric 
characters in the first place? I understand it will for PHP6, but until 
then, beware of sophisticated Hebrew string parsing.

--yuval

Attachment: signature.asc
Description: This is a digitally signed message part.

_______________________________________________
Linux-il mailing list
Linux-il@cs.huji.ac.il
http://mailman.cs.huji.ac.il/mailman/listinfo/linux-il

Reply via email to