Am 21.02.2016 um 11:42 schrieb Anatol Belski <anatol....@belski.net>:
>> -----Original Message-----
>> From: Ángel González [mailto:keis...@gmail.com]
>> Sent: Sunday, February 21, 2016 1:27 AM
>> To: Anatol Belski <anatol....@belski.net>
>> Cc: 'Christian Schneider' <cschn...@cschneid.com>; 'PHP internals'
>> <internals@lists.php.net>
>> Subject: Re: [PHP-DEV] PCRE jit bug with UTF-8 and lookbehind assertion
>> 
>> On 19/02/16 09:20, Anatol Belski wrote:
>>> Could you please write back, what is the out difference between those
>>> two commands? Thanks. Anatol
>> In the first case, it correctly outputs «x°11» (78 c2 b0 7a). With jit 
>> enabled it
>> produces «x z» (78 c2 7a). That is, it is only outputting the lower byte of 
>> the utf-8
>> encoding of the U+00B0 character Tested on PHP 7.0.3 using the system libpcre
>> 8.38
>> 
> Were you putting the snippets into a file or testing on the console? I had an 
> issue while testing this on the console, that some chars was partially 
> swallowed by terminal (which was a utf-8 terminal). When putting into a file, 
> the output is same for both - "x°z". Please see also the continued discussion 
> in the original ticket https://bugs.exim.org/show_bug.cgi?id=1189 . The 
> offsets delivered by PCRE also seem to be correct,  and valgrind doesn't find 
> anything. It were great if you could confirm these insights.

I can reproduce it in a console and in a file.

PCRE Library Version => 8.38 2015-11-23

I also reproduced it with a C program directly using the system PCRE library, 
no PHP involved.
I attached the C source to https://bugs.exim.org/show_bug.cgi?id=1189

Regards,
- Chris


--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to