On 2023-04-06 08:45, demerphq wrote:
Although this causes pcre2grep to mishandle Unicode characters:
$ echo 'Ævar' | pcre2grep '[Ssß]'
Ævar
it mimics Perl 5.36:
$ echo 'Ævar' | perl -ne 'print $_ if /[Ssß]/'
Ævar
so this seems to be what Perl users expect, despite its infelici
On 2023-04-06 06:39, demerphq wrote:
Unicode specifies that \d match any digit
in any script that it supports.
"Specifies" is too strong. The Unicode Regular Expressions technical
standard (UTS#18) mentions \d only in Annex C[1], next to the word
"digit" in a column labeled "Property" (even
On Fri, Apr 7, 2023 at 12:00 PM Paul Eggert wrote:
>
> On 2023-04-06 06:39, demerphq wrote:
>
> > Unicode specifies that \d match any digit
> > in any script that it supports.
>
> "Specifies" is too strong. The Unicode Regular Expressions technical
> standard (UTS#18) mentions \d only in Annex C[1