Here's a test for the original problem, which is also fixed by the
preceding patch:
0004-tests-ensure-neither-s-nor-S-matches-an-invalid-mult.patch
This one really surprised me.
Learning that multibyte \s and \S had been broken since grep-2.6 did
not make my day. But fixing it helped.
Here's how it started:
To demonstrate the (first) bug, set up a UTF-8 locale:
export LC_ALL=en_US.UTF-8
then run this and note that it matches:
I was surprised to find this dead code in dfa.c.
This removes it:
0002-dfa-remove-dead-disjunct.patch
Here's some no-semantic-change cleanup:
0001-maint-dfa-improve-comments-and-formatting.patch