Branch: refs/heads/blead
  Home:   https://github.com/Perl/perl5
  Commit: d1e2a852fbc901b45fba20906a8f42ca227ae462
      
https://github.com/Perl/perl5/commit/d1e2a852fbc901b45fba20906a8f42ca227ae462
  Author: Karl Williamson <[email protected]>
  Date:   2025-10-02 (Thu, 02 Oct 2025)

  Changed paths:
    M utf8.h

  Log Message:
  -----------
  Add isIDCONT_lazy_if_safe()

Various places in the code use isWORDCHAR to match the continuation
characters of an identifier.  This mostly works, but the two sets are
not identical, and the proper thing to do is to match continuation
characters.  Until now the infrastructure lacked a macro that made it
easy to do the right thing.  This commit adds the infrastructure,
leaving it to future commits to use it.

A reasonably complete list of characters that differ between the two
sets is:

MIDDLE DOT
GREEK YPOGEGRAMMENI
GREEK ANO TELEIA
COMBINING CYRILLIC HUNDRED THOUSANDS SIGN
COMBINING CYRILLIC MILLIONS SIGN
ARMENIAN MODIFIER LETTER LEFT HALF RING
ARMENIAN EMPHASIS MARK
NEW TAI LUE THAM DIGIT ONE
COMBINING PARENTHESES OVERLAY
COMBINING ENCLOSING CIRCLE
COMBINING ENCLOSING CIRCLE BACKSLASH
COMBINING ENCLOSING SCREEN
COMBINING ENCLOSING UPWARD POINTING TRIANGLE
MANDAIC LETTER AZ
ESTIMATED SYMBOL
CIRCLED LATIN CAPITAL LETTER A
...
CIRCLED LATIN SMALL LETTER Z
VERTICAL TILDE
KATAKANA MIDDLE DOT
COMBINING CYRILLIC TEN MILLIONS SIGN
COMBINING CYRILLIC THOUSAND MILLIONS SIGN
ARABIC LIGATURE SHADDA WITH DAMMATAN ISOLATED FORM
ARABIC LIGATURE SHADDA WITH SUPERSCRIPT ALEF ISOLATED FORM
ARABIC LIGATURE SALLALLAHOU ALAYHE WASALLAM
ARABIC LIGATURE JALLAJALALOUHOU
ARABIC FATHATAN ISOLATED FORM
ARABIC DAMMATAN ISOLATED FORM
ARABIC KASRATAN ISOLATED FORM
ARABIC FATHA ISOLATED FORM
ARABIC DAMMA ISOLATED FORM
ARABIC KASRA ISOLATED FORM
ARABIC SHADDA ISOLATED FORM
ARABIC SUKUN ISOLATED FORM
HALFWIDTH KATAKANA MIDDLE DOT
SQUARED LATIN CAPITAL LETTER A
...
SQUARED LATIN CAPITAL LETTER Z
NEGATIVE CIRCLED LATIN CAPITAL LETTER A
...
NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
NEGATIVE SQUARED LATIN CAPITAL LETTER A
...
NEGATIVE SQUARED LATIN CAPITAL LETTER Z
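
As a rough illustration of how a "word character" test and an
"identifier continuation" test can disagree, the sketch below probes a
few code points from the list above using only the Python standard
library.  It is an analogy, not Perl's implementation: Python's \w and
its identifier rules (str.isidentifier) are defined differently from
Perl's isWORDCHAR and isIDCONT, so the results need not match what Perl
reports.  The helper names is_word, is_id_continue and report are
hypothetical and exist only for this example.

```python
import re
import unicodedata

# Code points picked from the list above: MIDDLE DOT, GREEK ANO TELEIA,
# ESTIMATED SYMBOL, CIRCLED LATIN CAPITAL LETTER A.
SAMPLES = [0x00B7, 0x0387, 0x212E, 0x24B6]


def is_word(ch):
    # True if Python's regex \w matches the single character ch.
    # Assumption: this is only a stand-in for Perl's isWORDCHAR.
    return re.fullmatch(r"\w", ch) is not None


def is_id_continue(ch):
    # True if ch may follow the first character of a Python identifier.
    # Assumption: this is only a stand-in for Perl's isIDCONT.
    return ("a" + ch).isidentifier()


def report(code_points):
    # Build one (code point, name, word?, idcont?) row per code point.
    rows = []
    for cp in code_points:
        ch = chr(cp)
        rows.append(
            (f"U+{cp:04X}", unicodedata.name(ch), is_word(ch), is_id_continue(ch))
        )
    return rows


if __name__ == "__main__":
    for cp, name, word, idc in report(SAMPLES):
        print(f"{cp}  word={word!s:5}  idcont={idc!s:5}  {name}")
```

For MIDDLE DOT, Python's \w does not match, yet the character is
accepted as an identifier continuation, which is the kind of mismatch
the commit message describes.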


