Author: larry Date: Wed Jun 11 17:34:50 2008 New Revision: 14550 Modified: doc/trunk/design/syn/S05.pod
Log: fixed "samebase" illogic pointed out by moritz++ Modified: doc/trunk/design/syn/S05.pod ============================================================================== --- doc/trunk/design/syn/S05.pod (original) +++ doc/trunk/design/syn/S05.pod Wed Jun 11 17:34:50 2008 @@ -14,9 +14,9 @@ Maintainer: Patrick Michaud <[EMAIL PROTECTED]> and Larry Wall <[EMAIL PROTECTED]> Date: 24 Jun 2002 - Last Modified: 18 May 2008 + Last Modified: 11 Jun 2008 Number: 5 - Version: 79 + Version: 80 This document summarizes Apocalypse 5, which is about the new regex syntax. We now try to call them I<regex> rather than "regular @@ -181,7 +181,7 @@ The single-character modifiers also have longer versions: :i :ignorecase - :b :basechar + :a :ignoreaccent :g :global =item * @@ -227,7 +227,7 @@ =item * -The C<:b> (or C<:basechar>) modifier scopes exactly like C<:ignorecase> +The C<:a> (or C<:ignoreaccent>) modifier scopes exactly like C<:ignorecase> except that it ignores accents instead of case. It is equivalent to taking each grapheme (in both target and pattern), converting both to NFD (maximally decomposed) and then comparing the two base @@ -237,7 +237,7 @@ includes all ignored characters, including any that follow the final base character. -The C<:bb> (or C<:samebase>) variant may be used on a substitution to change the +The C<:aa> (or C<:sameaccent>) variant may be used on a substitution to change the substituted string to the same accent pattern as the matched string. Accent info is carried across on a character by character basis. If the right string is longer than the left one, the remaining characters @@ -968,8 +968,8 @@ unless it happens to be a C<Regex> object, in which case it is matched as a subrule. As with scalar subrules, a tainted subrule always fails. All string values pay attention to the current C<:ignorecase> -and C<:basechar> settings, while C<Regex> values use their own -C<:ignorecase> and C<:basechar> settings. +and C<:ignoreaccent> settings, while C<Regex> values use their own +C<:ignorecase> and C<:ignoreaccent> settings. When you get tired of writing: @@ -1069,7 +1069,7 @@ =back All hash keys, and values that are strings, pay attention to the -C<:ignorecase> and C<:basechar> settings. (Subrules maintain their +C<:ignorecase> and C<:ignoreaccent> settings. (Subrules maintain their own case settings.) You may combine multiple hashes under the same longest-token