i'm basicaly new to unicode, so this message will probably make me look as a
total newbie (which in fact i am), but i've been totally sucked in by this
thread and started searching around in google... :)
anyway, unless i understood everything wrong, if we have "ffi" and decompose it,
and we decompose (as we should) our search string, we can match it while
searching for ffi, ff, f and ffi, fi etc.
("Canonical normalization (NF) does not affect any of these, but compatibility
normalization (NFK) will decompose the ffi ligature into the constituent
letters, so a search for U+0066 (f) as substring would succeed in an NFKC
normalization of U+FB03 but not in NFC normalization of U+FB03",
http://en.wikipedia.org/wiki/Unicode_equivalence)
however according to wikipedia, even characters as "é" would be decomposed (e
◌́), so we could match it while searching for "e" and i think that doesn't make
sense...
i still haven't understood the difference between canonical and compatibility
normalization, but looks like they both will decompose accented letters [For
example, the distinct Unicode strings "U+212B" (the angstrom sign "Å") and
"U+00C5" (the Swedish letter "Å") are both expanded by NFD (or NFKD) into the
sequence "U+0041 U+030A"]
but anyway... there's something i didn't understand in this thread, i.e. all
the questions about options and flags. is it just a technical problem (i.e. we
know how to handle this, we just need decide how to code it) or is a semanthic
problem (i.e. how do we know what this text means)?
if there are new options o flags to add why not get inspirations from other
engines? (like e.g. perl: http://docstore.mik.ua/orelly/perl4/cook/ch01_09.htm
http://www.perlmonks.org/?node_id=597745)
--
--
You received this message from the "vim_dev" maillist.
Do not top-post! Type your reply below the text you are replying to.
For more information, visit http://www.vim.org/maillist.php
---
You received this message because you are subscribed to the Google Groups
"vim_dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.