>>>>> "Andre" == Andre Poenitz <[EMAIL PROTECTED]> writes:

Andre> On Mon, Nov 15, 2004 at 12:10:40PM +0100, Jean-Marc Lasgouttes
Andre> wrote:
>> So, we have two different definitions of what a word is, one being
>> by inclusion and the other by exclusion. I am not sure what is the
>> best way to define a word. Note that the isp_esc_char stuff should
>> move to ControlSpellchecker. Then we would be in postion to unify
>> these two definitions of a word. But how should we do it?

Andre> I think it doesn't matter much how it is done as long as it is
Andre> done uniformly. If in doubt I'd prefer the 'inclusion' method,
Andre> but that's most likely a personal preference about which I
Andre> don't care too much.

The inclusion method now does:

/// return true if a char is alphabetical (including accented chars)
inline
bool IsLetterChar(unsigned char c)
{
        return (c >= 'A' && c <= 'Z')
                || (c >= 'a' && c <= 'z')
                || (c >= 192); // in iso-8859-x these are accented chars
}


We can consider that this is good enough for now... Of course, we
should be able to define word-chars for each charset. I suspect that
locale has all this information for us, but we are not sure that we
use the locale that corresponds to the document's language.

JMarc

Reply via email to