On Monday 30 May 2005 18:54, Andrew Boyd wrote:

>   Now that the QueryParser knows about position increments has anyone
> used this to do stemming at query time and not at indexing time?  I
> suppose one would need a reverse stemmer.  Given the query breath it
> would need to inject breathe, breathes, breathing etc.

There are two things to consider: queries will get more complicated and 
thus slower and the implementation isn't that easy:  while stemming can be 
done with a simple algorithm (for English), you'll need a dictionary with 
at least part-of-speech information for adding suffixes. That's because 
you cannot just add "ing" to any word, otherwise you'd end up with car + 
ing  = caring. (But once you have this dictionary the quality of your 
solution can be better than that of a stemmer, as stemmers also suffer 
form over-stemming, i.e. mapping two non-related words to the same form).

Regards
 Daniel

-- 
http://www.danielnaber.de

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to