I don't have a list of common "combined word" queries. Splitting words
seems to be quite a standard feature; most search engines and spell checkers
can do it. It would be nice if Lucene provided this out of the box.
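
A minimal sketch of the kind of split suggestion meant here, in plain Java.
It assumes the known terms sit in an in-memory set; against a real index the
lookup would go to the term dictionary instead. The class and method names
(SplitSuggester, suggestSplits) are made up for illustration and are not part
of Lucene.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical helper, not a Lucene class.
final class SplitSuggester {

    // Tries every split point and keeps the splits where both halves
    // are known terms. Only two-word splits are considered.
    static List<String> suggestSplits(String word, Set<String> knownTerms) {
        List<String> suggestions = new ArrayList<>();
        for (int i = 1; i < word.length(); i++) {
            String left = word.substring(0, i);
            String right = word.substring(i);
            if (knownTerms.contains(left) && knownTerms.contains(right)) {
                suggestions.add(left + " " + right);
            }
        }
        return suggestions;
    }

    public static void main(String[] args) {
        // Stand-in for the terms in the index.
        Set<String> terms = Set.of("apple", "computer");
        System.out.println(suggestSplits("applecomputer", terms)); // [apple computer]
    }
}
```

Each query token costs one lookup per character position, which is where the
expense mentioned in the quoted reply below comes from.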


karl wettin-3 wrote:
> 
> 
> On 14 Jan 2008 at 19:47, solr_user wrote:
> 
>> Does the Lucene spell checker have the ability to suggest splitting
>> combined words? For example, if I have the words "apple" and
>> "computer" in my index and I type "applecomputer", how can I make it
>> suggest "apple computer"?
> 
> 
> It would probably be very expensive to analyze each query token and  
> decompose it using some word list or index.
> 
> If "apple computer" is a phrase that is common in your queries, then you
> can create a new field with chained tokens: "think different apple
> computer" -> "thinkdifferent differentapple applecomputer". This would
> at least allow you to search for such typos. You might want to set
> some thresholds, such as min/max token size, in your TokenFilter.
> 
> 
> -- 
> karl
> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
> 
> 
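
A rough sketch of the chained-token field suggested above, written as plain
Java over a token list rather than as an actual Lucene TokenFilter. The class
name ChainedTokens and the minLen/maxLen parameters are assumptions made for
illustration; the threshold values in main are arbitrary.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not a Lucene class.
final class ChainedTokens {

    // Joins each pair of adjacent tokens into one token and keeps the
    // result only if its length lies within [minLen, maxLen].
    static List<String> chain(List<String> tokens, int minLen, int maxLen) {
        List<String> chained = new ArrayList<>();
        for (int i = 0; i + 1 < tokens.size(); i++) {
            String joined = tokens.get(i) + tokens.get(i + 1);
            if (joined.length() >= minLen && joined.length() <= maxLen) {
                chained.add(joined);
            }
        }
        return chained;
    }

    public static void main(String[] args) {
        List<String> tokens = List.of("think", "different", "apple", "computer");
        // Prints: [thinkdifferent, differentapple, applecomputer]
        System.out.println(chain(tokens, 4, 20));
    }
}
```

Indexing the chained tokens in a separate field means a query for
"applecomputer" can match without decomposing the query token at search time.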

-- 
View this message in context: 
http://www.nabble.com/spell-checking-for-combined-words-tp14809197p14843700.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.

