I don't have a list of common "combined word" queries. Splitting of words seem to be quite a standard thing, most search engines and spell checkers have this ability. It would be nice if Lucene provides this out of the box.
karl wettin-3 wrote: > > > 14 jan 2008 kl. 19.47 skrev solr_user: > >> Does Lucene spell checker have the ability to suggest splitting of >> combined >> words. So for e.g. if I have got the word "apple" and "computer" in >> my >> index and if I type "applecomputer" then how can I make it suggest >> "apple computer" > > > It would probably be very expensive to analyze each query token and > decompose it using some word list or index. > > If "apple computer" is a phrase that is common in your query, then you > can create a new field with chained tokens: "think diffrent apple > computer" -> "thinkdiffrent diffrentapple applecomputer". This would > at least allow you to search for such typos. You might want to set > some threadholds like min/max token size and such in your TokenFilter. > > > -- > karl > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > > -- View this message in context: http://www.nabble.com/spell-checking-for-combined-words-tp14809197p14843700.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]