I implemented the suggestions-feature for a couple of web-sites.
an example can be seen on
http://www.genios.de/r_firmen/webcgi?START=016&SEITE=firmenk_d.ein&DBN=&WID=01852-8850939-00904_3
genios.de .
type smth in in the Firma and Person fields.
The Firma-index has 3++ mio records, Person ~ 1.
In my project, i stored the user input keyword in the database, as a result,
I build a index from the database and use it to do suggestive search. The
code example is googled and I changed the analyzer and query function. I
attach the code but you have to modify the code to make it run.
For chines
I test the lucene spellchecker and it doesn't support chinese spell checker,
how can i achieve this goal as google does?
2009/4/9 Karl Wettin
> If you use prefix grams only then you'll get a forward-only suggestion
> scheme. I've seen several implementation that use that and it works quite
> wel
If you use prefix grams only then you'll get a forward-only suggestion
scheme. I've seen several implementation that use that and it works
quite well.
harry potter: ^ha, ^har, ^harr, ^harry, ^harry p, ^harry po..
harry houdini: ^ha, ^har, ^harr, ^harry, ^harry h, ^harry ho..
I prefere the tr
I use TermEnum for this sort of "browsing" on untokenized, unstored
fields e.g. TermEnum terms=reader.terms(new Term("mybrowsefld","harr")).
-Gary
Matt Schraeder wrote:
Corerct me if I'm wrong, but I don't think n-grams is really what I'm
looking for here. I'm not looking for a spellchecker or
Corerct me if I'm wrong, but I don't think n-grams is really what I'm
looking for here. I'm not looking for a spellchecker or phrase checker
style suggestive search, but only based on the exact phrases the user is
currently typing. Since Lucene uses term-based searching, I'm not sure
how to have
For this you probably want to use ngrams. Wether or not this is
something that fits in your current index is hard to say. My guess is
that you want to create a new index with one document per unique
phrase. You might also want to try to load this index in an
InstantiatedIndex, that could sp