Re: Using Lucene with Jarowinkler

2008-01-07 Thread Chris Lu
Hi, Shivani, For my understanding, Jarowinkler doesn't quite fit with Lucene's structure. Calculating Jaro-Winkler distance for the query against each word in the index is quite computational intensive. What's possible may be using SoundEx, Metaphone, Double Metaphone, etc, instead. For each word

Re: Using Lucene with Jarowinkler

2008-01-07 Thread Grant Ingersoll
FuzzyQuery uses EditDistance, you probably could create a JaroWinklerQuery that mimics FuzzyQuery but calculates the JaroWinkler score instead of the edit distance. As for dealing with phrases, that would get a bit more complex, but you may be able to use PhraseQuery as an example and then