Glad I asked.

I would think that the automaton would be superior even for larger edit 
distances than 1 or 2 than the equivalent “crappy” algorithm.  But maybe I 
don’t understand something. ;-)

Karl


From: ext Robert Muir [mailto:[email protected]]
Sent: Friday, July 23, 2010 11:05 AM
To: [email protected]
Subject: Re: LevenshteinFilter proposal

this is actually done in trunk.

In trunk fuzzy's enum is a "proxy". for low distances (ed=1,2) it uses 
automaton.

for higher distances it uses the crappy "brute force" method.
but, higher distances still get accelerated if you use a reasonable 
'maxExpansions' to FuzzyQuery... the default is quite bad (1024).


On Fri, Jul 23, 2010 at 10:59 AM, 
<[email protected]<mailto:[email protected]>> wrote:
Thanks!

FuzzyQuery will do for my purposes, for the interim.  But I suspect that 
FuzzyQuery could be made a lot more efficient if it were rebuilt on top of 
Automaton, no?  I understand that this would be a trunk project.

Karl


From: ext Uwe Schindler [mailto:[email protected]<mailto:[email protected]>]
Sent: Friday, July 23, 2010 10:45 AM

To: [email protected]<mailto:[email protected]>
Subject: RE: LevenshteinFilter proposal

Automaton is only in Lucene/Solr Trunk. To get a filter out of FuzzyQuery, use 
MultiTermQueryWrapperFilter(new FuzzyQuery(…))

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de<http://www.thetaphi.de/>
eMail: [email protected]<mailto:[email protected]>

From: [email protected]<mailto:[email protected]> 
[mailto:[email protected]<mailto:[email protected]>]
Sent: Friday, July 23, 2010 4:25 PM
To: [email protected]<mailto:[email protected]>
Subject: LevenshteinFilter proposal

Hi Folks,

I’m very interested in using (or developing!) a Levenshtein Filter within the 
family of Solr Filter objects. I don’t see such a class today anywhere. I see 
how the AutomatonQuery object would permit such a thing to be built, but to 
date I don’t know of anyone who has built one. Do you?  If not, I’m willing to 
give it a whirl.  Also, AutomatonQuery doesn’t seem to come up when I look for 
it in the javadocs for Lucene – can you point me in the correct direction?
Thanks!
Karl





--
Robert Muir
[email protected]<mailto:[email protected]>

Reply via email to