Re: Arbitrary String to String Similarity Score

2008-06-20 Thread Otis Gospodnetic
ou are after. > > > > Otis > > -- > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > > > - Original Message > >> From: Sangrish > >> To: java-user@lucene.apache.org > >> Sent: Friday, June 20,

Re: Arbitrary String to String Similarity Score

2008-06-20 Thread Sangrish
-user@lucene.apache.org >> Sent: Friday, June 20, 2008 12:20:02 AM >> Subject: Re: Arbitrary String to String Similarity Score >> >> >> Given 2 text documents I want to quantitatively find, how similar they >> are, >> with respect to each other. Say, I wan

Re: Arbitrary String to String Similarity Score

2008-06-19 Thread Otis Gospodnetic
you are after. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Sangrish <[EMAIL PROTECTED]> > To: java-user@lucene.apache.org > Sent: Friday, June 20, 2008 12:20:02 AM > Subject: Re: Arbitrary String to String Similarity Sco

Re: Arbitrary String to String Similarity Score

2008-06-19 Thread Sangrish
Given 2 text documents I want to quantitatively find, how similar they are, with respect to each other. Say, I want to find Cosine Similarity score between any two given documents. I am trying to use Lucene for it (is it good for this purpose?) This use case is different from querying against a s

Re: Arbitrary String to String Similarity Score

2008-06-19 Thread Grant Ingersoll
You might also have a look at the MemoryIndex. Question, though, is what are you hoping to gain from doing a Query against a single String? Are you doing a FuzzyQuery? You might look at the SecondString project on SourceForge for doing string comparisons. I guess I am a bit confused by y