The Lucene MoreLikeThis tool in lucene/contrib/similar will do one
variant of what you want.
You can do this particular test in Solr- you'll find it much much
easier to put together.
For other text similarities, you'll have to code them directly.
Lance
On Sat, Nov 13, 2010 at 7:07 AM, Shashi Kan
There are multiple measures of similarity for documents: Cosine similarity
is a frequently used one.
On Sat, Nov 13, 2010 at 9:23 AM, Ciprian URSU wrote:
> Hi Guys,
>
>I just find out about Lucene; after reading the main things on wiki
> it seems to be a great tool, but I still didn't f
Hi Guys,
I just find out about Lucene; after reading the main things on wiki
it seems to be a great tool, but I still didn't find out how can I use it
for my needs. What I want to do is a small tool which has some documents
(mainly text) inside and then when I have a new document as input,