Document similarity

2006-01-20 Thread Aleksey Serba
Hello lucene people! First of all, i would like to thank all of community participants ( developers, users, Erik and Otis for "Lucene in Action" book ) for their great work. As far as i understand it, there are two most popular approches concerning document similarity: 1. "cosine metrics" using te

Re: Document similarity

2006-01-20 Thread Aleksey Serba
Yonik, Klaus, thanks for your quick response. Let me rephrase, i can't compare currently processed document with all documents in my collection using angle between documents in terms-vector space because of performance issues. As far as i can see, i can avoid unnecessary operations. At first, i ca

Re: question about custom sort method

2006-07-17 Thread Aleksey Serba
Hi! Peter, I have exactly the same situation described below. - I have DistanceComparatorSource to sort results by distance from specified spatial coordinates point. Point is different for each query. - I do not close Searcher after each query. - I get "java.lang.OutOfMemoryError: Java heap space

Re: question about custom sort method

2006-07-17 Thread Aleksey Serba
e having? Erik On Jul 17, 2006, at 9:54 AM, Aleksey Serba wrote: > Hi! > > Peter, I have exactly the same situation described below. > - I have DistanceComparatorSource to sort results by distance from > specified spatial coordinates point. Point is different for each > q

Re: question about custom sort method

2006-07-19 Thread Aleksey Serba
n) and get back to you later. Thanks On 7/17/06, Aleksey Serba <[EMAIL PROTECTED]> wrote: Erik, I think Brian have the problem with continuous caching the same sorting values, i.e. he has a few points to calc distance from. In such case you can implement equals and hashCode methods based on