Re: Document Similarities lucene(particularly using doc id's)

2007-08-19 Thread Karl Wettin
20 aug 2007 kl. 05.19 skrev Lokeya: Grant Ingersoll-6 wrote: On Aug 16, 2007, at 2:20 PM, Lokeya wrote: I want to find out the document content similarity A common way of doing this is by calculating the cosine of the angle between the two vectors. I can use the getTermFreqVector() on In

Re: Document Similarities lucene(particularly using doc id's)

2007-08-19 Thread Lokeya
Hi, Thanks for your reply. I can use the getTermFreqVector() on Index Reader and get it. But I am wondering whats the API which has to be used to find the similarity between 2 such vectors which would give a score (doc-doc similairty in essence). Thanks. Grant Ingersoll-6 wrote: > > Hi, >

Lockless read-only deletions in IndexReader?

2007-08-19 Thread karl wettin
I want to set documents in my IndexReader as deleted, but I will never commit these deletions. Sort of a filter on a reader rather than on a searcher, and no write-locks. Can I do that out of the box? Perhaps I can pass down a IndexDeletionPolicy to my IndexWriter that ignores deletions fr

Re: Deleting the result from a query or a filter and not a documents specified by Term

2007-08-19 Thread Erick Erickson
Chris: I didn't understand how your first solution would work, so I tried it. The terms I extracted from the rewritten query were just the four raw terms, i.e. field1:query1 field1:query3 field2:query2 field2:query4. So iterating over and deleting them term by term wouldn't preserve the sense o

Re: query question

2007-08-19 Thread Erick Erickson
Mohammad: See below On 8/19/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote: > > Erick, > I am using WhitespaceAnalyzer, and yes it's mixed case, in my application > I > never change the entered information to lowercase because of some reasons, I've found it waay easier to index things t

Re: Strategy for date based searching and indexing

2007-08-19 Thread Grant Ingersoll
Have a look at the DateTools utility class. Also, the Wiki has some HOWTOs on Dates: http://wiki.apache.org/lucene-java/HowTo Search this archive for Date handling, plus I believe the Lucene In Action book covers dates as well, although it might be a bit dated. Lucene also comes with sort

Re: Deleting the result from a query or a filter and not a documents specified by Term

2007-08-19 Thread Abu Abdulla alhanbali
Greatly appreciated. It works perfect. On 8/19/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > : Is there a way to delete the results from a query or a filter and not > : documents specified by Term. I have seen some explanations here but i do > not > : know how to do it: > : > : > http://www.n

Strategy for date based searching and indexing

2007-08-19 Thread Berlin Brown
I am using the most basic lucene functionality but using against a database. For example, I may have a message forum and will index that message text and message subject from the database. But I haven't figured out a way to index the date. ideally, when I search I should be able to return the mo