On 20 Aug 2007 at 05:19, Lokeya wrote:
Grant Ingersoll-6 wrote:
On Aug 16, 2007, at 2:20 PM, Lokeya wrote:
I want to find out the document content similarity
A common way of doing this is by calculating the cosine of the angle
between the two vectors.
I can use the getTermFreqVector() on In
Hi,
Thanks for your reply.
I can use getTermFreqVector() on IndexReader to get the vectors. But I am
wondering what API should be used to compute the similarity between
2 such vectors, giving a score (doc-doc similarity, in essence).
Thanks.
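For the doc-doc score asked about here, a minimal sketch of the cosine calculation may help. It assumes the term frequencies have already been copied out of the term vectors into plain maps of term to count; CosineSketch, cosine and norm are hypothetical names for illustration, not part of the Lucene API.

```java
import java.util.Map;

// Hypothetical helper, not a Lucene class: cosine similarity between two
// term-frequency vectors represented as term -> count maps.
final class CosineSketch {

    // Returns the cosine of the angle between the two vectors,
    // or 0.0 if either vector is empty (zero length).
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0.0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            Integer other = b.get(e.getKey());
            if (other != null) {
                // Only terms present in both documents contribute.
                dot += (double) e.getValue() * other;
            }
        }
        double normA = norm(a);
        double normB = norm(b);
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / (normA * normB);
    }

    // Euclidean length of a term-frequency vector.
    static double norm(Map<String, Integer> v) {
        double sum = 0.0;
        for (int f : v.values()) {
            sum += (double) f * f;
        }
        return Math.sqrt(sum);
    }

    public static void main(String[] args) {
        Map<String, Integer> doc1 = Map.of("lucene", 2, "index", 1);
        Map<String, Integer> doc2 = Map.of("lucene", 1, "search", 1);
        System.out.println(cosine(doc1, doc2));
    }
}
```

This uses raw term counts; weighting the counts (for example by inverse document frequency) before taking the cosine is a common variation, but that is a design choice beyond what this thread states.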
Grant Ingersoll-6 wrote:
>
> Hi,
>
I want to set documents in my IndexReader as deleted, but I will
never commit these deletions. Sort of a filter on a reader rather
than on a searcher, and no write-locks.
Can I do that out of the box?
Perhaps I can pass down an IndexDeletionPolicy to my IndexWriter that
ignores deletions fr
Chris:
I didn't understand how your first solution would work,
so I tried it. The terms I extracted from the rewritten
query were just the four raw terms, i.e.
field1:query1
field1:query3
field2:query2
field2:query4.
So iterating over and deleting them term by term wouldn't
preserve the sense o
Mohammad:
See below
On 8/19/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
>
> Erick,
> I am using WhitespaceAnalyzer, and yes, it's mixed case. In my application
> I never change the entered information to lowercase, for various reasons,
I've found it way easier to index things t
Have a look at the DateTools utility class. Also, the Wiki has some
HOWTOs on Dates: http://wiki.apache.org/lucene-java/HowTo
Search this archive for date handling, plus I believe the Lucene in
Action book covers dates as well, although it might be a bit dated.
Lucene also comes with sort
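To illustrate why date strings of this kind are useful in an index, here is a rough sketch that does not call Lucene at all. It only shows the underlying idea: a fixed-width, most-significant-first date string (year, month, day) sorts lexicographically in chronological order, which as far as I know is the kind of string DateTools produces. SortableDateSketch and toSortableDay are hypothetical names, and the day resolution and UTC zone are assumptions.

```java
import java.time.Instant;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;

// Hypothetical helper, not a Lucene class: formats a timestamp as a
// fixed-width string whose text ordering matches chronological ordering.
final class SortableDateSketch {

    // Year, month, day with zero padding, evaluated in UTC (an assumption).
    private static final DateTimeFormatter DAY =
            DateTimeFormatter.ofPattern("uuuuMMdd").withZone(ZoneOffset.UTC);

    static String toSortableDay(Instant timestamp) {
        return DAY.format(timestamp);
    }

    public static void main(String[] args) {
        String earlier = toSortableDay(Instant.parse("2007-08-16T14:20:00Z"));
        String later = toSortableDay(Instant.parse("2007-08-20T05:19:00Z"));
        // Plain string comparison agrees with date order.
        System.out.println(earlier + " < " + later + " : "
                + (earlier.compareTo(later) < 0));
    }
}
```

A string like this could be stored in an untokenized field so that range queries and sorting treat it as a single term; for the actual field setup, check the DateTools documentation and the Wiki HOWTO mentioned above.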
Greatly appreciated.
It works perfectly.
On 8/19/07, Chris Hostetter <[EMAIL PROTECTED]> wrote:
>
> : Is there a way to delete the results from a query or a filter, rather
> : than documents specified by Term? I have seen some explanations here but
> : I do not know how to do it:
> :
> :
> http://www.n
I am using the most basic Lucene functionality, but against a
database. For example, I may have a message forum and will index the
message text and message subject from the database. But I haven't
figured out a way to index the date. Ideally, when I search I should
be able to return the mo