RE: Code search

2005-05-26 Thread Santanu Dutta
Hi Anurag, you can go through this link: http://today.java.net/pub/a/today/2003/07/30/LuceneIntro.html -Original Message- From: Singh, Anurag (Research) [mailto:[EMAIL PROTECTED] Sent: Friday, May 27, 2005 10:43 AM To: java-user@lucene.apache.org Subject: Code search Hi all, I am new to

Code search

2005-05-26 Thread Singh, Anurag (Research)
Hi all, I am new to the Lucene project and would like to get some information: 1) Can we use Lucene as a search engine for a code repository? 2) If yes, how should the code component cataloging be done so as to have effective and quick search? -regards anurag --

Re: Deleting duplicates from a Lucene index

2005-05-26 Thread Chris Hostetter
: The two symptoms of this not behaving as expected are : 1) ir.docFreq(t) does not always equal the value returned by : ir.termDocs(t).read(docs, freqs) (see below for actual syntax used). : 2) Even after optimizing, I still have the same dupes in my index. As far as #1, I don't know much about

Deleting duplicates from a Lucene index

2005-05-26 Thread Dan Climan
I noticed in my lucene index that I had mistakenly indexed some documents multiple times. I wrote the following piece of code to find and eliminate the duplicates, but it did not behave as expected. Background: Every document has an ItemId field that was indexed as a keyword. Two or more documents
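
The preview above is cut off before the poster's code, so the following is only a minimal sketch of the general idea (keep the first document for each ItemId, mark the rest for deletion). It uses plain Java collections rather than the Lucene API; the class name, method name, and the list-of-ItemIds input are hypothetical, and reading the ItemId values from the index and issuing the deletes are left out.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class DuplicateFinder {
    /**
     * Hypothetical helper: itemIds.get(i) is assumed to hold the ItemId
     * of document number i. Returns the document numbers whose ItemId
     * already appeared at a lower document number.
     */
    public static List<Integer> docsToDelete(List<String> itemIds) {
        Set<String> seen = new HashSet<>();
        List<Integer> duplicates = new ArrayList<>();
        for (int doc = 0; doc < itemIds.size(); doc++) {
            // Set.add returns false when the ItemId was seen before,
            // so this document is a later copy and can be removed.
            if (!seen.add(itemIds.get(doc))) {
                duplicates.add(doc);
            }
        }
        return duplicates;
    }
}
```

Keeping the first occurrence is an arbitrary choice made for this sketch; the original message does not say which copy should survive.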

Re: Negative Field Boost

2005-05-26 Thread Volodymyr Bychkoviak
Here you can play with the boost: (+includes:(red frogs) -excludes:(red frogs))^2.0 excludes:(red frogs)^0.5 You can also play with the numbers to achieve the best results. This is the first solution I've thought of (I mean there may be more efficient solutions). regards, Volodymyr Bychkoviak Ryan Skow w

Re: Negative Field Boost

2005-05-26 Thread Ryan Skow
That works very well - thank you for your quick reply. As a followup question, what if the desired effect was to force hits containing terms in the 'excludes' field to appear below other hits? Example using the same setup as before: Query: red frogs Result: both documents would be returned, b

Re: Negative Field Boost

2005-05-26 Thread Volodymyr Bychkoviak
The query string should look like: "+includes:(red frogs) -excludes:(red frogs)" You can play with MultiFieldQueryParser a bit. regards, Volodymyr Bychkoviak Ryan Skow wrote: Here is the logical structure of the document I'm working with: The 'Document' has two fields: 'includes' - List of te
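
As a rough illustration of the query string suggested in the reply above, here is a minimal Java sketch that assembles it from the user's search terms. The class and method names are hypothetical, the sketch does not escape query-syntax characters in the terms, and handing the resulting string to a Lucene query parser is left out.

```java
public class FieldQueryStrings {
    /**
     * Hypothetical helper: builds a query string that requires the terms
     * in the 'includes' field and prohibits them in the 'excludes' field.
     */
    public static String requireAndProhibit(String terms) {
        return "+includes:(" + terms + ") -excludes:(" + terms + ")";
    }

    public static void main(String[] args) {
        // Same example terms as in the thread.
        System.out.println(requireAndProhibit("red frogs"));
    }
}
```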

Negative Field Boost

2005-05-26 Thread Ryan Skow
Here is the logical structure of the document I'm working with: The 'Document' has two fields: 'includes' - List of terms that provide positive boost 'excludes' - List of terms that provide negative boost Here is a usage scenario: DocumentA includes: red green blue exclud

Re: using the CachingWrapperFilter

2005-05-26 Thread Erik Hatcher
On May 26, 2005, at 5:02 AM, M. Mokotov wrote: Can someone please explain to me how to use the CachingWrapperFilter? There is a bit of it demonstrated in Lucene in Action. Here are some snippets that (sort of) describe it: http://www.lucenebook.com/search?query=CachingWrapperFilter Yo

using the CachingWrapperFilter

2005-05-26 Thread M. Mokotov
Hi, Can someone please explain to me how to use the CachingWrapperFilter? I see that it's built as a decorator (it takes another filter in its constructor and decorates it with caching), but I don't see any basic filter to be the 'root'. In the tests I saw there is a MockFilter, but I couldn