Re: Multisearcher will maintain index order sorting?

2008-10-22 Thread Paul Smith
On 23/10/2008, at 4:20 PM, Ganesh wrote: My Index DB is having 10 million records and it will grow to 30 million. Currently I am using millisecond timestamp and the RAM cosumption is more. I will change the resolution to minute. I am using 2 searcher objects refreshing each other every min

Re: Multisearcher will maintain index order sorting?

2008-10-22 Thread Ganesh
My Index DB is having 10 million records and it will grow to 30 million. Currently I am using millisecond timestamp and the RAM cosumption is more. I will change the resolution to minute. I am using 2 searcher objects refreshing each other every minute. When i do a warmup query with sort of tim

Re: How to use regexQuery along with fuzzy logic capabilities

2008-10-22 Thread Hasan Diwan
I seem to recall running the SimpleQueryParser first. If that throws an Exception, I then ran it with the RegexQueryParser with a reduced score. Hth Sent via BlackBerry by AT&T -Original Message- From: "Agrawal, Aashish \(IT\)" <[EMAIL PROTECTED]> Date: Thu, 23 Oct 2008 12:48:46 To: Su

How to use regexQuery along with fuzzy logic capabilities

2008-10-22 Thread Agrawal, Aashish (IT)
Hi, I want to use lucene for a simple search engine. If I use the code like this, QueryParser parser = new QueryParser(field, analyzer); Query query = parser.parse(line); searcher.search(query) above code doesn't give me regular expression search power, while at the other hand if I use the bel

Re: Newbie Question - Lucene Sorting NOT Ignoring NULL values

2008-10-22 Thread Reetha Hariharan
Hi Yonik: A simple RangeFilter did the trick > public class NotNullRangeFilter extends org.apache.lucene.search.Filter { >private RangeFilter rangeFilter; >public BitSet bits(IndexReader reader) throws IOException { >rangeFilter = RangeFilter.More("lastUpdatedDate"/*Field to sort*

Re: Query Expansion Module for Lucene based on BM25 ranking function

2008-10-22 Thread José Ramón Perez Aguera
Hi Grant, Our query expansion approach is quite simple, we apply pseudo- relevance feedback techniques, where a number of top retrieved documents are used to extract the terms candidates to expand the original query. We have used TermPositions in query time to extract the term statistics n

Re: Query Expansion Module for Lucene based on BM25 ranking function

2008-10-22 Thread Grant Ingersoll
Hi José, Can you explain your approach to implementing? I'm curious how you incorporated in the avg. doc length. Also, have you followed any of the flexible indexing discussions? Finally, what's the license on this code? Thanks, Grant On Oct 21, 2008, at 10:14 AM, José Ramón Pérez Agüer