Re: lucene and database searching, keeping score

2005-12-02 Thread Jeff Rodenburg
George - There are a number of SQL Server specific ways you can do this. Email me off-list as the solution is not relevant to Lucene. -- j On 12/2/05, George Abraham <[EMAIL PROTECTED]> wrote: > > All, > I have created a Lucene index from data in a SQL Server db. When I conduct > a > Lucene sea

RE: Wildcard

2005-12-02 Thread Pasha Bizhan
Hi, > From: John Powers [mailto:[EMAIL PROTECTED] > Lucene only lets you use a wildcard after a term, not before, correct? > What work arounds are there for that? RegexQuery? http://svn.apache.org/repos/asf/lucene/java/trunk/src/java/org/apache/lucene /search/regex/ Also: http://www.mail-arch

Re: lucene and database searching, keeping score

2005-12-02 Thread John Powers
Instead of just putting the keys in the index, can you put all the data you might be getting out to display to the user, or whatever the final thing is? That's what we do.Search on the id as you are, and then the hit has documents that are loaded with all your final information instead of just

Re: Wildcard

2005-12-02 Thread Marc Hadfield
The standard way to do this is to additionally index the reverse of all strings/tokens, potentially in a different field "reverse:", ie index forward:abcd as well as reverse:dcba. Then in queries of the form "*cd", reverse the query to "dc*" so that you end up with "reverse:dc*" in your

Re: Wildcard

2005-12-02 Thread Michael D. Curtin
John Powers wrote: Hello, Lucene only lets you use a wildcard after a term, not before, correct? What work arounds are there for that? If I have an item 108585-123 And another 332323-123 How can I look for all the -123 family of items? Classic indexing problem. Here are a couple simple ideas

Re: Lucene performance bottlenecks

2005-12-02 Thread Doug Cutting
Andrzej Bialecki wrote: For a simple TermQuery, if the DF(term) is above 10%, the response time from IndexSearcher.search() is around 400ms (repeatable, after warm-up). For such complex phrase queries the response time is around 1 sec or more (again, after warm-up). Are you specifying -server

lucene and database searching, keeping score

2005-12-02 Thread George Abraham
All, I have created a Lucene index from data in a SQL Server db. When I conduct a Lucene search, I get back in the hits the primary key (WorkID) and the scores associated with the hits. Then using the list of WorkIDs, I query the database for the other associated information of the assets associate

Wildcard

2005-12-02 Thread John Powers
Hello, Lucene only lets you use a wildcard after a term, not before, correct? What work arounds are there for that? If I have an item 108585-123 And another 332323-123 How can I look for all the -123 family of items? John Powers --

Re: Lucene performance bottlenecks

2005-12-02 Thread Paul Elschot
es to read as much of the > index as it can to memory in order to speed up the access, but > apparently that's not the case. The heap consumption was always in the > order of 100-200MB, no matter how large heap I set (and I tried values > between 1-4GB). > > For those inter

Re: how to control terms to be highlighted?

2005-12-02 Thread mark harwood
Hi Harini, I updated QueryTermsExtractor in Subversion last night to support your requirement. The JUnit test is also updated with a field-specific example. Cheers, Mark --- Harini Raghavan <[EMAIL PROTECTED]> wrote: > Hi Chris, > > Can we pass a different query object for searching > and a

Lucene performance bottlenecks

2005-12-02 Thread Andrzej Bialecki
onsumption was always in the order of 100-200MB, no matter how large heap I set (and I tried values between 1-4GB). For those interested in profiler info, look here: http://www.getopt.org/lucene/20051202/ Here's an example of elapsed times [ms] for IndexSearcher.search, and for getting

Re: how to control terms to be highlighted?

2005-12-02 Thread Harini Raghavan
Hi Chris, Can we pass a different query object for searching and a different one to the highlighter? I am not sure of that. In any case, based on Mark's suggestion I modified the QueryTermsExtractor class and filtered the query terms by the fieldName. Attached is the modified file. Thanks,