Problem when searching ..

2005-04-15 Thread Mufaddal Khumri
Hi, I am creating an index of my data that's persisted by Hibernate using Lucene. I am running my indexer on a huge data set. My indexing takes 1312805ms. At the end of which I get a 26,266KB directory. I can view the contents of my index directory using Luke. When I copy my webapp under Tomcat

Re: Problem when searching ..

2005-04-15 Thread Brisbart Franck
Did you have a look at this thread ?? http://mail-archives.eu.apache.org/mod_mbox/jakarta-lucene-dev/200401.mbox/[EMAIL PROTECTED] It may help you Franck Mufaddal Khumri wrote: Hi, I am creating an index of my data that's persisted by Hibernate using Lucene. I am running my indexer on a huge data

Re: Atomic updates on Lucene index document?

2005-04-15 Thread Jens Kraemer
On Thu, Apr 14, 2005 at 09:53:52PM -0400, Terence Lai wrote: > Hi all, > > As far as I know, I don't find any Lucene API for updating an index > document. What I have to do is to delete the existing index document > and insert a new one. However, this is going to be 2 separate > operations (delete

LUCENE IS NOT ANY OTHER DB

2005-04-15 Thread Karthik N S
  Hi guys Apologies.. It really pains to after a hards day's work that Lucene is turning to be similar to any other DB product. The form is full of Db similar requirements on Lucene to perform. If one requires supports similar to DB ,then can use Any of the free avaliable D

Fields with same name boosting

2005-04-15 Thread Peter Veentjer - Anchor Men
I have question about field boosting. If I have 2 (or more) fields with the same fieldname in a single document, and I boost one of those, than only that one will be boosted? Or will all fields with the same name be boosted? I guess only one field is boosted, but I want to be certain. Met vri

RE: Update performance/indexwriter.delete()?

2005-04-15 Thread Roy Klein
Hi Hoss, Thanks very much for your comments. While batch processing might work in some cases, I believe it's not "safe" in mine. Here's the scenario that I can't guarantee won't happen: There might be 3 transactions in a very short time span (for example, 1 second), here's what they are: 1) up

Re: Fields with same name boosting

2005-04-15 Thread Paul Libbrecht
Le 15 avr. 05, à 14:44, Peter Veentjer - Anchor Men a écrit : I have question about field boosting. If I have 2 (or more) fields with the same fieldname in a single document, and I boost one of those, than only that one will be boosted? Or will all fields with the same name be boosted? I guess only

Re: Update performance/indexwriter.delete()?

2005-04-15 Thread John Haxby
Roy Klein wrote: Here's the scenario that I can't guarantee won't happen: There might be 3 transactions in a very short time span (for example, 1 second), here's what they are: 1) update doc1 (DEL doc1, ADD doc1) 2) update doc2 (DEL doc2, ADD doc2) 3) delete doc1 If I process these in order, then a

RE: Update performance/indexwriter.delete()?

2005-04-15 Thread Peter Veentjer - Anchor Men
I have done something similar. I have added a creational date to my Jobs. Some jobs can take some time (analyzing) and all indexwrite jobs are queued for a threadpool to be processed. After they are analyzed, they are added in a queue to be written to the index. If 2 (or more) indexwrite jobs for t

Re: Fields with same name boosting

2005-04-15 Thread Doug Cutting
Peter Veentjer - Anchor Men wrote: I have question about field boosting. If I have 2 (or more) fields with the same fieldname in a single document, and I boost one of those, than only that one will be boosted? Or will all fields with the same name be boosted? I guess only one field is boosted, bu

strange results using QueryFilter

2005-04-15 Thread Omar Didi
Hi guys, I have to do a search where the user will enter a query and then i have to get the hit's length for that term plus get the hit's length for that term such that the url ends with com or net or . I used QueryFilter, it works fine when the query is only one word but when the query is

Re: Strange sort error

2005-04-15 Thread Daniel Naber
On Friday 15 April 2005 01:46, Chris Hostetter wrote: > Or worse, a query that does work today, stops working tomorow because > one doc was removed. You're right, that is not acceptable. I've created a bug report about the original problem: http://issues.apache.org/bugzilla/show_bug.cgi?id=34477

RE: Update performance/indexwriter.delete()?

2005-04-15 Thread Chris Hostetter
: The first thing that comes to mind is that I could look at the transactions : in the batch queue, and based on the docid, I could make sure to delete all : the matching ADD docid's in the batch queue whenever a matching DEL comes : in. However, that will only work if I know the docid's. But,

Re: strange results using QueryFilter

2005-04-15 Thread Erik Hatcher
On Apr 15, 2005, at 4:52 PM, Omar Didi wrote: Hi guys, I have to do a search where the user will enter a query and then i have to get the hit's length for that term plus get the hit's length for that term such that the url ends with com or net or . I used QueryFilter, it works fine when the

RE: strange results using QueryFilter

2005-04-15 Thread Omar Didi
Hi Erik, here is a sample code that I hope is enough to find out what s wrong. thanks. Query domainQuery = new TermQuery( new Term( domain, brandName ) ); Filter domainFilter = new QueryFilter( domainQuery ); int total_result = searchIndex.getHits( "content", "bank of america" ).length(); String

token type question

2005-04-15 Thread ethandev
Hi, I am working on a program to index/search chemical element/compound. Say I write an analyzer to filter out chemical terms, such as H2O. I noticed that I can specify a tocken's type. Can I construct a token as new Token ("H2", start, end, "chem"); My questions is How do I search all the to

Re: token type question

2005-04-15 Thread Pierrick Brihaye
[EMAIL PROTECTED] a écrit : I am working on a program to index/search chemical element/compound. Say I write an analyzer to filter out chemical terms, such as H2O. I noticed that I can specify a tocken's type. Can I construct a token as new Token ("H2", start, end, "chem"); My questions is How do