Re: IOException question

2006-11-16 Thread Antony Bowesman
Hi Mike, Do you also have a reader open against this index? If yes, then this is totally normal on Windows. A reader holds open the segments cfs files that it is using, so when the writer tries to delete them (because they were merged) the delete fails and Lucene will try again later. Aha,

Re: IOException question

2006-11-16 Thread Michael McCandless
Antony Bowesman wrote: Hi, I have the IndexWriter.infoStream set to System.out and get the following merging segments _4m (2 docs) _4n (1 docs) into _4o (3 docs) java.io.IOException: Cannot delete PathToDB\_29.cfs; Will re-try later. java.io.IOException: Cannot delete PathToDB\_29.cfs; Will re-

IOException question

2006-11-16 Thread Antony Bowesman
Hi, I have the IndexWriter.infoStream set to System.out and get the following merging segments _4m (2 docs) _4n (1 docs) into _4o (3 docs) java.io.IOException: Cannot delete PathToDB\_29.cfs; Will re-try later. java.io.IOException: Cannot delete PathToDB\_29.cfs; Will re-try later. Is this norm

Re: Indexing Performance issue

2006-11-16 Thread Antony Bowesman
spinergywmy wrote: Hi, I having this indexing the pdf file performance issue. It took me more than 10 sec to index a pdf file about 200kb. Is it because I only have a segment file? How can I make the indexing performance better? If you're using the log4j PDFBox jar file, you must make sure

Re: Another problem with the QueryParser

2006-11-16 Thread Chris Hostetter
:Find all documents in the index with the word "Table", and then from that : list, remove any docs that don't have "Chair"? correct. : If the latter is true, then how are these two queries different: : "Table OR NOT Chair" : "Table AND NOT Chair" They aren't. Those queries are fu

Re: Another problem with the QueryParser

2006-11-16 Thread Lucifer Hammer
Hi Chris, Thanks for your response. If it's alright with you, I'd like to spell it out, just so you can confirm that I'm understanding what you're saying: Here's the Query: "Table OR NOT Chair" I thought the results I should expect are: All documents in the index that have the word "Table"

Re: Urgent : Specific string query search

2006-11-16 Thread Chris Hostetter
: As far as I know, WhiteSpaceAnalyzer does support the field:text format. Try More specificaly, the bit of syntax field:value ...or more generally field:"some longer value" is parsed by QueryParser regardless of what Analyzer you use. QP is what extracts the field name and the colon, and then

RE: newbie scoring question

2006-11-16 Thread Chris Hostetter
: So you can make a specific field's relevancy for a given term higher : compared to another term using something like : : Id_field:someterm^2 || blob_field:someterm : : Im kind of a newb myself but I think this should work for you. Indeed that is the way i would recommend solving this problem (wh

Re: Urgent : Specific string query search

2006-11-16 Thread Erick Erickson
First, I'd ask why you need to search on punctuation. This often leads you into an overly-complex place. My first suggestion is that you re-think whether that's desirable. If it's not, just use different analyzer at index and search time. If it is desirable, then you probably need to make your ow

Urgent : Specific string query search

2006-11-16 Thread Krishnendra Nandi
Hi all, I have a specific string query like "Jakarta:" How do i get that? I am using standardAnalyzer and it seems as if it is stripping ":" and it is simply searching for "Jakarta". I have used WhiteSpaceAnalyzer also and its working fine for ":" but I think it has some other limitations. The

Re: Best approach for exact Prefix Field Query

2006-11-16 Thread Martin Braun
hi Erik, > "action and" is likely not a single Term, so you'll want to create a > SpanNearQuery of those individual terms (that match the way they were > when analyzed and indexed, mind you) and use a SpanNearQuery inside a > SpanFirstQuery. Make sense? Yes, it works (see below)! ... but with my

RE: newbie scoring question

2006-11-16 Thread Phil Rosen
You can set the boost on specific terms within your query, I believe the syntax is: Id_field:someterm^2 So you can make a specific field's relevancy for a given term higher compared to another term using something like Id_field:someterm^2 || blob_field:someterm Im kind of a newb myself but I