Group of documents.

2007-10-05 Thread Raghu Ram
Hi, We have an application in which we want to index feeds. Each feed is a collection of articles and some other metadata. The problem is that sometimes we want to search for feeds and sometimes for articles. As far as I know lucene doesn't provide any abstraction for grouping its documents.

Re: Help with Lucene Indexer crash recovery

2007-10-05 Thread Karl Wettin
5 okt 2007 kl. 21.50 skrev vivek sar: Once the writer.addIndexes is done I call writer.optimize() No biggie, but IndexWriter.addIndexes() will automatically optimize, so that is one line of code you can get rid of. it may take hours to re-index /Perhaps/ using IndexWriter.addIndexesNoO

Re: Help with Lucene Indexer crash recovery

2007-10-05 Thread Chris Hostetter
: Once in a while we kill the running application using "kill -9". I To quote a great man, who frequently quotes another great man: "Well there's your problem!" stop using "kill -9" ... i'll say it again because it's important, and i'm even going to violate etiquite and use all caps because it

Re: Querying the Query object

2007-10-05 Thread Chris Hostetter
: Is there any way to find out if an instance of Query has any terms within it? : I have a custom parser (QueryParser does not do everything I need) and it : somtimes creates empty BooleanQuerys. (This happens as a side effect of so you're problem is not "does a Query have any terms in it" it's "

Re: Generalized proximity query performance

2007-10-05 Thread Kyle Maxwell
> > Hmmm.. can you give some more concrete examples of what you mean by this? > both in terms of the use case you are trying to satisfy, and in terms of > how your current code works ... you don't have to post code or give away > trade secrets, just describe it as a black box (ie: what is the input

Re: Help with Lucene Indexer crash recovery

2007-10-05 Thread vivek sar
Thanks for the response Michael. Sorry, I'm using Lucene 2.2. We are using Lucene to index our database (Oracle) into documents for full-text search feature. Here is the process of indexing, 1) Have two IndexWriters which run in two different threads and write to two different directories (tempor

Re: Generalized proximity query performance

2007-10-05 Thread Mike Klaas
On 5-Oct-07, at 11:27 AM, Chris Hostetter wrote: that's what i thought first too, and it is a problem i'd eventaully like to tackle ... it was the part about "c" being in a differnet field from "a" and "b" that confused me ... i don't know what that exactly is being suggested here. I'm

Re: Generalized proximity query performance

2007-10-05 Thread Chris Hostetter
: > : would like to allow for the possibility that a and b are near each other : > in : > : one field, while c is in another field. : I understand the OP to want a PhraseQuery that has an intention (rather than : side-effect) of doing proximity-based scoring. : : "phrase query here"~1000 is the

Re: Generalized proximity query performance

2007-10-05 Thread Mike Klaas
On 5-Oct-07, at 10:54 AM, Chris Hostetter wrote: : I am using a hand rolled query of the following form (implemented with : SpanNearQuery, not a sloppy PhraseQuery): : a b c => +(a AND b AND c) OR "a b"~5 OR "b c"~5 : : The obvious solution, "a b c"~5, is not applicable for my issues, becaus

got stuck in running lucene demo

2007-10-05 Thread sumittyagi
hi, ihave just downloaded lucene 2.2.0 and tomcat, and have set my classpaths , now i am got stuck at the very first stage of indexing files, which ask me to do like this java org.apache.lucene.demo.IndexFiles {full-path-to-lucene}/src i dont know from which directory i have to execute this comma

Re: Generalized proximity query performance

2007-10-05 Thread Chris Hostetter
: I am using a hand rolled query of the following form (implemented with : SpanNearQuery, not a sloppy PhraseQuery): : a b c => +(a AND b AND c) OR "a b"~5 OR "b c"~5 : : The obvious solution, "a b c"~5, is not applicable for my issues, because I : would like to allow for the possibility that a an

Re: Querying the Query object

2007-10-05 Thread Mark Miller
What makes you think the query needs to be rewritten to extract terms? Not the case...unless you want to extract Terms from say, a Wildcard or Fuzzy query. In that case you need an IndexReader however, as the terms in the Wildcard/Fuzzy query are determined by what is in your index... I am als

Querying the Query object

2007-10-05 Thread John Byrne
Hi, Is there any way to find out if an instance of Query has any terms within it? I have a custom parser (QueryParser does not do everything I need) and it somtimes creates empty BooleanQuerys. (This happens as a side effect of recursive parsing - even if there are no terms for a query, I sti

Re: Custom Search algorithm integration

2007-10-05 Thread Grant Ingersoll
You may want to look at the FunctionQuery capability, either in Lucene, or the expanded capabilities (recently added) in Solr. -Grant On Oct 4, 2007, at 2:39 PM, [EMAIL PROTECTED] wrote: Hi, Were planning to use Lucene or Solr within our application and wanted to know if it can support the

Re: Help with Lucene Indexer crash recovery

2007-10-05 Thread Michael McCandless
"vivek sar" <[EMAIL PROTECTED]> wrote: > We are using Lucene 2.3. Do you mean Lucene 2.2? Your stack trace seems to line up with 2.2, and 2.3 isn't quite released yet. > The problem we are facing is quite a few times if our application is > stopped (killed or crash) while Indexer is doing its j