Re: Problem with creating IndexReaders and understanding their implementation use

2009-01-22 Thread Michael McCandless
IndexReader.open opens the latest segments_N file. If that file references only 1 segment, a SegmentReader is returned, else a MultiSegmentReader. I'm confused why you see the SegmentReader impl giving too few results -- that should only be returned if your index legitimately has only 1

Re: Combining results of multiple indexes

2009-01-22 Thread Preetham Kajekar
Hi, Just thought of sharing some more progress I made on this. This time I created multiple (2) indexWriter writing different documents (based on if it is odd or even based on an id - not doc-id) to different indexes and the performance seems to scale up based on the number of threads (and the

Re: Using analyzer while constructing Lucene queries

2009-01-22 Thread Chris Hostetter
: 2> Construct your queries by yourself, by using TermQuery, : PhraseQuery etc. directly. There are no Analyzers used when : choosing this option. You have to know what the effects of the : Analyzer used during indexing had on the field you're searching : and be sure you're doing something compati

Problem with creating IndexReaders and understanding their implementation use

2009-01-22 Thread Micah Jaffe
My environment: Lucene 2.3.2 on Linux, Java 1.6.0_07-b06, running under Tomcat 5.5.26 What I'm trying to do seems pretty simple, but is causing a headache which I can't sleuth out. When I try to build an IndexSearcher using an IndexReader, the IndexReader.open( String_to_index_dir ) call

Re: Nightly source builds of Lucene ..

2009-01-22 Thread Kay Kay
Yes - I was referring to the nightly builds of Lucene. For eg- this page - http://lucene.apache.org/java/docs/index.html (search for "Nightly Source Builds" ) contains the link to http://people.apache.org/builds/lucene/java/nightly/ . It might be worth updating this link. For now - as an

Re: contrib Benchmark enwiki problem

2009-01-22 Thread Grant Ingersoll
There is a little funkiness in the ant script there in that if the original file exists in temp, but hasn't been processed in work, then it doesn't do the proper thing. The workaround is to do the second step to get into work by hand. I believe there is a JIRA issue on it. Also, I highly

Re: check if document is deleted using indexwriter

2009-01-22 Thread Marc Sturlese
I am using IndexWriter updatedocument. If the doc has a duplicate it will delete the old one and insert de new one. If it has no duplicates in the index, it will just insert. I do it for 1000 every time and would like to count how many times it deletes, to be able to count duplicates... Michael M

Re: check if document is deleted using indexwriter

2009-01-22 Thread Michael McCandless
IndexWriter currently has no way to check if a doc is deleted; you'd have to use IndexReader... The deleteDocuments methods in IndexWriter do not return a count (unlike IndexReader) because the request is simply buffered and then processed in bulk when the deletes are flush. So at the ti

Re: contrib Benchmark enwiki problem

2009-01-22 Thread Michael McCandless
An "alg" is simply a file (file.alg) that the benchmarking code runs. You run it something like this: ant run-task -Dtask.alg=/path/to/file.alg -Dtask.mem=1024M For docs... there's the package.html in contrib/benchmark. LIA 2 (only via MEAP right now) also covers benchmark's alg syntax

Re: indexing database

2009-01-22 Thread cemsoft
thnx for your reply, can you give me an example, that opens only one connection while paginating resultset (maybe because of alotof connections, it does not work ...) regards cem chrislusf wrote: > > This is not a lucene question, but a jdbc question. > The code is not releasing the jdbc

Re: Lucene app to run as daemon service in windows and linux

2009-01-22 Thread Koji Sekiguchi
Ganesh, There is Commons Daemon project: http://commons.apache.org/daemon/ I'm not sure it works for you (RMI on Windows) though, but please check. Koji Ganesh wrote: Thanks. http://wrapper.tanukisoftware.org/doc/english/download.jsp is free to use in open source projects. It requires lice