Re: Lucene Web Service - Client Program

2008-05-30 Thread Otis Gospodnetic
Vatsan, are you aware of Solr? http://lucene.apache.org/solr Have a look (and look for solrj - it's a Solr Java client) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: vatsan <[EMAIL PROTECTED]> > To: java-user@lucene.apache.org > Sent: Fri

Re: distributed lucene progress

2008-05-30 Thread Matt Ronge
On May 21, 2008, at 3:19 PM, Otis Gospodnetic wrote: No, that's a separate project on SF, IIRC. I am also interested in distributed Lucene. I took a look at Hadoop's wiki and found this: http://wiki.apache.org/hadoop/DistributedLucene?highlight=%28distributed%29 which leads to many differ

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Michael McCandless
Paul, One more question: when you hit that exception, does the offending file in fact not exist (when you list the directory yourself)? And, does the exception keep happening consistently (same file missing) once that happens, or, does the same index work fine the next time you try it (i

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Michael McCandless
Paul, What is your mergeFactor set to? Can you get the exception to happen with infoStream set on the writer, and post that back? Mike Paul J. Lucas wrote: On May 30, 2008, at 3:05 AM, Michael McCandless wrote: Are you indexing only one document each time you open IndexWriter? Or do

Re: Improving search performance

2008-05-30 Thread Emmanuel Bernard
Not the IndexSearcher directly, but you can pool the underlying IndexReaders, which should give the same order of performance. You need to write a ReaderProvider implementation (see SharedReaderProvider or NotSharedReaderProvider as examples) and use hibernate.search.reader.strategy to de

Re: SpanNearQuery: how to get the "intra-span" matching positions?

2008-05-30 Thread Paul Elschot
On Friday 30 May 2008 12:10, Claudio Corsi wrote: > Hi all, > I'm querying my index with a SpanNearQuery built on top of some > SpanOrQuery. Now, the Spans object I get from the SpanNearQuery > instance returns the sequence of text spans, each defined by > their starting/ending positions. I

Lucene Web Service - Client Program

2008-05-30 Thread vatsan
I am new to web services. This is the situation: We have a document/corpus indexed by Lucene, and say it resides at C:\Lucene\Index. We are hosting Lucene as a web service (following the instructions provided at www.lucene-ws.net), where I have provided the information regarding the index director

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Mark Miller
Paul J. Lucas wrote: On May 30, 2008, at 3:05 AM, Michael McCandless wrote: Are you indexing only one document each time you open IndexWriter? Or do you open a single IndexWriter, add all documents for that directory, then close it? The latter. When the exception occurs, do you know how ma

Re: Opening an index directory inside a jar

2008-05-30 Thread Marcelo Ochoa
Hi Ravi: I am not a Lucene guru, but IMO you have to write a new Directory class which opens the jar and provides access to Lucene. Maybe a subclass of FSDirectory will work, but only for read-only behaviour. I have written a set of classes like this to implement Lucene storage inside Oracle JVM using B

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Paul J. Lucas
On May 30, 2008, at 3:05 AM, Michael McCandless wrote: Are you indexing only one document each time you open IndexWriter? Or do you open a single IndexWriter, add all documents for that directory, then close it? The latter. When the exception occurs, do you know how many simultaneous thre

Re: Opening an index directory inside a jar

2008-05-30 Thread Ravi_116
Any ideas/suggestions ?? Ravi Ravi_116 wrote: > > I get the following error trace - > > java.io.FileNotFoundException: no segments* file found in > org.apache.lucene.store.FSDirectory@/Users/projects/workspace/project_name/web/file:/Users/.m2/repository/com/mycompany/project_name/2.1.0-interna

SpanNearQuery: how to get the "intra-span" matching positions?

2008-05-30 Thread Claudio Corsi
Hi all, I'm querying my index with a SpanNearQuery built on top of some SpanOrQuery. Now, the Spans object I get from the SpanNearQuery instance returns the sequence of text spans, each defined by their starting/ending positions. I'm wondering if there is a simple way to get not only the st

RE: how to unsubscribe?

2008-05-30 Thread Daniel Freudenberger
Hey, I've already tried this, but the subject line is fixed, and I wrote a novel to convince the mail daemon that I'm not interested in spamming.. but it didn't care :) Best regards, Daniel -Original Message- From: Grant Ingersoll [mailto:[EMAIL PROTECTED] Sent: Friday, May 30, 2008 1:23

Re: how to unsubscribe?

2008-05-30 Thread Grant Ingersoll
We don't let people leave ;-) I think you may just need to put some reasonable-looking text (such as "unsubscribe") in the subject/body of the email. Not sure why you need to do that, though. Maybe one of the moderators watching can do it for you if that doesn't work. -Grant On M

how to unsubscribe?

2008-05-30 Thread Daniel Freudenberger
Hello everybody, sorry for posting to the list but I’m kinda helpless. I’m trying to unsubscribe from the mailing list but my unsubscribe email is treated as spam :) SMTP error from remote server after transfer of mail text: host mx1.eu.apache.org[192.87.106.230]: 552 spam score (5.7) exc

Re: IndexReader.reopen memory leak

2008-05-30 Thread Mark Miller
John Wang wrote: My client does not call my reader.reopen(), I have implemented a reload() method off of my reader (void reload()), and it discards the internal reader upon a reload. Due to another issue (an api issue with IndexReader, e.g. all derived implementations have to reimplement reopen b

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Michael McCandless
Jamie, The code looks better! You're not forcefully removing the write.lock nor deleting files from the index yourself, anymore, which is good. One thing I spotted is your VolumeIndex.deleteIndex method fails to synchronize on the indexLock. If I understand the code correctly, that mea
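
The point about synchronizing on the indexLock can be illustrated with a minimal sketch. Only the names VolumeIndex, deleteIndex and indexLock come from the message; the class below is a hypothetical stand-in that uses an in-memory list instead of a real Lucene index, and it is not Jamie's actual code. It shows the general pattern only: every method that touches the index takes the same lock, so a delete cannot run in the middle of an add.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the VolumeIndex class mentioned in the thread.
// The "index" here is just an in-memory list (an assumption for illustration).
public class VolumeIndexSketch {
    // Single lock object shared by every method that touches the index.
    private final Object indexLock = new Object();
    private final List<String> documents = new ArrayList<>();

    // Adding a document holds indexLock for the whole update.
    public void indexDocument(String doc) {
        synchronized (indexLock) {
            documents.add(doc);
        }
    }

    // deleteIndex takes the same lock, so it cannot interleave with indexDocument.
    public void deleteIndex() {
        synchronized (indexLock) {
            documents.clear();
        }
    }

    // Reads also go through the lock so they see a consistent state.
    public int size() {
        synchronized (indexLock) {
            return documents.size();
        }
    }
}
```

In a real application the bodies of these methods would open, use, and close the Lucene writer; the sketch only shows where the lock is held.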

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Mark Miller
I guess my test index was corrupted some other way...I cannot duplicate my results today without breaking things with two lockless Writers first. Oh well. I definitely saw it legitimately while playing with IndexReader.reopen...if I kept enough of the old IndexReaders around long enough I wo

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Jamie
Hi Michael / others, One thing I discovered is that it is quite useful to implement a JVM shutdown hook in your code to prevent the index from getting corrupted when an indexing process dies unexpectedly. For those who don't know about the shutdown hook mechanism, you do this by implementin
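
A shutdown hook of the kind described above might look roughly like the following sketch. It uses the standard Runtime.addShutdownHook call; FakeIndexWriter and registerCloseOnExit are hypothetical names invented for illustration, standing in for whatever writer object the application actually holds. This is not the code from the original message.

```java
import java.io.Closeable;
import java.io.IOException;
import java.util.concurrent.atomic.AtomicBoolean;

public class ShutdownHookSketch {
    // Hypothetical stand-in for an index writer; a real application would
    // close its actual writer object here instead.
    static final class FakeIndexWriter implements Closeable {
        private final AtomicBoolean closed = new AtomicBoolean(false);

        @Override
        public void close() {
            closed.set(true);
        }

        boolean isClosed() {
            return closed.get();
        }
    }

    // Registers a hook that closes the resource when the JVM shuts down
    // normally (System.exit, end of main, Ctrl-C / SIGTERM).
    static Thread registerCloseOnExit(Closeable resource) {
        Thread hook = new Thread(() -> {
            try {
                resource.close();
            } catch (IOException e) {
                System.err.println("Failed to close on shutdown: " + e);
            }
        }, "index-shutdown-hook");
        Runtime.getRuntime().addShutdownHook(hook);
        return hook;
    }

    public static void main(String[] args) {
        FakeIndexWriter writer = new FakeIndexWriter();
        registerCloseOnExit(writer);
        // ... indexing work would happen here ...
    }
}
```

Note that shutdown hooks run only on an orderly JVM shutdown; they do not run if the process is killed forcibly (for example with SIGKILL) or if the JVM itself crashes.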

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Jamie
Hi Michael Thank you. Your suggestions were great and they were implemented (see attached source code), however, unfortunately, I am still getting file not found errors on the automatic merging of indexes. Regards, Jamie Michael McCandless wrote: Jamie, I'd love to get to the root cause

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Michael McCandless
A few more questions, below: Paul J. Lucas wrote: I have a thread that handles the unindexing/reindexing. It gets changed from a BlockingQueue. My unindex code is like: IndexWriter writer = new IndexWriter( INDEX, INDEX_ANALYZER, false ); final Term t = new Term( DIR_FIELD

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Michael McCandless
Paul J. Lucas wrote: On May 29, 2008, at 6:35 PM, Michael McCandless wrote: Can you use lsof (or something similar) to see how many files you have? FYI: I personally can't reproduce this; only a coworker can and even then it's sporadic, so it could take a little while. If possible, cou

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Michael McCandless
Jamie, I'd love to get to the root cause of your exception. Last time we talked (a few weeks back) I saw several possible causes in the source you had posted: http://markmail.org/message/dqovvcwgwof5f7wl Did you test any of the ideas there? You are potentially manually deleting file

Re: FileNotFoundException in ConcurrentMergeScheduler

2008-05-30 Thread Jamie
Hi Paul, I just noticed the discussion around this. Almost all of my customers have experienced or are experiencing the intermittent FileNotFound problem. Our software uses Lucene 2.3.1. I have just upgraded to Lucene 2.3.2 in the hope that this was one of the bugs that was fixed. I would be very inter