Re: IllegalArgumentException: attempt to access a deleted document

2005-10-06 Thread Doug Cutting
Peter Kim wrote: I noticed one way to get around this is to use IndexReader.isDeleted() to check if it's deleted or not. The problem with that is I only have access to a MultiSearcher in my HitCollector which doesn't give me access to the underlying IndexReader. I don't want to have to open an In

RE: IllegalArgumentException: attempt to access a deleted document

2005-10-06 Thread Otis Gospodnetic
Maybe somebody will suggest some workarounds, but I'll only suggest you don't optimize your index after each deletion - see http://www.lucenebook.com/search?query=when+to+optimize - the suggestions are in the snippet of the 1st hit. Otis --- Peter Kim <[EMAIL PROTECTED]> wrote: > I think my bes

RE: IllegalArgumentException: attempt to access a deleted document

2005-10-06 Thread Peter Kim
I think my best option will just be to optimize the index after each deletion. I guess this will be good for me anyways... For the cases where optimizing is not a luxury one can afford, it would be nice if there was another way around this. Peter > -Original Message- > From: Peter Kim [m

Re: searching on special characters as in "C++"

2005-10-06 Thread Chris Lamprecht
StandardAnalyzer's grammar tokenizes C# and C++ down to "C". So you can either use an analyzer that tokenizes differently (such as WhitespaceAnalyzer), or modify the JavaCC grammar for StandardAnalyzer and rebuild your own custom version. If you go the latter route, have a look at NutchAnalysis.j

IllegalArgumentException: attempt to access a deleted document

2005-10-06 Thread Peter Kim
Hi, I'm getting this error from trying to access a document in my custom hit collector: java.lang.IllegalArgumentException: attempt to access a deleted document at org.apache.lucene.index.SegmentReader.document(SegmentReader.java:235 ) at org.apache.lucene.index.MultiReader.docume

searching on special characters as in "C++"

2005-10-06 Thread Filip Anselm
How can I make it possible to search on words that includes special characters like + and # as in "C++" and "C#" ? Filip - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: change of document ids accross optimize

2005-10-06 Thread Yonik Seeley
ids can also change as the result of an add(), not just optimize(). An add can trigger a segment merge which can squeeze out deleted docs and thus change the ids. I think everything else you said is pretty much correct. On 10/6/05, Jack McBane <[EMAIL PROTECTED]> wrote: > > I know that in general

change of document ids accross optimize

2005-10-06 Thread Jack McBane
I know that in general if you optimize an index the document id that a HitCollector would see is not guarenteed to stay the same since any deleted documents would get removed from the index completely and all later documents pushed up to fill in the gaps. I'm wondering though, are the document ids

Re: who could tell me the equation of the scoring in detail? i'm confused about two days

2005-10-06 Thread msftblows
Your site is down...but I would also liek to read that doc. -Original Message- From: jian chen <[EMAIL PROTECTED]> To: java-user@lucene.apache.org; chai qiaozi <[EMAIL PROTECTED]> Sent: Tue, 4 Oct 2005 20:38:25 -0700 Subject: Re: who could tell me the equation of the scoring in detail? i

Re: What is a Hits object?

2005-10-06 Thread Erik Hatcher
On Oct 6, 2005, at 11:28 AM, Cyril Barlow wrote: How long does the Hits object stay in memory for? As long as you keep it around. Lucene isn't responsible for keeping a reference to it, your application is. Until the IndexSearcher closes? And if you just use 1 IndexSearcher would there

Re: repetition of document while indexing..................

2005-10-06 Thread Otis Gospodnetic
This is a question for java-user@ list, not java-dev@ Hello Shadab, You need to make your application perform the "primary key"-like logic. It can, for instance, look in the index first, before adding a document, using a field designated to be the PK. If it finds an existing document, delete it

Re: IndexWriter recreating

2005-10-06 Thread Otis Gospodnetic
Alex, Please wrap this in a unit test (using JUnit) that makes it repeatable. When yo uhave that, please file the bug in JIRA and attach your unit test. Thanks, Otis --- Alex Kiselevski <[EMAIL PROTECTED]> wrote: > > Hi, > > I have a strange exception when I'm trying to recreate an IndexWri

Re: What is a Hits object?

2005-10-06 Thread Cyril Barlow
- Original Message - From: "J.J. Larrea" <[EMAIL PROTECTED]> To: Sent: Wednesday, October 05, 2005 10:31 PM Subject: Re: What is a Hits object? > A Hits object is essentially a cache on query results. It caches in 2 ways: > > 1. When a query returning Hits is requested, only the top 10

Re: IndexSearcher over RMI

2005-10-06 Thread Cyril Barlow
- Original Message - From: "Erik Hatcher" <[EMAIL PROTECTED]> To: Sent: Thursday, October 06, 2005 1:49 PM Subject: Re: IndexSearcher over RMI > > On Oct 6, 2005, at 8:45 AM, Cyril Barlow wrote: > > I'm trying to pass an IndexSearcher over RMI but I'm getting a : > > > > java.rmi.Unma

Re: IndexSearcher in servlet containers

2005-10-06 Thread Cyril Barlow
> > There really is no need to close an IndexSearcher until you need to > instantiate another one, and even then you can let the old instance > go without closing and all will still be well. If you construct > IndexSearcher with a String directory name, there is no need to close > anything other

Re: IndexSearcher over RMI

2005-10-06 Thread Erik Hatcher
On Oct 6, 2005, at 8:45 AM, Cyril Barlow wrote: I'm trying to pass an IndexSearcher over RMI but I'm getting a : java.rmi.UnmarshalException: error unmarshalling return; nested exception is: java.io.WriteAbortedException: writing aborted; java.io.NotSerializableException: org.apach

IndexSearcher over RMI

2005-10-06 Thread Cyril Barlow
I'm trying to pass an IndexSearcher over RMI but I'm getting a : java.rmi.UnmarshalException: error unmarshalling return; nested exception is: java.io.WriteAbortedException: writing aborted; java.io.NotSerializableException: org.apache.lucene.search.IndexSearcher Is there any way around t

Re: IndexSearcher in servlet containers

2005-10-06 Thread Erik Hatcher
On Oct 5, 2005, at 9:32 PM, Cyril Barlow wrote: Creating an IndexSearcher for every request goes against how to use Lucene best. A _single_ IndexSearcher for all searches is optimum. You really ought to look into using a single instance. Erik ---

Re: Query to return all documents in the index

2005-10-06 Thread Cyril Barlow
Have a look at your fields to see if they share a certain term. eg if you have a URL field then 'http' will probably bring them all up. ___ To help you stay safe and secure online, we've developed the all new Yahoo! Securi

IndexWriter recreating

2005-10-06 Thread Alex Kiselevski
Hi, I have a strange exception when I'm trying to recreate an IndexWriter that was previously defined. I did the following steps: 1. mWriter = new IndexWriter(indexPath, analyzer, true); 2. mWriter.addDocument(document); 3. mWriter.optimize(); 4. mWriter.cl

RE: Query to return all documents in the index

2005-10-06 Thread Vanlerberghe, Luc
There is a MatchAllDocsQuery available (in the current development trunk I believe) in org.apache.lucene.search. I simply took the source and compiled it along with my project to use it... Luc -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Chris Hostett