Re: Lucene searching class

2007-10-25 Thread poojasreejith
Hi Steven, It really solved my problem. thanks a lot. pooja Steven Rowe wrote: > > Hi Pooja, > > poojasreejith wrote: >> I am using lucene2.2.0 for my application. I have a searcher.java >> class. >> The problem I am facing is, it is not supporting >> >> Query query = QueryParser.par

Re: lucene indexing doubts

2007-10-25 Thread poojasreejith
Hi, thanks for your response. I think you haven't understood my question, so I will explain with an example. I have a folder which contains the indexed files. Suppose I want to add one more piece of indexed data to it, without deleting the whole folder and performing the indexing for all the fil

[ANN]VTD-XML 2.2

2007-10-25 Thread jimmy Zhang
XimpleWare is proud to announce the release of version 2.2 of VTD-XML, the next generation XML parser/indexer/slicer/editor. This release significantly expands VTD-XML's ability to slice, split, edit and incrementally update XML documents. To this end, we introduce the concept of names

Re: Reloading a searcher

2007-10-25 Thread Erick Erickson
What happens if you close the writer before instantiating a new searcher? I can't say for sure whether it matters, but it's worth a try. Erick On 10/25/07, Guilherme Barile <[EMAIL PROTECTED]> wrote: > > Hello > I wrote a simple class to abstract searching on a text file > (generate by a

Re: Speeding up highlighting by storing a cached TokenStream

2007-10-25 Thread Mark Miller
Test the difference though. With small docs I get better speed just re-analyzing; if the docs are big enough, TokenSources starts to show an advantage, but they have to be pretty big. Your mileage may vary, so best to check. markharw00d wrote: Anyone care to suggest an approach t

RE: Same query different results?

2007-10-25 Thread Warren
> -Original Message- > From: Karl Wettin [mailto:[EMAIL PROTECTED] > Sent: Wednesday, October 24, 2007 10:25 PM > To: java-user@lucene.apache.org > Subject: Re: Same query different results? > > > > 25 okt 2007 kl. 06.33 skrev Warren: > > > This query works correctly in my app and Luk

Re: Speeding up highlighting by storing a cached TokenStream

2007-10-25 Thread markharw00d
Anyone care to suggest an approach to making this faster? See TokenSources.java Cheers Mark

Speeding up highlighting by storing a cached TokenStream

2007-10-25 Thread Michael Stoppelman
Most of the time spent highlighting goes to getting the next token from the analyzer (tokenStream.next()). I'm wondering how I can access the tokens that are stored in Lucene (or store another copy of the TokenStream separately) and send a pre-tokenized TokenStream to the highlighter so next() is
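The idea in this question, analyzing once and replaying the stored tokens on later highlight requests, can be sketched in plain Java. This is only an illustration under stated assumptions: `CachedTokens`, `Tok`, `analyze` and `tokensFor` are hypothetical names, the splitter is a stand-in for a real analyzer, and nothing here is Lucene's API. As the replies in this thread point out, it is worth measuring whether replaying actually beats re-analyzing for your document sizes.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

// Toy sketch: caches tokens (with offsets) per document so they can be replayed.
final class CachedTokens {
    // A token plus its character offsets, which a highlighter needs to mark up text.
    static final class Tok {
        final String text;
        final int start;
        final int end;

        Tok(String text, int start, int end) {
            this.text = text;
            this.start = start;
            this.end = end;
        }
    }

    // Hypothetical stand-in for an analyzer: splits on non-alphanumerics and lowercases.
    static List<Tok> analyze(String text) {
        List<Tok> out = new ArrayList<>();
        int i = 0;
        int n = text.length();
        while (i < n) {
            while (i < n && !Character.isLetterOrDigit(text.charAt(i))) i++;
            int start = i;
            while (i < n && Character.isLetterOrDigit(text.charAt(i))) i++;
            if (i > start) {
                out.add(new Tok(text.substring(start, i).toLowerCase(), start, i));
            }
        }
        return out;
    }

    private final Map<Integer, List<Tok>> cache = new HashMap<>();
    int analyzeCalls = 0; // exposed only so the effect of the cache can be observed

    // Returns a fresh iterator over the cached tokens, analyzing only on a cache miss.
    Iterator<Tok> tokensFor(int docId, String text) {
        List<Tok> tokens = cache.get(docId);
        if (tokens == null) {
            analyzeCalls++;
            tokens = analyze(text);
            cache.put(docId, tokens);
        }
        return tokens.iterator();
    }
}
```

The cache here is unbounded and keyed by a document ID that the caller supplies; a real application would need an eviction policy and would have to invalidate entries when a document changes.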

Re: HTML analyzer

2007-10-25 Thread Karl Wettin
25 okt 2007 kl. 20.18 skrev Cool Coder: Is there any analyzer that can be configured All of them can be. I suggest you take a look at the code of any of them, StandardAnalyzer for instance. -- kar

HTML analyzer

2007-10-25 Thread Cool Coder
Is there any analyzer that can be configured to stop searching URL text, i.e. href=""? Maybe I need some sort of filter with a regex so that the searcher will skip text that matches the regular expressions in the filter. I am not sure whether this is possible. I would appreciate your valuable

Re: Lucene searching class

2007-10-25 Thread Steven Rowe
Hi Pooja, poojasreejith wrote: > I am using lucene2.2.0 for my application. I have a searcher.java class. > The problem I am facing is, it is not supporting > > Query query = QueryParser.parse(q, "contents",new StandardAnalyzer()); it > shows error; the method parse in the type QueryParser is

Re: Using not operator in query

2007-10-25 Thread Karl Wettin
25 okt 2007 kl. 19.57 skrev Vs_Inf: We have a requirement to allow users of our search application to use "NOT" operator on a single term. Is there a work-around for this requirement? You will need to combine the NOT-statement with a MatchAllDocsQuery.
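Why a lone NOT needs a "match everything" clause can be shown with a toy inverted index: a negative clause can only remove documents from a candidate set, so the candidate set has to start out as all documents. In Lucene terms this corresponds to a BooleanQuery with a MatchAllDocsQuery added as a MUST clause and the unwanted term added as a MUST_NOT clause. The sketch below is plain Java and not Lucene's API; `NotQuerySketch`, `addDocument` and `matchAllExcept` are hypothetical names used only for illustration.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Toy illustration only: document IDs are plain integers, terms are plain strings.
final class NotQuerySketch {
    // Hypothetical inverted index: term -> IDs of the documents that contain it.
    private final Map<String, Set<Integer>> postings = new HashMap<>();
    private final Set<Integer> allDocs = new TreeSet<>();

    void addDocument(int docId, String... terms) {
        allDocs.add(docId);
        for (String term : terms) {
            postings.computeIfAbsent(term, t -> new TreeSet<>()).add(docId);
        }
    }

    // "NOT term" on its own: start from every document, then remove the matches.
    Set<Integer> matchAllExcept(String term) {
        Set<Integer> result = new TreeSet<>(allDocs);
        result.removeAll(postings.getOrDefault(term, Collections.<Integer>emptySet()));
        return result;
    }
}
```

Without the `allDocs` starting set there is nothing to subtract from, which is the limitation the original question runs into.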

Re: Lucene searching class

2007-10-25 Thread Karl Wettin
25 okt 2007 kl. 19.38 skrev poojasreejith: hi, I am using lucene2.2.0 for my application. I have a searcher.java class. The problem I am facing is, it is not supporting Query query = QueryParser.parse(q, "contents",new StandardAnalyzer ()); it shows error; the method parse in the type

Re: lucene indexing doubts

2007-10-25 Thread Karl Wettin
25 okt 2007 kl. 19.35 skrev poojasreejith: Can any of you guide me on how to index into an already indexed folder? Right now, I am deleting the indexed info and running the indexer again. I don't want to do that. I want a way to append into the same folder when new files are ind

Using not operator in query

2007-10-25 Thread Vs_Inf
Hi, We have a requirement to allow users of our search application to use the "NOT" operator on a single term. For example, if the user enters "not customer", they expect the search to return everything that is not "customer". How can I achieve this given Lucene's limitation on the NOT operator whic

Lucene searching class

2007-10-25 Thread poojasreejith
hi, I am using lucene2.2.0 for my application. I have a searcher.java class. The problem I am facing is, it is not supporting Query query = QueryParser.parse(q, "contents",new StandardAnalyzer()); it shows error; the method parse in the type QueryParser is not applicable for the (String, Str

lucene indexing doubts

2007-10-25 Thread poojasreejith
Hi, Can any of you guide me on how to index into an already indexed folder? Right now, I am deleting the indexed info and running the indexer again. I don't want to do that. I want a way to append into the same folder when new files are indexed. Is it possible? I couldn't find any solut

RE: Performance searching over multiple indexes

2007-10-25 Thread Ard Schrijvers
Hello, > Using more than one index will definitely decrease > search performance. Most of Lucene's search latency comes from > loading the hits. If there are no hits, the search takes a > short time, dozens of milliseconds, and it is constant if the > number of documents is less than 1M. Searching 100 inde

Reloading a searcher

2007-10-25 Thread Guilherme Barile
Hello, I wrote a simple class to abstract searching on a text file (generated by a legacy system). class MyFile { private Searcher s; private long timestamp; } It creates a timer and checks every ten minutes if textfile.lastModified() is different from the number it cached on times
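The reload-on-change pattern described in this message can be sketched independently of Lucene. This is a minimal sketch under stated assumptions: `ReloadingHolder` is a hypothetical name, `R` stands in for whatever searcher type the application uses, the `LongSupplier` would typically be `textfile::lastModified`, and the `Supplier` is a hypothetical factory that re-indexes the file and opens a new searcher.

```java
import java.util.function.LongSupplier;
import java.util.function.Supplier;

// Sketch only: holds one resource and rebuilds it when the source's timestamp changes.
final class ReloadingHolder<R> {
    private final LongSupplier lastModified; // e.g. textfile::lastModified
    private final Supplier<R> loader;        // hypothetical factory for a fresh searcher
    private R current;
    private long timestamp = -1L;

    ReloadingHolder(LongSupplier lastModified, Supplier<R> loader) {
        this.lastModified = lastModified;
        this.loader = loader;
    }

    // Call from a timer (say every ten minutes) or before each search.
    synchronized R get() {
        long modified = lastModified.getAsLong();
        if (current == null || modified != timestamp) {
            current = loader.get();
            timestamp = modified;
        }
        return current;
    }
}
```

The sketch does not close the instance it replaces. With a real searcher, the old one should be closed once no in-flight search is still using it, and, as suggested elsewhere in this thread, it may be worth closing the writer before opening the new searcher.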

RE: Performance searching over multiple indexes

2007-10-25 Thread Fang_Li
Using more than one index will definitely decrease search performance. Most of Lucene's search latency comes from loading the hits. If there are no hits, the search takes a short time, dozens of milliseconds, and it is constant if the number of documents is less than 1M. Searching 100 indexes will take 100 time

Performance searching over multiple indexes

2007-10-25 Thread Ard Schrijvers
Hello, I am experimenting with Lucene's MultiSearcher, running some simple BooleanQueries in which I combine a couple of TermQueries. I am finding that a single Lucene index for just 100.000 docs (~10 k each) is about 100 times faster than when I have about 100 separate indexes and use MultiSear

Re: Lucene Queries Over User-Editable Dynamic Categories of Documents

2007-10-25 Thread mark harwood
There are 2 considerations when caching filter results: 1) What were the criteria used to produce the results? 2) What version of the index were these results taken from? CachingWrapperFilter takes care of 2) by using a WeakHashMap keyed on IndexReader. The filter you pass to CachingWrapperFilter m

Re: Lucene Queries Over User-Editable Dynamic Categories of Documents

2007-10-25 Thread lucene user
What do you mean by 'Most caches are held in WeakHashMap...'? Is this caching provided by CachingWrapperFilter or do we have to implement it ourselves? I assume the former. We will share the results of our testing as soon as we have any - not sure how generalizable they will be. You have been super

Re: Customized search with Lucene?

2007-10-25 Thread Doron Cohen
"Lukas Vlcek" <[EMAIL PROTECTED]> wrote on 25/10/2007 10:25:23: > Doron, > > You definitely added few important (crucial) questions. There > are important > concerns and I am glad to hear that Lucene community is > debating them. I am > not an Lucene viscera expert thus I can hardly compare simple

Re: Customized search with Lucene?

2007-10-25 Thread Lukas Vlcek
Doron, You definitely added a few important (crucial) questions. These are important concerns and I am glad to hear that the Lucene community is debating them. I am not a Lucene internals expert, so I can hardly compare the simple search engine from the book with Lucene, but it is a pity to hear that robust im