Re: Concurrent Issue

2011-04-07 Thread Umesh Prasad
Building on Aditya's comments. 1. Lucene in Action has a SearchManager class which does most of 1-5 and manages indexreaders. 2. Reference counting is not the best approach, it is too low level and error prone. Instead Use ReadWriteLock

some basic questions on how Lucene/search engines work

2011-04-07 Thread Yang
I'm new to lucene/search engine , and have been struggling with these questions recently. I'd appreciate a lot of you could shed some light on this. let's say I do a query on dog greyhound note that I did not quote them, i.e. this is not a phrase search. what happens under the hood ? which

Re: ChainedFilter in Lucene 3.1.0

2011-04-07 Thread Robert Muir
Please read the CHANGES.txt and contrib/CHANGES.txt when upgrading, this is really really really really important. On Thu, Apr 7, 2011 at 6:40 PM, Siraj Haider wrote: > I am sorry, but the ChainedFilter was in lucene-misc-3.0.3.jar under > org.apache.lucene.misc but could not find it under the sa

Re: ChainedFilter in Lucene 3.1.0

2011-04-07 Thread Siraj Haider
I am sorry, but the ChainedFilter was in lucene-misc-3.0.3.jar under org.apache.lucene.misc but could not find it under the same location in lucene-misc.3.1.0.jar. On 4/7/2011 6:31 PM, Siraj Haider wrote: We have been using org.apache.lucene.search.ChaniedFilter in our application that uses lu

ChainedFilter in Lucene 3.1.0

2011-04-07 Thread Siraj Haider
We have been using org.apache.lucene.search.ChaniedFilter in our application that uses lucene 3.0.3. Today we downloaded version 3.1.0, but the code wont compile. It says that it could not find ChainedFilter. Did this class got moved to some other package? thanks -siraj ---

Re: Re: Re: A likely bug of TermsPosition.nextPosition

2011-04-07 Thread Michael McCandless
I committed another change to 3.x's CheckIndex (to also invoke .nextPosition); can you run that and see if it can detect this? Is there any way I can get this index? One question: how large are your payloads, typically? Does the exception still occur on the index fully rebuilt with 3.1? Mike h

Re: Indexing Non-Textual Data

2011-04-07 Thread Chris Spencer
My question wasn't just about classification. I'm asking, is there a way to classify non-textual data with Lucene? Yes, I know how to Google, and I've read the mailing list logs. All of those results only concern classifying simple text, not arbitrary numeric features. Regards, Chris On Thu, Apr

Re: Help with delimited text

2011-04-07 Thread Mark Wiltshire
Thanks Ian, your a star :-)RMarkOn 7 Apr 2011, at 11:18, Ian Lea wrote:Mark - I've uploaded some code to http://pastebin.com/mqSVcWUi thatindexes and searches file system paths.  It demonstrates what I'vebeen trying to suggest and may help you get your search up andrunning.--Ian.On Thu, Apr 7, 2011

Re: Help with delimited text

2011-04-07 Thread Ian Lea
Mark - I've uploaded some code to http://pastebin.com/mqSVcWUi that indexes and searches file system paths. It demonstrates what I've been trying to suggest and may help you get your search up and running. -- Ian. On Thu, Apr 7, 2011 at 8:18 AM, Mark Wiltshire wrote: > Hi Thanks Ian for you hel

PhraseQuery with huge "slop"?

2011-04-07 Thread sol myr
Hi, I need to run and "AND" query with a twist: give higher ranking for "exact match". So when searching for BIG BEN - Give high rank for the Document "BIG BEN is in London" - Lower rank for "It's a BIG day for my dear friend BEN" Following good advice from this list, I combined 2 separate que

Re: Analyzer which creates terms of one to n words

2011-04-07 Thread Israel Tsadok
Take a look st http://lucene.apache.org/java/3_0_3/api/contrib-analyzers/org/apache/lucene/analysis/shingle/package-summary.html On Thu, Apr 7, 2011 at 11:30 AM, Clemens Wyss wrote: > Is there an analyzer which takes a text and creates search terms based on > the following rules: > - all single

Analyzer which creates terms of one to n words

2011-04-07 Thread Clemens Wyss
Is there an analyzer which takes a text and creates search terms based on the following rules: - all single words - "two words in a row" - "three word in a row" - ... - "n words in a row" The reason is the following: I have an index which is now being analyzed using WhitespaceAnalyzer. Besides

Re: Help with delimited text

2011-04-07 Thread Mark Wiltshire
Hi Thanks Ian for you help on this, its driving me nuts :-) The StandardAnalyser is only used on the search query term being passed also. But In this case I am just adding a filter to the search. The actual category may be /Top/Books/Accountancy/10_Compliance/Internatio