Re: Differences between BooleanQuery and QueryParser

2012-01-30 Thread Erick Erickson
The parsing will be a trivial part of the overall query time, so small that I wouldn't worry about it in the least. I'd concentrate on doing the thing that takes the least maintenance. In the examples you're positing, it's not at all clear you could even measure the difference... Do what's easies

Re: Differences between BooleanQuery and QueryParser

2012-01-30 Thread Felipe Carvalho
Has anyone ever measured the difference? Could it make a 10%+ difference on a search involving around 5 terms?
On Mon, Jan 30, 2012 at 8:05 PM, Brandon Mintern wrote:
> A QueryParser has to actually parse your text and build up the
> BooleanQuery exactly as you do in your second example. Since th

Boost term according to phonetic representation

2012-01-30 Thread Felipe Carvalho
Consider a people index, containing People documents with the following names:
Doc 1 [name: "Marcus"]
Doc 2 [name: "Markus"]
Doc 3 [name: "Mharcus"]
Suppose I use an analyzer so that all 3 names have the same representation. Supposing I use the same analyzer when running a search for name=markus,

Custom Payload Analyzer and Query

2012-01-30 Thread ktjex
I'm working on providing advanced searching for annotated Medical Documents (using UIMA). In the context of an annotated document, I identify relevant medical terms, as well as the negation of certain terms. Following what I've read and seen in Lucene examples, I've been able to provide a sea

Re: Differences between BooleanQuery and QueryParser

2012-01-30 Thread Brandon Mintern
A QueryParser has to actually parse your text and build up the BooleanQuery exactly as you do in your second example. Since the QueryParser is doing extra work that you don't have to do considering that you already know everything about the query, your second example is faster.
On Mon, Jan 30, 201
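To make the point above concrete, here is a minimal sketch in plain Java, deliberately without Lucene on the classpath. It only models the idea that the parser route tokenizes a string and then produces the same clause structure you could have built by hand. Everything in it is an assumption for illustration: `QuerySketch`, `Clause`, `parse`, `build`, and the `name`/`city` fields are hypothetical names, not Lucene's API, and the toy syntax covers only `+field:text` tokens.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Objects;

// Toy model only: these classes are hypothetical and are NOT Lucene's API.
final class QuerySketch {

    // One required term clause, e.g. field "name" with text "marcus".
    static final class Clause {
        final String field;
        final String text;

        Clause(String field, String text) {
            this.field = field;
            this.text = text;
        }

        @Override
        public boolean equals(Object o) {
            if (!(o instanceof Clause)) return false;
            Clause c = (Clause) o;
            return field.equals(c.field) && text.equals(c.text);
        }

        @Override
        public int hashCode() {
            return Objects.hash(field, text);
        }
    }

    // Parser route: split the string into tokens, then build the clause list.
    static List<Clause> parse(String queryText) {
        List<Clause> clauses = new ArrayList<>();
        for (String token : queryText.trim().split("\\s+")) {
            // Drop the optional leading '+' marker of the toy syntax.
            String body = token.startsWith("+") ? token.substring(1) : token;
            int colon = body.indexOf(':');
            if (colon <= 0 || colon == body.length() - 1) {
                throw new IllegalArgumentException("expected field:text, got " + token);
            }
            clauses.add(new Clause(body.substring(0, colon), body.substring(colon + 1)));
        }
        return clauses;
    }

    // Programmatic route: build the same clause list directly, no string handling.
    static List<Clause> build() {
        List<Clause> clauses = new ArrayList<>();
        clauses.add(new Clause("name", "marcus"));
        clauses.add(new Clause("city", "lisbon"));
        return clauses;
    }

    public static void main(String[] args) {
        List<Clause> parsed = parse("+name:marcus +city:lisbon");
        // Both routes end in an equal structure; only the construction work differs.
        System.out.println(parsed.equals(build()));
    }
}
```

The only extra cost on the parser route is the string handling inside `parse`. Whether that cost is measurable next to the search itself is exactly what the thread is debating, and this sketch does not settle it.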

Re: How to avoid filtering stop words like "IS" in StandardAnalyzer

2012-01-30 Thread Pedro Lacerda
I didn't know about CharArraySet.EMPTY_SET, thanks.
Pedro Lacerda
2012/1/29 Uwe Schindler
> Hi,
>
> If you want to disable *all* stop words, then CharArraySet.EMPTY_SET is the
> right choice. For performance reasons you should also use CharArraySet for
> non-empty stop words instead of simple

Re: weightage of each word according to precedence in document

2012-01-30 Thread Ian Lea
They all give exactly the same score, even the 3rd doc which doesn't contain abcd at all? Surprising. What does searcher.explain() say? Is this a simple search with default Similarity or is there stuff you're not telling us?
--
Ian.
On Sat, Jan 28, 2012 at 4:44 AM, A Z <4azfri...@gmail.com> wr