Re: PorterStemFilter causes wildcard searches to not work

2011-11-21 Thread Erick Erickson
Take a look at admin/analysis to see what happens when. What do you mean by "apply a PorterStemFilter"? At index time? Query time? The analysis page will let you look at what the terms look like after they've gone through your analysis chain, and the answer may be obvious then. All that said, you c

PorterStemFilter causes wildcard searches to not work

2011-11-21 Thread SBS
It seems that when I use a PorterStemFilter in my custom analyser, wildcard searches malfunction. As an example, I have the words "appendicitis" and "sensitisation" in our content. When I enter a query of "a*itis" I would expect to have "appendicitis" match but instead I get "sensitisation" and n

Re: Filtering search results

2011-11-21 Thread Erick Erickson
http://wiki.apache.org/solr/FieldCollapsing On Mon, Nov 21, 2011 at 8:17 AM, Felipe Carvalho wrote: > Thanks for the tip, Erick! Any pointer to any article showing how to use it? > > Thanks > > On Mon, Nov 21, 2011 at 10:49 AM, > Erick Erickson wrote: >

Re: Filtering search results

2011-11-21 Thread Felipe Carvalho
Thanks for the tip, Erick! Any pointer to any article showing how to use it? Thanks On Mon, Nov 21, 2011 at 10:49 AM, Erick Erickson wrote: > You might take a look at grouping, aka field collapsing. Faceting gives > you counts for various values in a field, but not a mixture of documents, > wher

Re: Filtering search results

2011-11-21 Thread Erick Erickson
You might take a look at grouping, aka field collapsing. Faceting gives you counts for various values in a field, but not a mixture of documents, whereas grouping will return you the top N members of each group which sounds like what you're after. Best Erick On Mon, Nov 21, 2011 at 5:04 AM, Ian L

Re: Filtering search results

2011-11-21 Thread Ian Lea
Running two queries is fine, as is looping through the results list. I don't think it is clear that one is necessarily "better" than the other. Faceting generally works by looping through results because you typically don't know in advance what values will be present. -- Ian, 2011/11/21 liugang

Re: lucene hits vs topdocs

2011-11-21 Thread Ian Lea
The general recommendation is to run the query again, but you are right that it isn't always the correct answer in all circumstances. If you want to guard against the scenario you outline, do it the way you suggest. That's fine. In your fluid environment how do you cope when doc #11 is no longer

Re:RE: Compound file don't work with lucene 3.3 or 3.4

2011-11-21 Thread liugangc
Thanks for Uwe's answer. -- gang liu email: liuga...@gmail.com At 2011-11-20 17:44:06,"Uwe Schindler" wrote: >Hi, > >Lucene only uses CFS for very small segments, that don't exceed a specific >percentage of the index size. See the method LogMergePolicy.setNoCFSRatio >[http://lucene.apache.or

Re:Filtering search results

2011-11-21 Thread liugangc
Hi, faceted search is like GROUP BY in SQL, and you would also need to query twice to get two lists in your situation. I don't think there is a better way to solve your problem. -- gang liu email: liuga...@gmail.com At 2011-11-21 09:35:08,"Felipe Carvalho" wrote: >Hello, > I'm working on a people finder a