Ranking docs with all terms higher

2011-05-18 Thread Christopher Condit
Let's say I have the query (nacho OR foo OR bar) and some documents (single field with norms off) doc a: nacho nacho nacho nacho doc b: foo bar bar doc c: nacho foo bar I'm interested in all of these documents but I would like c to score the highest since it contains all of the search terms, b to

Re: Please help me with a basic question...

2011-05-18 Thread Paul Libbrecht
Richard, in SOLR at least there's an analyzer that avoids duplicates. I think that would solve it. There's also somewhere the option to ignore IDF (in similarity? in solrconfig?). paul Le 18 mai 2011 à 21:30, Rich Heimann a écrit : > Hello all, > > This is my first time on the list and my fir

Please help me with a basic question...

2011-05-18 Thread Rich Heimann
Hello all, This is my first time on the list and my first question...forgive me it this has been hacked out in the past. We have set up Lucene/Solr and are getting somewhat spurious results. It appears to be a result of heterogeneous document sizes. In other words, the top results are sometimes (