TopDocsCollector and sorting

2011-03-15 Thread Ryan Aylward
Does the TopDocsCollector sort the docs it returns? The Javadoc for... public final TopDocs topDocs(int start) states: Returns the documents in the range [start .. pq.size()) that were collected by this collector. Note that if start >= pq.size(), an empty TopDocs is returned. This method i

RE: Using different field when overriding computeNorm

2011-02-03 Thread Ryan Aylward
:51 PM, Ryan Aylward wrote: > I have had to do similar things to other methods of Similarity. In my > example, I wanted to have different behavior for the tf() method for each > field. The tf method does not include a field parameter as an input to it. > The only solution I could

RE: Using different field when overriding computeNorm

2011-02-01 Thread Ryan Aylward
I have had to do similar things to other methods of Similarity. In my example, I wanted to have different behavior for the tf() method for each field. The tf method does not include a field parameter as an input to it. The only solution I could come up with was to add a thread local to set the f
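
A minimal sketch of the thread-local workaround described above, assuming the value stored is the name of the field currently being scored (the preview is cut off, so that is a guess). FieldAwareTf, setCurrentField and clearCurrentField are hypothetical names, and the class is a plain-Java stand-in rather than a real Similarity subclass; only the shape of tf(float freq), which takes no field argument, follows the message. The square-root fallback mirrors the usual default tf.

```java
// Hypothetical stand-in for a custom Similarity; it does not extend any Lucene class.
class FieldAwareTf {
    // Assumption: the caller records the field being scored on the current thread.
    private static final ThreadLocal<String> CURRENT_FIELD = new ThreadLocal<>();

    // Call before running the part of the search that scores the given field.
    static void setCurrentField(String field) {
        CURRENT_FIELD.set(field);
    }

    // Call afterwards so a pooled thread does not keep a stale field name.
    static void clearCurrentField() {
        CURRENT_FIELD.remove();
    }

    // Same shape as tf(float freq): no field parameter, so the field is read
    // from the thread local instead.
    static float tf(float freq) {
        String field = CURRENT_FIELD.get();
        if ("name".equals(field)) {
            // Example policy for a hypothetical "name" field: presence only.
            return freq > 0 ? 1.0f : 0.0f;
        }
        // Fallback for every other field, or when no field is set: sqrt(freq).
        return (float) Math.sqrt(freq);
    }

    public static void main(String[] args) {
        setCurrentField("name");
        try {
            System.out.println("tf for name field: " + tf(4.0f));
        } finally {
            clearCurrentField();
        }
        System.out.println("tf with no field set: " + tf(4.0f));
    }
}
```

The try/finally in main matters when threads are reused: without the clear, a later search on the same thread would silently pick up the previous field.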

RE: [POLL] Where do you get Lucene/Solr from? Maven? ASF Mirrors?

2011-01-18 Thread Ryan Aylward
[] ASF Mirrors (linked in our release announcements or via the Lucene website)
[X] Maven repository (whether you use Maven, Ant+Ivy, Buildr, etc.)
[] I/we build them from source via an SVN/Git checkout.
[] Other (someone in your company mirrors them internally or via a downstream project)
---

RE: Creating an index with multiple values for a single field

2011-01-10 Thread Ryan Aylward
me field, or turning off tf or idf. -- Anshum Gupta http://ai-cafe.blogspot.com On Sat, Jan 8, 2011 at 6:03 AM, Ryan Aylward wrote: > Our business has a need to allow for multiple values for a single field. > For example, we have an index of employers where an employer often has > mul

Creating an index with multiple values for a single field

2011-01-07 Thread Ryan Aylward
tually for name2 or name3, etc. But this doesn't feel like a clean solution. Any suggestions on how to deal with this? Any ideas would be appreciated. Ryan Aylward

RE: is OpenBitSet / SortedVIntList compressed bit map index?

2011-01-07 Thread Ryan Aylward
I don't recall how we decided to use it, but we are using http://code.google.com/p/compressedbitset/ and it seems to be pretty efficient in terms of memory. -Original Message- From: Federico Fissore [mailto:feder...@fissore.org] Sent: Friday, January 07, 2011 3:12 PM To: java-user@lucen

RE: Multivalued scoring

2010-12-15 Thread Ryan Aylward
Would you be able to create a single index with all photos? Your searches would go against the photo index. At that point, you would have the most relevant photos regardless of album. You could then introduce a sort to your Lucene search to ensure all photos from a given album are grouped togeth

RE: Help Overriding behavior in BooleanQuery scorer

2010-12-07 Thread Ryan Aylward
kage protected thing, but you may not need to go there. This sounds a lot like DisjunctionMaxQuery, have you looked at it? http://lucene.apache.org/java/3_0_2/api/all/index.html Best Erick On Tue, Dec 7, 2010 at 1:09 AM, Ryan Aylward

Help Overriding behavior in BooleanQuery scorer

2010-12-06 Thread Ryan Aylward
, BooleanScorer, BooleanScorer2, DisjunctionSumScorer, ConjunctionScorer, etc). Does anyone know why many of those classes are package protected and final? Any suggestions for how to customize this part of the scoring algorithm without having to copy all of these classes? Thanks, Ryan Aylward