RE: Field values ...

2008-03-20 Thread Dragon Fly
I was able to get a list of all the values for the Id field and another list of all the values for the Phone field using TermEnum (i.e. two independent lists). However, what I really wanted was:
- Get the first Id
- Get the Phone associated with the first Id.
- Get the second Id
- Get the Phon

backup RAMDirectory to file

2008-03-20 Thread roger dimitri
Hi, I am using the Directory class's copy method to periodically sync my RAM based index to a file based index that's supposed to serve as a hot backup. I want to know if this is the right way to maintain a periodic backup of my RAM based index and, if Yes, then is it reliable in a production

Chaining analyzers

2008-03-20 Thread Vinicius Carvalho
Hello there! Is it possible to chain analyzers? If I don't know the locale of my document, and given that all of my docs will always be in English, Spanish, or Portuguese, is it possible to chain analyzers to remove stop-words for all of those locales? I know that stem would be a much
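One way to picture the stop-word part of this question is to merge the per-language stop lists into a single set and filter tokens against it. The sketch below is plain Java, not Lucene code: the class name, the method names, and the three tiny word lists are all hypothetical and chosen only for illustration. Whether and how such a merged set plugs into an analyzer is an assumption that is not confirmed by the thread.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class MultiLocaleStopWords {
    // Tiny illustrative lists only; real stop-word lists are much longer.
    static final List<String> ENGLISH = Arrays.asList("the", "of", "and");
    static final List<String> SPANISH = Arrays.asList("el", "la", "de", "y");
    static final List<String> PORTUGUESE = Arrays.asList("o", "a", "de", "e");

    /** Builds the union of the three per-language lists. */
    public static Set<String> merged() {
        Set<String> all = new HashSet<>();
        all.addAll(ENGLISH);
        all.addAll(SPANISH);
        all.addAll(PORTUGUESE);
        return all;
    }

    /** Lowercases, splits on whitespace, and drops every token found in stopWords. */
    public static List<String> filter(String text, Set<String> stopWords) {
        List<String> kept = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("\\s+")) {
            if (!token.isEmpty() && !stopWords.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        System.out.println(filter("The house of la casa e o livro", merged()));
    }
}
```

A caveat with the union approach: a word that is a stop-word in one language is removed everywhere, even where it carries meaning in another language, so the merged list removes more than any single-locale list would.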

Re: document scoring

2008-03-20 Thread Erick Erickson
Try TopDocs. You can use getMaxScore and divide
Best
Erick
On Thu, Mar 20, 2008 at 11:12 AM, Cam Bazz <[EMAIL PROTECTED]> wrote:
> Hello,
>
> I am querying an index by using custom boost factors for each field. Usually a query looks like:
>
> fieldA:"term1"^0.2 fieldB:"term2"^4
>
> when
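The "take the maximum and divide" idea in the reply can be sketched without any Lucene types. This is only an illustration under stated assumptions: the class and method names are hypothetical, and the raw scores are passed in as a plain array, whereas in the reply the maximum would come from getMaxScore on TopDocs.

```java
import java.util.Arrays;

public class ScoreNormalizer {
    /** Divides every score by the largest one, so the best hit becomes 1.0. */
    public static float[] normalize(float[] scores) {
        float max = 0f;
        for (float s : scores) {
            max = Math.max(max, s);
        }
        float[] out = new float[scores.length];
        if (max <= 0f) {
            return out; // no positive score to scale by; return all zeros
        }
        for (int i = 0; i < scores.length; i++) {
            out[i] = scores[i] / max;
        }
        return out;
    }

    public static void main(String[] args) {
        float[] raw = {4.0f, 2.0f, 1.0f}; // made-up raw scores
        System.out.println(Arrays.toString(normalize(raw)));
    }
}
```

Scores normalized this way are relative to the best hit of one query, so they are not comparable across different queries.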

document scoring

2008-03-20 Thread Cam Bazz
Hello, I am querying an index by using custom boost factors for each field. Usually a query looks like:
fieldA:"term1"^0.2 fieldB:"term2"^4
When I get scores from HitCollector, they are not necessarily between 0 and 1. How can I normalize these scores?
Best.
-C.A.

using hitcollector and scoring at the same time

2008-03-20 Thread Cam Bazz
Hello, I recently changed my query logic. Before, I was getting a hits object, and now I am using a bitSet with a hitcollector. The reason for using bitSet is document caching, and being able to count how many hits belong to which categories. Although my new logic works, I have noticed that now t
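The setup described here, collecting hits into a bit set and then counting them per category, might look roughly like the following. It is a plain-Java sketch, not the poster's code: the class name, the `docCategory` lookup array, and the method names are hypothetical, and `collect(doc, score)` only imitates the shape of a hit-collector callback.

```java
import java.util.BitSet;

public class CategoryCounter {
    // One bit per matching document id.
    private final BitSet hits = new BitSet();

    /** Stand-in for a collector callback: records the doc id, ignores the score. */
    public void collect(int doc, float score) {
        hits.set(doc);
    }

    /**
     * Counts collected documents per category.
     * docCategory[doc] is assumed to hold the category index of each document.
     */
    public int[] countByCategory(int[] docCategory, int numCategories) {
        int[] counts = new int[numCategories];
        for (int doc = hits.nextSetBit(0); doc >= 0; doc = hits.nextSetBit(doc + 1)) {
            counts[docCategory[doc]]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        CategoryCounter counter = new CategoryCounter();
        counter.collect(0, 1.5f);
        counter.collect(2, 0.3f);
        counter.collect(3, 0.9f);
        int[] counts = counter.countByCategory(new int[] {0, 1, 1, 0}, 2);
        System.out.println(counts[0] + " " + counts[1]);
    }
}
```

In this sketch the bit set keeps only document ids, so the score handed to `collect` is dropped; retaining scores would need an extra structure alongside the bit set.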

Re: Field values ...

2008-03-20 Thread Erick Erickson
See TermDocs/TermEnum. The trick is to start one of your enumerations with "" (I forget exactly which), and that'll iterate them all.
Best
Erick
On Thu, Mar 20, 2008 at 9:55 AM, Dragon Fly <[EMAIL PROTECTED]> wrote:
> What's the easiest way to extract the values of 2 fields from each
> document

Field values ...

2008-03-20 Thread Dragon Fly
What's the easiest way to extract the values of 2 fields from each document in the index? For example, each document has 5 fields: Id, Name, Address, Phone, Preference. I'd like to extract the values for the Id and Phone fields for each document in the index. Thank you.