Hi everybody,

I need to query documents not only by search terms but also by numeric values (or other general types). Let me try to explain with a hypothetical example.

Assuming each document has a value for its number of words (or the number of person names, or whatever), I would want to formulate a query like "Give me documents containing 'jack johnson' AND with token_count > 250".
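
To make the example concrete, here is a minimal sketch in plain Java of the semantics I have in mind. It does not use Lucene at all: the class NumericFilterSketch, the Doc type, the feature name "token_count" and the sample data are all made up for illustration, and a linear scan like this would clearly not scale to a real index.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only: plain Java, no Lucene calls.
class NumericFilterSketch {

    // A document with its text and its numeric features (e.g. "token_count").
    static class Doc {
        final String id;
        final String text;
        final Map<String, Double> features;

        Doc(String id, String text, Map<String, Double> features) {
            this.id = id;
            this.text = text;
            this.features = features;
        }
    }

    // Returns the ids of documents whose text contains the phrase AND whose
    // named feature is strictly greater than the given threshold.
    static List<String> search(List<Doc> docs, String phrase, String feature, double min) {
        String needle = phrase.toLowerCase(Locale.ROOT);
        List<String> hits = new ArrayList<>();
        for (Doc d : docs) {
            Double value = d.features.get(feature);
            boolean textMatch = d.text.toLowerCase(Locale.ROOT).contains(needle);
            boolean numberMatch = value != null && value > min;
            if (textMatch && numberMatch) {
                hits.add(d.id);
            }
        }
        return hits;
    }

    // Helper that builds a one-entry feature map.
    static Map<String, Double> feature(String name, double value) {
        Map<String, Double> m = new HashMap<>();
        m.put(name, value);
        return m;
    }

    // Made-up sample data: only d1 matches both the phrase and the threshold.
    static List<Doc> sampleDocs() {
        List<Doc> docs = new ArrayList<>();
        docs.add(new Doc("d1", "Jack Johnson plays guitar", feature("token_count", 300)));
        docs.add(new Doc("d2", "jack johnson on tour", feature("token_count", 120)));
        docs.add(new Doc("d3", "someone else entirely", feature("token_count", 900)));
        return docs;
    }

    public static void main(String[] args) {
        // prints [d1]
        System.out.println(search(sampleDocs(), "jack johnson", "token_count", 250));
    }
}
```

What I am looking for is a way to get the same result out of an index instead of scanning every document.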

I've worked with Lucene before and the keyword part is easy, but what would be a good way to query for numbers etc.?

My first idea was to store the numbers (which are basically a HashMap<String,Double>) in the index in some way or other. But it is not at all obvious to me how I would then query them.

Another option I could think of would be a separate database of some kind, but then how would I bring the two together in a way that makes sense?

Any pointers to useful resources and any kind of hints are welcome! :-)

Best,

  Niels


--
Niels Ott
Computational Linguist (B.A.)
http://www.drni.de/niels/
