Re: controlled indexing with Lucene

2008-11-29 Thread Erick Erickson
I'm not entirely clear what you're trying to accomplish, but there are a number of options. You could just index the phrases as normal text, possibly with position increment gaps (assuming more than one phrase is possible in a field in a document). Then you could use SpanQuery objects to accomplish the "simpl
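
To illustrate why a position increment gap matters when several phrases share one field, here is a minimal sketch in plain Python, not Lucene code. The helper names `index_positions` and `phrase_matches` and the gap value of 100 are assumptions made for illustration. The idea is that each additional value in a field has its token positions shifted by the gap, so a positional match cannot run across the boundary between two phrases.

```python
def index_positions(values, gap=100):
    """Map each token to its positions, adding `gap` between field values.

    Hypothetical helper: mimics the effect of a position increment gap
    for a multi-valued field; it is not Lucene's implementation.
    """
    positions = {}
    pos = 0
    for value_index, value in enumerate(values):
        if value_index > 0:
            pos += gap  # push the next value's tokens away from the previous one
        for token in value.lower().split():
            positions.setdefault(token, []).append(pos)
            pos += 1
    return positions


def phrase_matches(positions, phrase):
    """Return True if the phrase tokens occur at consecutive positions."""
    tokens = phrase.lower().split()
    if not tokens or tokens[0] not in positions:
        return False
    return any(
        all(start + k in positions.get(tok, ()) for k, tok in enumerate(tokens))
        for start in positions[tokens[0]]
    )


field_values = ["vector space", "model theory"]
no_gap = index_positions(field_values, gap=0)
with_gap = index_positions(field_values, gap=100)
```

With no gap, the query "space model" matches even though it straddles two separate phrases; with a gap, only phrases that were actually indexed together match.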

controlled indexing with Lucene

2008-11-28 Thread Amir Hossein Jadidinejad
Hi, I want to index some documents using only known phrases. Let me explain: suppose I have a controlled vocabulary of phrases (a list of candidate phrases). I intend to index ONLY these phrases within my documents and have a retrieval model (for example, simple VS-TF.IDF) that each
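
The setup described above can be sketched in plain Python, independent of Lucene. This is only one possible reading of the question: the function names `tokenize`, `extract_phrases`, and `tf_idf`, the longest-match rule, and the raw-count times log(N/df) weighting are all assumptions for illustration, not something stated in the thread.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def extract_phrases(text, vocabulary):
    """Count occurrences of controlled-vocabulary phrases only.

    Tokens that are not part of a vocabulary phrase are ignored.
    At each position the longest matching phrase wins (an assumption).
    """
    tokens = tokenize(text)
    phrase_tokens = {tuple(tokenize(p)): p for p in vocabulary}
    max_len = max((len(k) for k in phrase_tokens), default=0)
    counts = Counter()
    i = 0
    while i < len(tokens):
        matched = False
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            key = tuple(tokens[i:i + n])
            if key in phrase_tokens:
                counts[phrase_tokens[key]] += 1
                i += n  # skip past the matched phrase
                matched = True
                break
        if not matched:
            i += 1
    return counts


def tf_idf(docs, vocabulary):
    """Return one {phrase: weight} dict per document, using tf * log(N / df)."""
    tfs = [extract_phrases(d, vocabulary) for d in docs]
    n_docs = len(docs)
    df = Counter(p for tf in tfs for p in tf)
    return [{p: c * math.log(n_docs / df[p]) for p, c in tf.items()} for tf in tfs]


vocabulary = ["information retrieval", "vector space"]
docs = [
    "Information retrieval uses the vector space model. Information retrieval is fun.",
    "The vector space model",
]
weights = tf_idf(docs, vocabulary)
```

Here every word outside the vocabulary is dropped before weighting, so each document vector has at most one dimension per controlled phrase.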