On Thu, Mar 1, 2012 at 6:15 PM, Mike O'Leary <tmole...@uw.edu> wrote:
> In the Javadoc page for the Similarity class, it says,
>
> "Lucene combines Boolean model (BM) of Information Retrieval with Vector 
> Space Model (VSM) of Information Retrieval - documents "approved" by BM are 
> scored by VSM."
>
> Is the Vector Space Model that is referred to here different than the term 
> vectors that can optionally be stored in index fields?

Yes, they are different. The Javadoc refers to
http://en.wikipedia.org/wiki/Vector_space_model, which scores documents
using statistics stored in the index (such as term and document
frequencies). Term vectors are not used for scoring.
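
To make the "approved by BM, scored by VSM" idea concrete, here is a
minimal sketch in plain Python. It is not Lucene code and not Lucene's
actual scoring formula: the function names, the toy documents, and the
simple tf * idf weighting are all assumptions chosen for illustration.
The point is only that scoring needs nothing beyond the postings and
their statistics.

```python
import math
from collections import Counter, defaultdict


def build_index(docs):
    """Build a toy inverted index: term -> {doc_id: term frequency}."""
    postings = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, tf in Counter(text.lower().split()).items():
            postings[term][doc_id] = tf
    return dict(postings)


def idf(postings, term, num_docs):
    """Illustrative inverse document frequency (not Lucene's formula)."""
    df = len(postings.get(term, {}))
    return math.log(num_docs / df) + 1.0 if df else 0.0


def search(postings, num_docs, query):
    """Boolean step selects candidates; vector-space step scores them."""
    terms = query.lower().split()

    # Boolean model: a document qualifies if it contains any query term.
    candidates = set()
    for term in terms:
        candidates.update(postings.get(term, {}))

    # Vector space model: sum tf * idf over the query terms.
    scores = {}
    for doc_id in candidates:
        scores[doc_id] = sum(
            postings.get(term, {}).get(doc_id, 0) * idf(postings, term, num_docs)
            for term in terms
        )
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


docs = {
    1: "lucene scores documents with a vector space model",
    2: "term vectors store the terms of one document",
    3: "the boolean model selects matching documents",
}
postings = build_index(docs)
results = search(postings, len(docs), "vector documents")
```

With these toy documents, doc 1 ranks ahead of doc 3 because it matches
both query terms, and doc 2 is never scored at all because the Boolean
step rejects it.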

Term vectors, by contrast, are essentially a small, separate inverted
index stored for each individual document. For example, MoreLikeThis
uses them to retrieve the terms and their frequencies from just that
one document.
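
As a rough illustration of the per-document view, the sketch below
keeps a term -> frequency map for each document and picks the most
distinctive terms from one of them. This is plain Python, not the
Lucene API and not how MoreLikeThis is actually implemented; the
function names, the toy documents, and the tf * idf weighting are
assumptions made for the example.

```python
import math
from collections import Counter


def build_term_vectors(docs):
    """Per-document term vectors: doc_id -> {term: frequency}."""
    return {doc_id: Counter(text.lower().split()) for doc_id, text in docs.items()}


def top_terms(doc_id, term_vectors, top_n=3):
    """Pick the highest-weighted terms of one document (illustrative only)."""
    num_docs = len(term_vectors)

    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for vector in term_vectors.values():
        doc_freq.update(vector.keys())

    # Weight the terms of just this one document by tf * idf.
    vector = term_vectors[doc_id]
    weights = {
        term: tf * (math.log(num_docs / doc_freq[term]) + 1.0)
        for term, tf in vector.items()
    }
    return sorted(weights, key=lambda term: (-weights[term], term))[:top_n]


tv_docs = {
    1: "lucene lucene index search",
    2: "index search engine",
    3: "search search query",
}
term_vectors = build_term_vectors(tv_docs)
best = top_terms(1, term_vectors, top_n=2)
```

Here the lookup goes from a document to its terms, which is the
opposite direction from the main inverted index (term to documents).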

-- 
lucidimagination.com
