Hi Patricio,

As of now, I don't think this is possible. However, we are slowly but surely working on similar problems. Please feel free to add your two cents to http://wiki.apache.org/jakarta-lucene/FlexibleIndexing, as we are considering several new ideas for making indexing more flexible. For now, I guess you will need to store the statistics somewhere outside the index. Or, if you have thoughts on what to do, you could come up with an implementation and submit a patch.
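
To make the "store them elsewhere" route a bit more concrete, here is a minimal sketch of a side store that lives next to the index and maps a term and document number to its statistics vector, mirroring the Term => <docNum, <X1, ..., Xn>> layout from the question. The class name TermStatsStore and its methods are hypothetical, not part of Lucene, and it uses plain modern Java collections held in memory. One caveat worth stating: Lucene document numbers can change when segments are merged, so in practice it is probably safer to key on a stable ID kept in a stored field.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Hypothetical in-memory side store for per-term, per-document statistics.
 * Not a Lucene class; it only illustrates keeping the values outside the index.
 */
public final class TermStatsStore {
    // term text -> (document key -> statistics vector X1..Xn)
    private final Map<String, Map<Integer, double[]>> byTerm = new HashMap<>();

    /** Records the statistics for one term in one document (called while indexing). */
    public void put(String term, int docKey, double... stats) {
        byTerm.computeIfAbsent(term, k -> new HashMap<>())
              .put(docKey, stats.clone()); // defensive copy
    }

    /** Returns a copy of the stored statistics, or null if nothing was recorded. */
    public double[] get(String term, int docKey) {
        Map<Integer, double[]> byDoc = byTerm.get(term);
        if (byDoc == null) {
            return null;
        }
        double[] stats = byDoc.get(docKey);
        return stats == null ? null : stats.clone();
    }
}
```

At query time you would look up the matching term and document in this store after running the search. Persisting the map (a file or a small database) is left out of the sketch.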

-Grant

Patricio Galeas wrote:
Hello,

I am trying to use Lucene to run some experiments on the distribution of words
in documents. Using the TermPositionVector, one can compute some statistics
about word positions (mean, standard deviation, etc.).
Is it possible to store such statistical values directly in the Lucene
index?
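
As an illustration of the statistics meant here, the following is a minimal sketch in plain Java that computes the mean and the population standard deviation of a term's positions. The class name PositionStats is hypothetical, and the sample array is made up; in Lucene the int[] would presumably come from TermPositionVector.getTermPositions(index) for the term in question.

```java
/** Hypothetical helper: simple statistics over the positions of one term in one document. */
public final class PositionStats {
    private PositionStats() {}

    /** Arithmetic mean of the positions; NaN if the array is empty. */
    public static double mean(int[] positions) {
        if (positions.length == 0) {
            return Double.NaN;
        }
        long sum = 0;
        for (int p : positions) {
            sum += p;
        }
        return (double) sum / positions.length;
    }

    /** Population standard deviation (divides by n); NaN if the array is empty. */
    public static double stdDev(int[] positions) {
        if (positions.length == 0) {
            return Double.NaN;
        }
        double m = mean(positions);
        double squaredDiffs = 0.0;
        for (int p : positions) {
            double d = p - m;
            squaredDiffs += d * d;
        }
        return Math.sqrt(squaredDiffs / positions.length);
    }

    public static void main(String[] args) {
        // Made-up positions of one term within one document.
        int[] positions = {2, 4, 4, 4, 5, 5, 7, 9};
        System.out.println(mean(positions));   // prints 5.0
        System.out.println(stdDev(positions)); // prints 2.0
    }
}
```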

For example, I want to save the statistical values of each term (X1, X2, ...,
Xn) during indexing and retrieve them later at query time.

Term    =>    <docNum, freq, <X1, X2, ... ,Xn> >

Should I modify the classes IndexWriter.java and IndexReader.java?

Thank You

pgaleas



--

Grant Ingersoll
Sr. Software Engineer
Center for Natural Language Processing
Syracuse University
School of Information Studies
335 Hinds Hall
Syracuse, NY 13244

http://www.cnlp.org
Voice: 315-443-5484
Fax: 315-443-6886
