Hi Patricio,
As of now, I don't think this is possible. However, we are slowly but
surely working on similar problems. Please feel free to add your two
cents to http://wiki.apache.org/jakarta-lucene/FlexibleIndexing as we
are considering several new ideas related to making indexing more
flexible.
For now, I guess you need to store them elsewhere. Or, if you have
thoughts on what to do you could come up w/ an implementation and submit
a patch.
-Grant
Patricio Galeas wrote:
Hello,
I try to use Lucene to make some experiments with the distribution of words
in documents. Using the TermPositionVector, one can compute some statistics
about word positions (mean, standard deviation, etc.).
It is possible to store such statistical values directly in the Lucene
index?
For example, I want to save the statistical values of each Term (X1, X2, .,
Xn) by the index process, and get them later in query time.
Term => <docNum, freq, <X1, X2, ... ,Xn> >
Should I modify the classes: IndexWriter.java and IndexReader.java?
Thank You
pgaleas
--
Grant Ingersoll
Sr. Software Engineer
Center for Natural Language Processing
Syracuse University
School of Information Studies
335 Hinds Hall
Syracuse, NY 13244
http://www.cnlp.org
Voice: 315-443-5484
Fax: 315-443-6886
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]