On Jun 21, 2005, at 12:46 PM, Sebastian Marius Kirsch wrote:

On Tue, Jun 21, 2005 at 04:01:41PM +0200, Roxana Angheluta wrote:

I would like to include phrases (up to a maximum length given as a parameter) in the index. I know this is non-standard for searching, for example, where a PhraseQuery can be built that makes use of the term positions.
However, I am not interested in searching, but rather in using the
index terms for some statistics.


I built a filter for exactly this purpose (statistics over word
combinations); I will send it to you directly. It uses some containers
from org.apache.commons.collections, so I guess it's not suitable yet
for inclusion in the Lucene distribution.
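
The filter itself was sent off-list and is not reproduced in this message. As a rough illustration of the underlying idea only, here is a minimal sketch in plain Java that counts every run of up to maxLen consecutive tokens. It is not the filter described above and does not use the Lucene or commons-collections APIs; the class name PhraseCounter, the method countPhrases, and the sample tokens are all hypothetical, and the code assumes Java 8 or later and input that has already been tokenized.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Lucene: counts word combinations
// ("phrases") of 1..maxLen consecutive tokens.
class PhraseCounter {

    // Returns a map from phrase (tokens joined by single spaces) to the
    // number of times that phrase occurs in the token list.
    static Map<String, Integer> countPhrases(List<String> tokens, int maxLen) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (int start = 0; start < tokens.size(); start++) {
            StringBuilder phrase = new StringBuilder();
            // Grow the phrase one token at a time, stopping at maxLen
            // or at the end of the token list, whichever comes first.
            for (int len = 1; len <= maxLen && start + len <= tokens.size(); len++) {
                if (len > 1) {
                    phrase.append(' ');
                }
                phrase.append(tokens.get(start + len - 1));
                counts.merge(phrase.toString(), 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Sample input, assumed to be already tokenized and lowercased.
        List<String> tokens = Arrays.asList("new", "york", "city", "new", "york");
        countPhrases(tokens, 2).forEach((p, n) -> System.out.println(n + "\t" + p));
    }
}
```

Inside an analysis chain the same windowing would presumably be applied to the token stream at indexing time, so that each phrase ends up as an index term whose frequencies can be read back for statistics.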

A contribution with dependencies is fine, especially Apache ones. We can put this code in the contrib area if you'd like to contribute it. If so, please create a Bugzilla issue and attach the sources.

    Erik


