On Jun 21, 2005, at 10:01 AM, Roxana Angheluta wrote:

Dear all,

I am using Lucene for indexing documents.

I would like to include phrases (of a certain maximum length given as a parameter) in the index. I know this is non-standard for e.g. searching, where a PhraseQuery can be built which makes use of the terms positions. However, I am not interested in searching, but rather in using the indexing terms for some statistics.

What would be an efficient way to do this? Is it possible to build phrases in a filter after tokenization?

Roxana- could you give us a concrete example of what you're wanting to do?

A TokenFilter could certainly be used to aggregate multiple terms into a single term that represents a phrase. This would happen during the analysis process, which occurs along with tokenization.

    Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to