On Jun 21, 2005, at 10:01 AM, Roxana Angheluta wrote:
Dear all,
I am using Lucene for indexing documents.
I would like to include phrases (of a certain maximum length given
as a parameter) in the index. I know this is non-standard for e.g.
searching, where a PhraseQuery can be built which makes use of the
terms positions. However, I am not interested in searching, but
rather in using the indexing terms for some statistics.
What would be an efficient way to do this? Is it possible to build
phrases in a filter after tokenization?
Roxana- could you give us a concrete example of what you're wanting
to do?
A TokenFilter could certainly be used to aggregate multiple terms
into a single term that represents a phrase. This would happen
during the analysis process, which occurs along with tokenization.
Erik
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]