Assume we have a Lucene index over which several types of analyses are
performed.
Assume that the conclusions of some analysis require that new tokens be added
to existing documents in the index.
For example, a repeating pattern p (sequence of words) that appears in a large
part of the documents should be tagged in every document in its exact position.
Now it is required to execute proximity queries involving standard terms and
also the pattern p (e.g. find all documents in which the word "hello" is
adjacent to the pattern p).
Is there a way of achieving this without re-indexing all the documents where
the pattern p was found ?
In other words, is it possible to maintain a separate index that would keep
only patterns->docIds/positions, and then join between the two indexes ?
If not, is there a plan to support this in the future ?
Thanks,
Arnon.