Thanks a lot. I think TermPositionsVector will solve my problem.
Although it seems to be a little inperformant
Concerning the term representation: our data is way more complex then
just phrasal annotation, it was just an example, because I am not
allowed to talk about our internal organisation. I
Hi,
we also index linguistic data, but (someone correct me if I'm wrong) you
have to deal with what the lucene store is offering.
You can store
usable on the search side :
- a term (TermAttribute)
- the position of the term (PositionIncrementAttribute)
- an arbitrary payload (PayloadAttrib