Re: token positions

2009-11-17 Thread Michael McCandless
The character offset info is only stored if you enable Field.TermVector.WITH_OFFSETS or WITH_POSITIONS_OFFSETS on the field. Even then, it can only be retrieved by getting the term vectors for that document and locating the term and the specific occurrence that you're interested in. This is likely quite a bi
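To make the idea concrete, here is a minimal, library-free sketch of what a per-document term vector with offsets conceptually holds: for each term, the start and end character offsets of every occurrence. This is not Lucene's API. The class name OffsetVectorSketch, the method offsetsByTerm, and the whitespace-only tokenization are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; Lucene's own term vector classes are not used here.
public class OffsetVectorSketch {

    /**
     * Splits text on whitespace and maps each term to the [start, end)
     * character offsets of all of its occurrences, in document order.
     */
    static Map<String, List<int[]>> offsetsByTerm(String text) {
        Map<String, List<int[]>> vector = new LinkedHashMap<>();
        int i = 0;
        while (i < text.length()) {
            // Skip any whitespace before the next token.
            while (i < text.length() && Character.isWhitespace(text.charAt(i))) {
                i++;
            }
            int start = i;
            // Advance to the end of the token.
            while (i < text.length() && !Character.isWhitespace(text.charAt(i))) {
                i++;
            }
            if (i > start) {
                String term = text.substring(start, i);
                vector.computeIfAbsent(term, k -> new ArrayList<>())
                      .add(new int[] {start, i});
            }
        }
        return vector;
    }

    public static void main(String[] args) {
        Map<String, List<int[]>> vector = offsetsByTerm("a word and another word");
        // Locate the term first, then walk its occurrences.
        for (int[] span : vector.get("word")) {
            System.out.println("word at [" + span[0] + ", " + span[1] + ")");
        }
        // prints "word at [2, 6)" then "word at [19, 23)"
    }
}
```

The lookup mirrors the two steps described above: find the term in the document's vector, then pick the occurrence of interest from its list of offsets.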

token positions

2009-11-17 Thread Christopher Tignor
Hello, hoping someone might clear up a question for me: when tokenizing, we provide the start and end character offsets for each token, locating it within the source text. If I tokenize the text "word" and then search for the term "word" in the same field, how can I recover this character offset i