The character offset info is only stored if you enable
Field.TermVector.WITH_OFFSETS or WITH_POSITIONS_OFFSETS on the field.
Then, it can only be retrieved if you get the term vectors for that
document, and locate the term & specific occurrence that you're
interested in.
This is likely quite a bi
Hello,
Hoping someone might clear up a question for me:
When Tokenizing we provide the start and end character offsets for each
token locating it within the source text.
If I tokenize the text "word" and then search for the term "word" in the
same field, how can I recover this character offset i