Our mails are crossing
Not that I know of. But why don't you just index (or maybe just store)
a separate field containing your offset information? Something like
title_offset with, say, a comma-separated pair denoting char position
and length that you then read in at search time and parse.
OK, I think I understand what's going on - it looks like I am able to set
the token for the full author name (Say, "Steve Suppe") with the correct
offsets, but the analyzer takes it once step further and tokenizes 'Steve'
and 'Suppe' which is giving me a lot more generated offsets and is
confus