Re: Offset Questions (Follow-Up)

2008-03-07 Thread Erick Erickson
Our mails are crossing. Not that I know of. But why don't you just index (or maybe just store) a separate field containing your offset information? Something like title_offset with, say, a comma-separated pair denoting character position and length, which you then read in at search time and parse.
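
A minimal sketch of that idea in plain Java, in case it helps. The class and method names (OffsetField, encode, decode) are hypothetical, invented for illustration, and are not part of Lucene. The sketch only covers building and parsing the "position,length" string; at index time the encoded string would be added to the document as a stored field such as title_offset, and at search time the stored value would be read back and passed to decode.

```java
// Hypothetical helper, not a Lucene class: encodes and decodes the
// "position,length" pair suggested for a field like title_offset.
final class OffsetField {
    private OffsetField() {}

    /** Builds the comma-separated value to store at index time. */
    static String encode(int start, int length) {
        if (start < 0 || length < 0) {
            throw new IllegalArgumentException("start and length must be non-negative");
        }
        return start + "," + length;
    }

    /** Parses a stored value back into {start, length} at search time. */
    static int[] decode(String stored) {
        // Limit of -1 keeps trailing empty parts, so "12,11," is rejected.
        String[] parts = stored.split(",", -1);
        if (parts.length != 2) {
            throw new IllegalArgumentException("expected 'start,length' but got: " + stored);
        }
        return new int[] {
            Integer.parseInt(parts[0].trim()),
            Integer.parseInt(parts[1].trim())
        };
    }
}
```

If the original text is also available at search time, the decoded pair can be used as text.substring(start, start + length) to pull out the matching span.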

Re: Offset Questions (Follow-Up)

2008-03-07 Thread Steve Suppe
OK, I think I understand what's going on. It looks like I am able to set the token for the full author name (say, "Steve Suppe") with the correct offsets, but the analyzer takes it one step further and tokenizes 'Steve' and 'Suppe', which is giving me a lot more generated offsets and is confus