Hi,
I was wondering what people's experience is with storing sentence (or
other) boundary information in Lucene. For instance, for phrase
queries, you may not want to match when two terms lie on either side of
a sentence boundary. I know for phrase queries the common approach is
to make the position increment larger than one, which solves that
immediate problem, but I have other uses for such information, too.
Should I just store some type of boundary marker at the appropriate
position and check to see if I have a boundary marker when doing my
processing? I know I need an Analyzer that can detect the boundaries,
for starters. What other issues have people run up against?
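
To make the two options concrete, here is a rough plain-Java sketch of what I have in mind. It deliberately uses no Lucene classes: the Token class, the "<SB>" marker term, and the gap of 100 are all made up for illustration, and a real version would live in the Analyzer/TokenStream. It combines a large position jump at each sentence start with a marker token stored right after each sentence, so that other processing can ask whether a boundary lies between two positions.

```java
import java.util.ArrayList;
import java.util.List;

public class SentenceBoundarySketch {
    // Hypothetical values, chosen only for this illustration.
    static final int SENTENCE_GAP = 100;
    static final String BOUNDARY = "<SB>";

    /** Minimal stand-in for an indexed token: a term plus its position. */
    static final class Token {
        final String term;
        final int position;
        Token(String term, int position) {
            this.term = term;
            this.position = position;
        }
    }

    /**
     * Assigns positions to the terms of each sentence. Terms inside a sentence
     * advance by 1; a boundary marker is stored right after each sentence; the
     * first term of every later sentence jumps by SENTENCE_GAP.
     */
    static List<Token> index(String[][] sentences) {
        List<Token> tokens = new ArrayList<>();
        int position = -1;
        for (int s = 0; s < sentences.length; s++) {
            for (int t = 0; t < sentences[s].length; t++) {
                boolean startsLaterSentence = s > 0 && t == 0;
                position += startsLaterSentence ? SENTENCE_GAP : 1;
                tokens.add(new Token(sentences[s][t], position));
            }
            position += 1;
            tokens.add(new Token(BOUNDARY, position));
        }
        return tokens;
    }

    /** True if 'second' follows 'first' within 1 + slop positions. */
    static boolean matchesPhrase(List<Token> tokens, String first, String second, int slop) {
        for (Token a : tokens) {
            if (!a.term.equals(first)) continue;
            for (Token b : tokens) {
                int distance = b.position - a.position;
                if (b.term.equals(second) && distance >= 1 && distance <= 1 + slop) {
                    return true;
                }
            }
        }
        return false;
    }

    /** True if a boundary marker lies strictly between the two positions. */
    static boolean crossesBoundary(List<Token> tokens, int fromPosition, int toPosition) {
        for (Token t : tokens) {
            if (t.term.equals(BOUNDARY) && t.position > fromPosition && t.position < toPosition) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        List<Token> tokens = index(new String[][] {
            {"the", "quick", "fox"},
            {"jumps", "high"}
        });
        System.out.println(matchesPhrase(tokens, "quick", "fox", 0));  // same sentence
        System.out.println(matchesPhrase(tokens, "fox", "jumps", 0));  // across the boundary
        System.out.println(crossesBoundary(tokens, 2, 103));           // "fox" to "jumps"
    }
}
```

The gap alone handles the phrase-query case; the marker is what I would check for the other uses, at the cost of an extra term in the index for every sentence.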
Thanks,
Grant