Sentence boundary storage

Grant Ingersoll Fri, 28 Oct 2005 14:46:33 -0700

Hi,

I was wondering what people's experience is with storing sentence (or other) boundary information in Lucene. For instance, for phrase queries, you may not want to match when two terms lie on either side of a sentence boundary. I know for phrase queries the common approach is to make the position increment larger than one, which solves that immediate problem, but I have other uses for such information, too. Should I just store some type of boundary marker at the appropriate position and check to see if I have a boundary marker when doing my processing? I know I need an Analyzer that can detect the boundaries, for starters. What other issues have people run up against?
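To make the two options concrete, here is a minimal stand-alone sketch in plain Java. It does not use the Lucene API: the class SentencePositions, the marker term "<S>", and the gap of 100 are all hypothetical names and values chosen for illustration, and the sentences are assumed to be already split and tokenized. It models (a) a position increment larger than one at each sentence start, so an exact two-term phrase cannot match across a boundary, and (b) a marker token stored at the position of each sentence's first term, which other processing can consult.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-alone model of token positions; NOT the Lucene API.
class SentencePositions {
    // Assumed values: the gap only needs to exceed any phrase slop in use.
    static final int SENTENCE_GAP = 100;
    static final String MARKER = "<S>";

    /** A term together with the position it would occupy in the index. */
    static final class Token {
        final String term;
        final int position;
        Token(String term, int position) {
            this.term = term;
            this.position = position;
        }
    }

    /**
     * Assigns positions to pre-split sentences. The first term of every
     * sentence after the first is advanced by SENTENCE_GAP instead of 1,
     * and a MARKER token is stored at the same position as that first term.
     */
    static List<Token> assignPositions(List<List<String>> sentences) {
        List<Token> tokens = new ArrayList<>();
        int position = -1;
        for (List<String> sentence : sentences) {
            boolean startOfSentence = true;
            for (String term : sentence) {
                boolean firstTokenOverall = tokens.isEmpty();
                position += (startOfSentence && !firstTokenOverall) ? SENTENCE_GAP : 1;
                if (startOfSentence) {
                    tokens.add(new Token(MARKER, position));
                    startOfSentence = false;
                }
                tokens.add(new Token(term, position));
            }
        }
        return tokens;
    }

    /** Exact two-term phrase: 'second' must sit one position after 'first'. */
    static boolean matchesPhrase(List<Token> tokens, String first, String second) {
        for (Token a : tokens) {
            if (!a.term.equals(first)) continue;
            for (Token b : tokens) {
                if (b.term.equals(second) && b.position == a.position + 1) {
                    return true;
                }
            }
        }
        return false;
    }

    /** True if no sentence marker lies after the lower position and at or before the higher one. */
    static boolean sameSentence(List<Token> tokens, int positionA, int positionB) {
        int lo = Math.min(positionA, positionB);
        int hi = Math.max(positionA, positionB);
        for (Token t : tokens) {
            if (t.term.equals(MARKER) && t.position > lo && t.position <= hi) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        List<List<String>> sentences = Arrays.asList(
                Arrays.asList("the", "quick", "fox"),
                Arrays.asList("jumps", "high"));
        List<Token> tokens = assignPositions(sentences);
        for (Token t : tokens) {
            System.out.println(t.position + "\t" + t.term);
        }
        System.out.println("quick fox: " + matchesPhrase(tokens, "quick", "fox"));
        System.out.println("fox jumps: " + matchesPhrase(tokens, "fox", "jumps"));
    }
}
```

In this model the gap alone handles the phrase-query case, while the marker keeps the boundary itself recoverable for other processing. A real implementation would have to do the equivalent inside an Analyzer's token stream, and the marker term would need to be one that can never collide with a real term.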


Thanks,
Grant
