Example of Field.TermVector.WITH_POSITIONS_OFFSETS usage?

Sean O'Connor Tue, 23 Aug 2005 14:41:52 -0700

Hello,

I am trying to work through term positions and how to get them froma collection of hits. Does setting TermVector.WITH_POSITIONS_OFFSETS totrue save the start/end position of the term in the source text file? (I_think_ it does).

If so, where would I start for trying to make that informationaccessible in a "result set"? I believe it would be extending a query, ascorer, a hit, and/or a weight object. I will be wanting to process ALLhits, so I think will need to implement a hitcollector.

As an example of what I want, if I were looking for the offsetposition of "brown" in a properly indexed field containing "the lazybrown fox", I would like to get:

start==10
end==15 (assuming my counting is right)

Based on Paul Elschot's previous response to a similar question Ihad (which I am still working on), I _think_ I need to extend somethinglike the ExactPhraseScorer. While debugging with my IDE (Eclipse) I cansee that the weight object in the scorer contains a reference to thequery. The query contains the fields:

   Vector positions (just has ints of term positions in phrase?)
   Vector terms (vector of Term, just field name and field contents?)

The weight also seems to have an array of TermPositions, which haveSegmentTermPositions. I thought this was what I wanted, but I don't seethe proper start/end fields, or anything which seems to be on the righttrack.


   Can anyone point me in the right direction?
Thanks,

Sean



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Example of Field.TermVector.WITH_POSITIONS_OFFSETS usage?

Reply via email to