Thanks, Mike and Jack! Those are really good options. Just to clarify, though: is there a way to get, say, a vector of position increments directly from the index, without re-parsing the document contents?
--
Best Regards,
Igor

23.05.2013, 16:13, "Jack Krupansky" <j...@basetechnology.com>:
> It might be nice to inquire as to the largest position for a field in a
> document. Is that information kept anywhere? Not that I know of, although I
> suppose it can be calculated at runtime by running through all the terms of
> the field. Then he could just divide by 1000.
>
> -- Jack Krupansky
>
> -----Original Message-----
> From: Michael McCandless
> Sent: Thursday, May 23, 2013 6:28 AM
> To: Lucene Users
> Subject: Re: Getting position increments directly from the the index
>
> Do you actually index the sentence boundary as a token? If so, you
> could just get the totalTermFreq of that token?
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
> On Wed, May 22, 2013 at 10:11 AM, Igor Shalyminov
> <ishalymi...@yandex-team.ru> wrote:
>
>> Hello!
>>
>> I'm storing sentence bounds in the index as position increments of 1000.
>> I want to get the total number of sentences in the index, i.e. the number
>> of "1000" increment values.
>> Can I do that some other way rather than just loading each document and
>> extracting position increments with a custom Analyzer?
>>
>> --
>> Best Regards,
>> Igor Shalyminov
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> For additional commands, e-mail: java-user-h...@lucene.apache.org
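
A rough sketch of the arithmetic discussed in this thread, not a Lucene API example. It assumes the absolute token positions of one field in one document have already been read from the index somehow (that step is not shown here), and that the first token's position is its increment minus one, i.e. the position counter starts at -1. The class and method names (PositionGaps, incrementsFromPositions, countBoundaries) are made up for illustration. Tokens that share a position (increment 0) are collapsed, so only non-zero increments are recovered.

```java
import java.util.Arrays;

// Hypothetical helper: recovers position increments from absolute positions.
final class PositionGaps {

    // Turns absolute positions into increments between consecutive distinct positions.
    // Assumes the position counter starts at -1, so a first token at position 0
    // corresponds to an increment of 1.
    static int[] incrementsFromPositions(int[] positions) {
        int[] sorted = Arrays.stream(positions).distinct().sorted().toArray();
        int[] increments = new int[sorted.length];
        int previous = -1;
        for (int i = 0; i < sorted.length; i++) {
            increments[i] = sorted[i] - previous;
            previous = sorted[i];
        }
        return increments;
    }

    // Counts increments at least as large as the sentence-boundary increment.
    static int countBoundaries(int[] increments, int boundaryIncrement) {
        int count = 0;
        for (int inc : increments) {
            if (inc >= boundaryIncrement) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        // Made-up positions: three tokens, a 1000 gap, two tokens, another 1000 gap.
        int[] positions = {0, 1, 2, 1002, 1003, 2003};
        int[] increments = incrementsFromPositions(positions);
        System.out.println(Arrays.toString(increments)); // [1, 1, 1, 1000, 1, 1000]
        System.out.println(countBoundaries(increments, 1000)); // 2
    }
}
```

Counting the large gaps is exact for the made-up data above. Dividing the largest position by 1000, as suggested in the quoted reply, gives the same number only while the ordinary increments in a document add up to less than 1000.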