OK, let's back up a level. WHY are you building these vectors? Where I'm going with this is I wonder if this is an XY problem, see: http://people.apache.org/~hossman/#xyproblem
Best Erick On Thu, May 27, 2010 at 7:49 PM, kannan chandrasekaran <ckanna...@yahoo.com>wrote: > Uwe, > > I now see the problem with overlapping terms across segments...Thanks... > > Erik, > > Good point...My usecase for this is , > > I am trying to build vectors for individual terms and documents and I need > to know the size to handle memory constraints > > Thanks > Kannan > > >