Uwe, I now see the problem with overlapping terms across segments. Thanks.
Erik, good point. My use case is this: I am trying to build vectors for individual terms and documents, and I need to know the size in order to handle memory constraints.

Thanks,
Kannan