Re: Term Limit?

2009-04-04 Thread deminix
ll be our authoritative store and the data isn't within our control. I'll see what I can do about trying to break it during perf testing ;) On Sat, Apr 4, 2009 at 9:06 AM, Michael McCandless < luc...@mikemccandless.com> wrote: > On Sat, Apr 4, 2009 at 11:57 AM, deminix wrot

Re: Term Limit?

2009-04-04 Thread deminix
Yea. That is all that matters anyway right, is the limit at the segment level? On Sat, Apr 4, 2009 at 8:44 AM, Michael McCandless < luc...@mikemccandless.com> wrote: > On Sat, Apr 4, 2009 at 10:25 AM, deminix wrote: > > > AFAIK there isn't an api that returns the

Re: Term Limit?

2009-04-04 Thread deminix
My crude regex'ing of the code has me thinking it is only term vectors that are limited to 32 bits, since they allocate arrays. Otherwise it seems good. Does that sound right? On Sat, Apr 4, 2009 at 7:25 AM, deminix wrote: > Thanks for the clarification. > > I'm partit

Re: Term Limit?

2009-04-04 Thread deminix
> >> > >> But in a least a couple places internally, Lucene uses a java int to > >> hold the term number, which is actually a limit of 2,147,483,648 > >> terms. I'll update fileformats.html for 2.9. > > > > > >> > >> Mike > &

Term Limit?

2009-04-03 Thread deminix
http://lucene.apache.org/java/2_4_1/fileformats.html The file format page at the bottom cites that there is a 32 bit limit to term numbers. I fail to see where in the file formats documentation that is actually true. Is the bottom of the page simply out of date? I'm also wondering whether the c