Re: Big number of values for facets

2013-04-27 Thread Michael McCandless
On Fri, Apr 26, 2013 at 12:48 PM, Shai Erera wrote: > You can also try to use a different IntEncoder which compresses the values > better. Try FourFlags and the like. Perhaps it will allow you to index more > facets per document and it will be enough... though i should add "for the > time being" b

Re: Optimizing NRT search

2013-04-27 Thread Michael McCandless
On Fri, Apr 26, 2013 at 5:04 PM, Aleksey wrote: > Thanks for the response, Mike. Yes, I've come upon your blog before, it's > very helpful. > > I tried bigger batches, it seems the highest throughput I can get is > roughly 250 docs a second. From your blog, you updated your index at about > 1MB pe

Re: Too many unique terms

2013-04-27 Thread Manuel Le Normand
Hi, real thanks for the previous reply. For now i'm not able to make a separation between these useless words, whether they contain words or digits. I liked the idea of iterating with TermsEnum. Will it also delete the occurances of these terms in the other file formats (termVectors etc.)? As i un