On Fri, Apr 26, 2013 at 12:48 PM, Shai Erera wrote:
> You can also try to use a different IntEncoder which compresses the values
> better. Try FourFlags and the like. Perhaps it will allow you to index more
> facets per document and it will be enough... though i should add "for the
> time being" b
On Fri, Apr 26, 2013 at 5:04 PM, Aleksey wrote:
> Thanks for the response, Mike. Yes, I've come upon your blog before, it's
> very helpful.
>
> I tried bigger batches, it seems the highest throughput I can get is
> roughly 250 docs a second. From your blog, you updated your index at about
> 1MB pe
Hi, real thanks for the previous reply.
For now i'm not able to make a separation between these useless words,
whether they contain words or digits.
I liked the idea of iterating with TermsEnum. Will it also delete the
occurances of these terms in the other file formats (termVectors etc.)?
As i un