Pluggable compression allowing for alternatives to gzip for text compression for storing. Specifically I am interested in bzip2[1] as implemented in Apache Commons Compress[2]. While bzip2 compression is considerable slower than gzip (although decompression is not too much slower than gzip) it compresses much better than gzip (especially text).
Having the choice would be helpful, and for Lucene usage for non-text indexing, content specific compression algorithms may outperform the default gzip. And in these days of multi-core / multi-threading, perhaps we could convince the Apache Commons Compress team to implement a parallel Java version of bzip2 compression (theirs is single threaded), like pbzip2[3]. -glen [1]http://en.wikipedia.org/wiki/Bzip2 [2]http://commons.apache.org/compress/ [3]http://compression.ca/pbzip2/ On 24 February 2010 08:42, Grant Ingersoll <gsing...@apache.org> wrote: > What would it be? > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > > -- - --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org