subject:"Compression algorithm for posting lists"

RE: Compression algorithm for posting lists

2016-04-03 Thread Uwe Schindler

Hi, > Hey Adrien, > We are thinking of implementing XZ compression instead of > LZ4 for stored fields. Will it serve our purpose of saving the disk space > while trading off the speed? We were eager to know that why XZ > compression > is not given as an option for compression. XZ

Re: Compression algorithm for posting lists

2016-04-03 Thread Vishwas Jain

Hey Adrien, We are thinking of implementing XZ compression instead of LZ4 for stored fields. Will it serve our purpose of saving the disk space while trading off the speed? We were eager to know that why XZ compression is not given as an option for compression. Thanks Are posting

Re: Compression algorithm for posting lists

2016-03-31 Thread Adrien Grand

Are posting lists the biggest disk user of your index? Usually it is rather stored fields or term vectors. You can tell Lucene to compress stored fields more aggressively by passing BEST_COMPRESSION to the Lucene54Codec constructor. Also maybe there are some features of the index that you do not ne

Re: Compression algorithm for posting lists

2016-03-31 Thread Vishwas Jain

Hi Adrien, Thanks for the help, actually we are trying to compress the actual posting lists. Our main aim here is to save the disk space as much as possible occupied by the index created. Is only compressing the posting lists will suffice the problem or we have to explore more opti

Re: Compression algorithm for posting lists

2016-03-29 Thread Adrien Grand

BlockTreeTermsWriter.TermsWriter.finish writes a FST that serves as an index of the terms dictionary. It will be used at search time when seeking terms in the terms dictionary. Le lun. 28 mars 2016 à 14:02, Vishwas Jain a écrit : > Thanks for the reply and information. > I have som

Re: Compression algorithm for posting lists

2016-03-28 Thread Vishwas Jain

Thanks for the reply and information. I have some doubts regarding the implemenation of lucene54 codec when writing the posting lists using the lucene50 postinglistwriter while going through the code. What exactly does the finish() method in the TermsWriter class of the BlockTreeTerms

Re: Compression algorithm for posting lists

2016-03-28 Thread Vishwas Jain

Thanks for the reply and information. I have some doubts regarding the implemenation of lucene54 codec when writing the posting lists using the lucene50 postinglistwriter while going through the code. What exactly does the finish() method in the TermsWriter class of the BlockTreeTerms

Re: Compression algorithm for posting lists

2016-03-28 Thread Greg Bowyer

The posting list is compressed using a specialised technique aimed at pure numbers. Currently the codec uses a variant of Patched Frame of Reference coding to perform this compression. A good survey of such techniques can be found in the good IR books (https://mitpress.mit.edu/books/information-r

Compression algorithm for posting lists

2016-03-28 Thread Vishwas Jain

Hello , We are trying to implement better compression techniques in lucene54 codec of Apache Lucene. Currently there is no such compression for posting lists in lucene54 codec but LZ4 compression technique is used for stored fields. Does anyone know why there is no compression technique

RE: Compression algorithm for posting lists

Re: Compression algorithm for posting lists

Re: Compression algorithm for posting lists

Re: Compression algorithm for posting lists

Re: Compression algorithm for posting lists

Re: Compression algorithm for posting lists

Re: Compression algorithm for posting lists

Re: Compression algorithm for posting lists

Compression algorithm for posting lists

9 matches

Site Navigation

Mail list logo

Footer information