Adrien,

Thank you very much, and in particular:

On Mar 27, 2013, at 4:57 PM, Adrien Grand <[email protected]> wrote:

>  A codec describes
> the formats to use for every index file: postings format, stored
> fields format, term vectors format, norms format, etc. whereas a
> postings format only describes the format of the terms dictionary and
> postings lists.

That 1 sentence clarifies things immensely.  This was the level of thing that I 
felt fuzzy on, and once reading it, makes perfect sense.  (I'm sure it's 
explained somewhere and I just managed to miss it)

So there's "SimpleText" the posting format, and also "SimpleText" the codec, 
and they are related, and a deeper coupling than other codec vs. posting combos.

And this was also helpful:
> Codecs can't be chained. Some postings formats can: for example our
> BloomFilter postings format can wrap any other postings format.

And looks like this is the direction I want:
> ...changing the codec is a little harder: you need to define a CodecFactory 
> and
> configure it in your solrconfig.xml (see 
> http://wiki.apache.org/solr/SolrConfigXml#codecFactory).

Reply via email to