So basically prepend the information in the header of each message? Wouldn't 
this require changes in all readers/writers?

On Aug 20, 2013, at 9:19 AM, Guozhang Wang <wangg...@gmail.com> wrote:

> Hello Mark,
> 
> The Kafka message format starts with a magic byte indicating what kind of
> serialization is used for this message. And if this byte indicates Avro,
> you can layout your message as starting with the schemaId and then followed
> by message payload. Upon consumption, you can first get the schemaId, query
> Avro for the schema given the id, and then use schema to deserialize the
> message.
> 
> Guozhang
> 
> 
> On Tue, Aug 20, 2013 at 7:08 AM, Mark <static.void....@gmail.com> wrote:
> 
>> Can someone break down how message serialization would work with Avro?
>> I've read instead of adding a schema to every single event it would be wise
>> to add some sort of fingerprint with each message to identify which schema
>> it should used. What I'm having trouble understanding is, how do we read
>> the fingerprint without a schema? Don't we need the schema to deserialize?
>> Same question goes for working with Hadoop.. how does the input format
>> know which schema to use?
>> 
>> Thanks
> 
> 
> 
> 
> -- 
> -- Guozhang

Reply via email to