Neha, thanks for the response. 

So the only point of the magic byte is to indicate that the rest of the message 
is Avro encoded? I noticed that in Camus a 4 byte int id of the schema is 
written instead of the 16 byte SHA. Is this the new preferred way? Which is 
compatible with https://issues.apache.org/jira/browse/AVRO-1124?

Thanks again

On Aug 21, 2013, at 8:38 PM, Neha Narkhede <neha.narkh...@gmail.com> wrote:

> We define the LinkedIn Kafka message to have a magic byte (indicating Avro
> serialization), MD5 header followed by the payload. The Hadoop consumer
> reads the MD5, looks up the schema in the repository and deserializes the
> message.
> 
> Thanks,
> Neha
> 
> 
> On Wed, Aug 21, 2013 at 8:15 PM, Mark <static.void....@gmail.com> wrote:
> 
>> Does LinkedIn include the SHA of the schema into the header of each Avro
>> message they write or do they wrap the avro message and prepend the SHA?
>> 
>> In either case, how does the Hadoop consumer know what schema to read?

Reply via email to