Dear All: I need to generate some data with Samza, write it to Kafka, and then write it to Parquet format files. I was asked why I chose Avro as the output type from Samza to Kafka instead of Protocol Buffers, since all of our data currently on Kafka is Protocol Buffers. I explained that Avro-encoded messages have a smaller encoded size, need no extra code compilation step, are easier to implement, are fast to serialize and deserialize, and are supported in many languages. However, some people believe that an encoded Avro message takes about as much space as a Protocol Buffers message, and that once the schema is included the size could be much bigger.
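To make the size argument concrete, here is a minimal sketch that hand-encodes one record in both wire formats using only the Python standard library. It is not the Avro or Protocol Buffers library API. The `PageView` record, its fields `user_id` and `url`, and the sample values are hypothetical and chosen only for illustration. It assumes a non-negative `user_id` and a Protocol Buffers message with field numbers 1 and 2.

```python
import json

def varint(z: int) -> bytes:
    """Base-128 varint for a non-negative integer (used by both formats)."""
    out = bytearray()
    while True:
        byte = z & 0x7F
        z >>= 7
        if z:
            out.append(byte | 0x80)  # more bytes follow
        else:
            out.append(byte)
            return bytes(out)

def avro_long(n: int) -> bytes:
    """Avro int/long: zigzag-encode, then write as a varint."""
    return varint((n << 1) ^ (n >> 63))

def avro_string(s: str) -> bytes:
    """Avro string: length as an Avro long, then the UTF-8 bytes."""
    data = s.encode("utf-8")
    return avro_long(len(data)) + data

# Hypothetical schema, used only for this illustration.
SCHEMA = {
    "type": "record",
    "name": "PageView",
    "fields": [
        {"name": "user_id", "type": "long"},
        {"name": "url", "type": "string"},
    ],
}

def avro_encode(user_id: int, url: str) -> bytes:
    # An Avro record is just its field values in schema order, with no tags.
    return avro_long(user_id) + avro_string(url)

def protobuf_encode(user_id: int, url: str) -> bytes:
    # Each field carries a tag: (field_number << 3) | wire_type.
    # Assumes field 1 is int64 (wire type 0) and field 2 is string (wire type 2),
    # and that user_id is non-negative.
    data = url.encode("utf-8")
    return (
        varint((1 << 3) | 0) + varint(user_id)
        + varint((2 << 3) | 2) + varint(len(data)) + data
    )

avro_body = avro_encode(42, "/home")
pb_body = protobuf_encode(42, "/home")
schema_json = json.dumps(SCHEMA, separators=(",", ":")).encode("utf-8")

print("Avro body bytes:      ", len(avro_body))
print("Protobuf body bytes:  ", len(pb_body))
print("Avro schema JSON bytes:", len(schema_json))
print("Avro body + schema:   ", len(avro_body) + len(schema_json))
```

For this small record the Avro body is slightly smaller because it carries no per-field tags, while the schema JSON is several times larger than the body. The schema cost only applies per message if the schema is actually embedded in every message; if readers obtain the schema some other way, only the body is sent.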
I am wondering whether there are any other advantages that would make you choose Avro as your message type on Kafka. Sincerely, Selina