samza-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Selina Tech <>
Subject Avro vs Protocol buffer for Samza output
Date Wed, 18 Nov 2015 23:43:14 GMT
Dear All:

      I need to generate some data by Samza to Kafka and then write to
Parquet formate file.  I was asked why I choose Avro type as my Samza
output to Kafka instead of Protocol Buffer. Since currently our data on
Kafka are all Protocol buffer.
      I explained for Avro encoded message -- The encoded size is smaller,
no extra code compile, implementation easier.  fast to
serialize/deserialize and support a lot language.  However some people
believe when encoded the Avro message take as much space as Protocol
buffer, but with schema, the size could be much bigger.

      I am wondering if there are any other advantages make you choose Avro
as your message type at Kafka?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message