[ https://issues.apache.org/jira/browse/KAFKA-2367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14694407#comment-14694407 ]
Gwen Shapira commented on KAFKA-2367:
-------------------------------------

I'm a strong +1 on using Avro instead of maintaining our own clone of selected parts. The main reason is that in Sqoop2 we maintained a clone of selected parts, and we spent way too much time fixing issues in that layer and maintaining it.

Avro is, well... Avro. It's been used and tested quite a bit. I agree that Avro is not perfect; however, we can work with the Avro community to resolve particularly painful issues.

> Add Copycat runtime data API
> ----------------------------
>
>                 Key: KAFKA-2367
>                 URL: https://issues.apache.org/jira/browse/KAFKA-2367
>             Project: Kafka
>          Issue Type: Sub-task
>          Components: copycat
>            Reporter: Ewen Cheslack-Postava
>            Assignee: Ewen Cheslack-Postava
>             Fix For: 0.8.3
>
>
> Design the API used for runtime data in Copycat. This API is used to
> construct schemas and records that Copycat processes. This needs to be a
> fairly general data model (think Avro, JSON, Protobufs, Thrift) in order to
> support complex, varied data types that may be input from/output to many data
> systems.
> This issue should also address the serialization interfaces used
> within Copycat, which translate the runtime data into serialized byte[] form.
> It is important that these be considered together because the data format can
> be used in multiple ways (records, partition IDs, partition offsets), so it
> and the corresponding serializers must be sufficient for all these use cases.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)