[jira] [Commented] (KAFKA-2367) Add Copycat runtime data API

Gwen Shapira (JIRA) Wed, 12 Aug 2015 16:29:08 -0700

    [ 
https://issues.apache.org/jira/browse/KAFKA-2367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14694407#comment-14694407
 ]


Gwen Shapira commented on KAFKA-2367:
-------------------------------------

I'm a strong +1 on using Avro instead of maintaining our own clone of selected 
parts. Main reason is that in Sqoop2 we maintained a clone of selected parts 
and we spent way too much time fixing issues in that layer and maintaining it. 

Avro is well... Avro. Its been used and tested quite a bit.

I agree that Avro is not perfect, however, we can work with the Avro community 
to resolve particularly painful issues.

> Add Copycat runtime data API
> ----------------------------
>
>                 Key: KAFKA-2367
>                 URL: https://issues.apache.org/jira/browse/KAFKA-2367
>             Project: Kafka
>          Issue Type: Sub-task
>          Components: copycat
>            Reporter: Ewen Cheslack-Postava
>            Assignee: Ewen Cheslack-Postava
>             Fix For: 0.8.3
>
>
> Design the API used for runtime data in Copycat. This API is used to 
> construct schemas and records that Copycat processes. This needs to be a 
> fairly general data model (think Avro, JSON, Protobufs, Thrift) in order to 
> support complex, varied data types that may be input from/output to many data 
> systems.
> This should issue should also address the serialization interfaces used 
> within Copycat, which translate the runtime data into serialized byte[] form. 
> It is important that these be considered together because the data format can 
> be used in multiple ways (records, partition IDs, partition offsets), so it 
> and the corresponding serializers must be sufficient for all these use cases.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (KAFKA-2367) Add Copycat runtime data API

Reply via email to