Hi all,

With the initial patch of the Copycat APIs under review, I want to get started on the data API. I wrote up some notes with a rough proposal for what it should cover, along with a few requirements, here:
https://cwiki.apache.org/confluence/display/KAFKA/Copycat+Data+API
Since the data API and serializers are closely related, the write-up also covers serializers and sketches how I think different serialization formats could be implemented, because we want Copycat to work with a variety of pluggable serialization formats.

It would be easy to bikeshed on the set of supported primitive types or the semantics of optional fields and default values. But I think the most important output from this discussion would be agreement that the schemas described are sufficiently rich for connectors, can handle important cases like schema updates, and still abstract away the underlying serialization libraries well enough that we aren't effectively tied to one format despite providing a pluggable interface.

--
Thanks,
Ewen