Thanks, I will review the PR. - Sijie
On Mon, Feb 15, 2021 at 2:47 AM Enrico Olivelli <eolive...@gmail.com> wrote: > Sijie, > > I managed to implement Avro support In KafkaBytesSource following your > suggestions. Thanks. > > I would like to commit this initial patch and then add support for all of > the primitive Schemas as you did in (1) and for JSON. > If you prefer I can continue to enhance this patch. > > Enrico > > (1) > > https://github.com/streamnative/pulsar-io-kafka/blob/master/src/main/java/io/streamnative/connectors/kafka/KafkaSource.java#L338 > > Il giorno lun 15 feb 2021 alle ore 06:01 Sijie Guo <guosi...@gmail.com> ha > scritto: > > > Hi Enrico, > > > > Thank you for working on this! > > > > But as I mentioned in the pull request, we should avoid using a > > one-connector-per-schema model. That model probably works with other > > connectors that have a very limited number of schemas. If you are going > to > > implement a schema-aware Kafka connector, that model is impossible to > > maintain, because it will introduce N * N connectors where N is the > number > > of supported schemas. > > > > We should maintain one "bytes" connector and transfer the Kafka schema to > > the Pulsar schema. I have written an enhanced Kafka connector > > <https://github.com/streamnative/pulsar-io-kafka> two years ago. > > > > You just need to maintain one connector: > > > > > https://github.com/streamnative/pulsar-io-kafka/blob/master/src/main/java/io/streamnative/connectors/kafka/KafkaSource.java#L94 > > Then convert Kafka SerDe to Pulsar schema: > > > > > https://github.com/streamnative/pulsar-io-kafka/blob/master/src/main/java/io/streamnative/connectors/kafka/KafkaSource.java#L338 > > > > I am happy to submit a PR to merge those changes back. > > > > - Sijie > > > > On Thu, Feb 11, 2021 at 11:48 PM Enrico Olivelli <eolive...@gmail.com> > > wrote: > > > > > Hello everyone, > > > here in our Pulsar repository we have a simple Kafka Connector for > Pulsar > > > IO composed by a Sink and a Source. > > > https://github.com/apache/pulsar/tree/master/pulsar-io/kafka > > > > > > I have started to work on a set of enhancements to this connector in > > order > > > to make it more powerful and to better fit the needs of enterprise > users. > > > > > > The first patch I have submitted is about supporting Avro encoded > > messages > > > + Confluent Schema Registry in the KafkaSource > > > https://github.com/apache/pulsar/pull/9448 > > > > > > The patch is only the first one of a bigger work that we have to do in > > > order to have a fully usable Connector for non-trivial use cases. > > > > > > I will be happy to follow up with other patches and especially to draw > a > > > little roadmap about the features that we want to implement and provide > > to > > > the community. > > > > > > Please take a look to the patch and share your thoughts > > > > > > Regards > > > Enrico > > > > > >