Il Lun 1 Mar 2021, 22:20 Sijie Guo <guosi...@gmail.com> ha scritto: > Enrico - I have just reviewed the PR. I don't think you addressed your > comments. I still have the concern how this PR is implemented. I'd prefer > to keep the Kafka deserializer as simple as possible. We should keep the > schema cache and the logic to fetch confluent schema in the source > connector. >
Okay. I will update the patch accordingly. Thanks Enrico > - Sijie > > On Mon, Mar 1, 2021 at 1:04 PM Sijie Guo <guosi...@gmail.com> wrote: > > > Apologized for the delay! Reviewing it now. > > > > - Sijie > > > > On Sun, Feb 28, 2021 at 11:29 PM Enrico Olivelli <eolive...@gmail.com> > > wrote: > > > >> Hello, > >> Please bear with me, I really want this work to go forward :-) > >> > >> @Sijie, I know that you are super busy, so I would like to not put > >> pressure on you, and I thank you very much for your useful comments on > >> the PR. > >> > >> Our Pulsar community is big and it is still growing > >> IMHO it would be a very good thing that others in the community take a > >> look as well. > >> > >> The first patch is not so big work and it is hard to review. > >> As a general approach I prefer to send little patches, this way it is > >> easy to understand what's going on. > >> > >> Code is not written in the stone, and we can always make improvements. > >> My plan is to continue working on the Kafka connector and send more > >> patches until I have covered all of the use cases of my interest > >> (basically around enterprise features, like Schema, Multi topic...) > >> > >> I would like to work directly here within the project by sending pull > >> requests to the ASF repo and I am not willing to not create my own > >> Kafka Connector fork. > >> I believe this is the best approach for the community, > >> but I need some support from the group. > >> > >> Best regards > >> Enrico > >> > >> > >> > >> Il giorno gio 25 feb 2021 alle ore 08:56 Sijie Guo > >> <guosi...@gmail.com> ha scritto: > >> > > >> > Apologized for the delay! Will review it again today or tomorrow. > >> > > >> > - Sijie > >> > > >> > On Wed, Feb 24, 2021 at 3:49 AM Enrico Olivelli <eolive...@gmail.com> > >> wrote: > >> > > >> > > Hello community, > >> > > It looks like only Sijie started to review this work. > >> > > https://github.com/apache/pulsar/pull/9448 > >> > > > >> > > I wonder if others that are interested in Kafka compatibility may > >> have > >> > > time to check it out > >> > > > >> > > As said, this is only the first part of a series of implementations > >> we want > >> > > to do about this Connector > >> > > > >> > > Enrico > >> > > > >> > > Il giorno mar 16 feb 2021 alle ore 05:31 Sijie Guo < > >> guosi...@gmail.com> ha > >> > > scritto: > >> > > > >> > > > Thanks, I will review the PR. > >> > > > > >> > > > - Sijie > >> > > > > >> > > > On Mon, Feb 15, 2021 at 2:47 AM Enrico Olivelli < > >> eolive...@gmail.com> > >> > > > wrote: > >> > > > > >> > > > > Sijie, > >> > > > > > >> > > > > I managed to implement Avro support In KafkaBytesSource > following > >> your > >> > > > > suggestions. Thanks. > >> > > > > > >> > > > > I would like to commit this initial patch and then add support > >> for all > >> > > of > >> > > > > the primitive Schemas as you did in (1) and for JSON. > >> > > > > If you prefer I can continue to enhance this patch. > >> > > > > > >> > > > > Enrico > >> > > > > > >> > > > > (1) > >> > > > > > >> > > > > > >> > > > > >> > > > >> > https://github.com/streamnative/pulsar-io-kafka/blob/master/src/main/java/io/streamnative/connectors/kafka/KafkaSource.java#L338 > >> > > > > > >> > > > > Il giorno lun 15 feb 2021 alle ore 06:01 Sijie Guo < > >> guosi...@gmail.com > >> > > > > >> > > > ha > >> > > > > scritto: > >> > > > > > >> > > > > > Hi Enrico, > >> > > > > > > >> > > > > > Thank you for working on this! > >> > > > > > > >> > > > > > But as I mentioned in the pull request, we should avoid using > a > >> > > > > > one-connector-per-schema model. That model probably works with > >> other > >> > > > > > connectors that have a very limited number of schemas. If you > >> are > >> > > going > >> > > > > to > >> > > > > > implement a schema-aware Kafka connector, that model is > >> impossible to > >> > > > > > maintain, because it will introduce N * N connectors where N > is > >> the > >> > > > > number > >> > > > > > of supported schemas. > >> > > > > > > >> > > > > > We should maintain one "bytes" connector and transfer the > Kafka > >> > > schema > >> > > > to > >> > > > > > the Pulsar schema. I have written an enhanced Kafka connector > >> > > > > > <https://github.com/streamnative/pulsar-io-kafka> two years > >> ago. > >> > > > > > > >> > > > > > You just need to maintain one connector: > >> > > > > > > >> > > > > > > >> > > > > > >> > > > > >> > > > >> > https://github.com/streamnative/pulsar-io-kafka/blob/master/src/main/java/io/streamnative/connectors/kafka/KafkaSource.java#L94 > >> > > > > > Then convert Kafka SerDe to Pulsar schema: > >> > > > > > > >> > > > > > > >> > > > > > >> > > > > >> > > > >> > https://github.com/streamnative/pulsar-io-kafka/blob/master/src/main/java/io/streamnative/connectors/kafka/KafkaSource.java#L338 > >> > > > > > > >> > > > > > I am happy to submit a PR to merge those changes back. > >> > > > > > > >> > > > > > - Sijie > >> > > > > > > >> > > > > > On Thu, Feb 11, 2021 at 11:48 PM Enrico Olivelli < > >> > > eolive...@gmail.com> > >> > > > > > wrote: > >> > > > > > > >> > > > > > > Hello everyone, > >> > > > > > > here in our Pulsar repository we have a simple Kafka > >> Connector for > >> > > > > Pulsar > >> > > > > > > IO composed by a Sink and a Source. > >> > > > > > > > https://github.com/apache/pulsar/tree/master/pulsar-io/kafka > >> > > > > > > > >> > > > > > > I have started to work on a set of enhancements to this > >> connector > >> > > in > >> > > > > > order > >> > > > > > > to make it more powerful and to better fit the needs of > >> enterprise > >> > > > > users. > >> > > > > > > > >> > > > > > > The first patch I have submitted is about supporting Avro > >> encoded > >> > > > > > messages > >> > > > > > > + Confluent Schema Registry in the KafkaSource > >> > > > > > > https://github.com/apache/pulsar/pull/9448 > >> > > > > > > > >> > > > > > > The patch is only the first one of a bigger work that we > have > >> to do > >> > > > in > >> > > > > > > order to have a fully usable Connector for non-trivial use > >> cases. > >> > > > > > > > >> > > > > > > I will be happy to follow up with other patches and > >> especially to > >> > > > draw > >> > > > > a > >> > > > > > > little roadmap about the features that we want to implement > >> and > >> > > > provide > >> > > > > > to > >> > > > > > > the community. > >> > > > > > > > >> > > > > > > Please take a look to the patch and share your thoughts > >> > > > > > > > >> > > > > > > Regards > >> > > > > > > Enrico > >> > > > > > > > >> > > > > > > >> > > > > > >> > > > > >> > > > >> > > >