Hi Yijie,

I also agree with Jark on separating the Catalog part into another FLIP.

With FLIP-27[1] also in the air, it is also probably great to split and
unblock the sink implementation contribution.
I would suggest either putting in a detail implementation plan section in
the doc, or (maybe too much separation?) splitting them into different
FLIPs. What do you guys think?

--
Rong

[1]
https://cwiki.apache.org/confluence/display/FLINK/FLIP-27%3A+Refactor+Source+Interface

On Wed, Oct 16, 2019 at 9:00 PM Jark Wu <imj...@gmail.com> wrote:

> Hi Yijie,
>
> Thanks for the design document. I agree with Bowen that the catalog part
> needs more details.
> And I would suggest to separate Pulsar Catalog as another FLIP. IMO, it has
> little to do with source/sink.
> Having a separate FLIP can unblock the contribution for sink (or source)
> and keep the discussion more focus.
> I also left some comments in the documentation.
>
> Thanks,
> Jark
>
> On Thu, 17 Oct 2019 at 11:24, Yijie Shen <henry.yijies...@gmail.com>
> wrote:
>
> > Hi Bowen,
> >
> > Thanks for your comments. I'll add catalog details as you suggested.
> >
> > One more question: since we decide to not implement source part of the
> > connector at the moment.
> > What can users do with a Pulsar catalog?
> > Create a table backed by Pulsar and check existing pulsar tables to see
> > their schemas? Drop tables maybe?
> >
> > Best,
> > Yijie
> >
> > On Thu, Oct 17, 2019 at 1:04 AM Bowen Li <bowenl...@gmail.com> wrote:
> >
> > > Hi Yijie,
> > >
> > > Per the discussion, maybe you can move pulsar source to 'future work'
> > > section in the FLIP for now?
> > >
> > > Besides, the FLIP seems to be quite rough at the moment, and I'd
> > recommend
> > > to add more details .
> > >
> > > A few questions mainly regarding the proposed pulsar catalog.
> > >
> > >    - Can you provide some background of pulsar schema registry and how
> it
> > >    works?
> > >    - The proposed design of pulsar catalog is very vague now, can you
> > >    share some details of how a pulsar catalog would work internally?
> E.g.
> > >       - which APIs does it support exactly? E.g. I see from your
> > >       prototype that table creation is supported but not alteration.
> > >       - is it going to connect to a pulsar schema registry via a http
> > >       client or a pulsar client, etc
> > >       - will it be able to handle multiple versions of pulsar, or just
> > >       one? How is compatibility handles between different Flink-Pulsar
> > versions?
> > >       - will it support only reading from pulsar schema registry , or
> > >       both read/write? Will it work end-to-end in Flink SQL for users
> to
> > create
> > >       and manipulate a pulsar table such as "CREATE TABLE t WITH
> > >       PROPERTIES(type=pulsar)" and "DROP TABLE t"?
> > >       - Is a pulsar topic always gonna be a non-partitioned table? How
> is
> > >       a partitioned topic mapped to a Flink table?
> > >    - How to map Flink's catalog/database namespace to pulsar's
> > >    multi-tenant namespaces? I'm not very familiar with how multi
> tenancy
> > works
> > >    in pulsar, and some background context/use cases may help here too.
> > E.g.
> > >       - can a pulsar client/consumer/producer be multiple-tenant at the
> > >       same time?
> > >       - how does authentication work in pulsar's multi-tenancy and the
> > >       catalog? asking since I didn't see the proposed pulsar catalog
> has
> > >       username/password configs
> > >       - the FLIP seems propose mapping a pulsar cluster and
> > >       'tenant/namespace' respectively to Flink's 'catalog' and
> > 'database'. I
> > >       wonder whether it totally makes sense, or should we actually map
> > "tenant"
> > >       to "catalog", and "namespace" to "database"?
> > >
> > > Cheers,
> > > Bowen
> > >
> > > On Fri, Sep 20, 2019 at 1:16 AM Yijie Shen <henry.yijies...@gmail.com>
> > > wrote:
> > >
> > >> Hi everyone,
> > >>
> > >> Per discussion in the previous thread
> > >> <
> > >>
> >
> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Contribute-Pulsar-Flink-connector-back-to-Flink-tc32538.html
> > >> >,
> > >> I have created FLIP-72 to kick off a more detailed discussion on the
> > Flink
> > >> Pulsar connector:
> > >>
> > >>
> > >>
> >
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-72%3A+Introduce+Pulsar+Connector
> > >>
> > >> In short, the connector has the following features:
> > >>
> > >>    -
> > >>
> > >>    Pulsar as a streaming source with exactly-once guarantee.
> > >>    -
> > >>
> > >>    Sink streaming results to Pulsar with at-least-once semantics.
> > >>    -
> > >>
> > >>    Build upon Flink new Table API Type system (FLIP-37
> > >>    <
> > >>
> >
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-37%3A+Rework+of+the+Table+API+Type+System
> > >> >
> > >>    ), and can automatically (de)serialize messages with the help of
> > Pulsar
> > >>    schema.
> > >>    -
> > >>
> > >>    Integrate with Flink new Catalog API (FLIP-30
> > >>    <
> > >>
> >
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-30%3A+Unified+Catalog+APIs
> > >> >),
> > >>    which enables the use of Pulsar topics as tables in Table API as
> well
> > >> as
> > >>    SQL client.
> > >>
> > >>
> > >>
> > >>
> >
> https://docs.google.com/document/d/1rES79eKhkJxrRfQp1b3u8LB2aPaq-6JaDHDPJIA8kMY/edit#heading=h.28v5v23yeq1u
> > >>
> > >>
> > >> Would love to here your thoughts on this.
> > >>
> > >> Best,
> > >> Yijie
> > >>
> > >
> >
>

Reply via email to