Hi , I have started a discussion about improving the new TableSource and TableSink: http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-146-Improve-new-TableSource-and-TableSink-interfaces-td45161.html It includes parallelism setting, welcome to join the discussion and look forward to your comments.
Best, Jingsong On Mon, Sep 21, 2020 at 11:03 AM Jark Wu <imj...@gmail.com> wrote: > Since FLIP-95, the parallelism is decoupled from the runtime class > (DataStream/SourceFunction), > so we need to have an API to tell the planner what the parallelism of the > source/sink is. > > This is indeed the purpose of a previous discussion: [DISCUSS] Introduce > SupportsParallelismReport and SupportsStatisticsReport > We can continue the discussion there. > > Best, > Jark > > [1]: > > http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Introduce-SupportsParallelismReport-and-SupportsStatisticsReport-for-Hive-and-Filesystem-td43531.html > > On Sun, 20 Sep 2020 at 23:14, 刘大龙 <ld...@zju.edu.cn> wrote: > > > > > +1 > > > > > -----原始邮件----- > > > 发件人: "Benchao Li" <libenc...@apache.org> > > > 发送时间: 2020-09-20 16:28:20 (星期日) > > > 收件人: dev <dev@flink.apache.org> > > > 抄送: > > > 主题: Re: [DISCUSS] Support source/sink parallelism config in Flink sql > > > > > > Hi admin, > > > > > > Thanks for bringing up this discussion. > > > IMHO, it's a valuable feature. We also added this feature for our > > internal > > > SQL engine. > > > And our way is very similar to your proposal. > > > > > > Regarding the implementation, there is one shorthand that we should > > modify > > > each connector > > > to support this property. > > > We can wait for others' opinion whether this is a valid proposal. If > yes, > > > then we can discuss > > > the implementation detailedly. > > > > > > admin <17626017...@163.com> 于2020年9月10日周四 上午1:19写道: > > > > > > > Hi devs: > > > > Currently,Flink sql does not support source/sink parallelism > > config.So,it > > > > will result in wasting or lacking resources in some cases. > > > > I think it is necessary to introduce configuration of source/sink > > > > parallelism in sql. > > > > From my side,i have the solution for this feature.Add parallelism > > config > > > > in ‘with’ properties of DDL. > > > > > > > > Before 1.11,we can get parallelism and then set it to > > > > StreamTableSink#consumeDataStream or StreamTableSource#getDataStream > > > > After 1.11,we can get parallelism from catalogTable and then set it > to > > > > transformation in CommonPhysicalTableSourceScan or > CommonPhysicalSink. > > > > > > > > What do you think? > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > > > > > Best, > > > Benchao Li > > > -- Best, Jingsong Lee