Re: [DISCUSS] Features for Apache Flink 1.10

Biao Liu Wed, 11 Sep 2019 21:07:36 -0700

Thanks Gary for kicking off the discussion.

+1 for the feature freeze time. Also thanks Gary and Yu Li for volunteering
as the release manager.


BTW, I'm working on refactoring of `CheckpointCoordinator` [1]. It would be
great if it is included in 1.10.

1. https://issues.apache.org/jira/browse/FLINK-13698

Thanks,
Biao /'bɪ.aʊ/



On Wed, 11 Sep 2019 at 18:33, Aljoscha Krettek <aljos...@apache.org> wrote:

> Hi,
>
> Thanks for putting together the list! And I’m +1 for the suggested
> release timeline and also for Gary and Yu as the release managers.
>
> Best,
> Aljoscha
>
> On 9 Sep 2019, at 7:39, Yu Li wrote:
>
> > Hi Xuefu,
> >
> > If I understand it correctly, the data type support work should be
> > included
> > in the "Table API improvements->Finish type system" part, please check
> > it
> > and let us know if anything missing there. Thanks.
> >
> > Best Regards,
> > Yu
> >
> >
> > On Mon, 9 Sep 2019 at 11:14, Xuefu Z <usxu...@gmail.com> wrote:
> >
> >> Looking at feature list, I don't see an item for complete the data
> >> type
> >> support. Specifically, high precision timestamp is needed to Hive
> >> integration, as it's so common. Missing it would damage the
> >> completeness of
> >> our Hive effort.
> >>
> >> Thanks,
> >> Xuefu
> >>
> >> On Sat, Sep 7, 2019 at 7:06 PM Xintong Song <tonysong...@gmail.com>
> >> wrote:
> >>
> >>> Thanks Gray and Yu for compiling the feature list and kicking off
> >>> this
> >>> discussion.
> >>>
> >>> +1 for Gary and Yu being the release managers for Flink 1.10.
> >>>
> >>> Thank you~
> >>>
> >>> Xintong Song
> >>>
> >>>
> >>>
> >>> On Sat, Sep 7, 2019 at 4:58 PM Till Rohrmann <trohrm...@apache.org>
> >> wrote:
> >>>
> >>>> Thanks for compiling the list of 1.10 efforts for the community
> >>>> Gary. I
> >>>> think this helps a lot to better understand what the community is
> >>> currently
> >>>> working on.
> >>>>
> >>>> Thanks for volunteering as the release managers for the next major
> >>>> release. +1 for Gary and Yu being the RMs for Flink 1.10.
> >>>>
> >>>> Cheers,
> >>>> Till
> >>>>
> >>>> On Sat, Sep 7, 2019 at 7:26 AM Zhu Zhu <reed...@gmail.com> wrote:
> >>>>
> >>>>> Thanks Gary for kicking off this discussion.
> >>>>> Really appreciate that you and Yu offer to help to manage 1.10
> >> release.
> >>>>>
> >>>>> +1 for Gary and Yu as release managers.
> >>>>>
> >>>>> Thanks,
> >>>>> Zhu Zhu
> >>>>>
> >>>>> Dian Fu <dian0511...@gmail.com> 于2019年9月7日周六
> >>>>> 下午12:26写道：
> >>>>>
> >>>>>> Hi Gary,
> >>>>>>
> >>>>>> Thanks for kicking off the release schedule of 1.10. +1 for you
> >>>>>> and
> >>> Yu
> >>>> Li
> >>>>>> as the release manager.
> >>>>>>
> >>>>>> The feature freeze/release time sounds reasonable.
> >>>>>>
> >>>>>> Thanks,
> >>>>>> Dian
> >>>>>>
> >>>>>>> 在 2019年9月7日，上午11:30，Jark Wu <imj...@gmail.com>
> >>>>>>> 写道：
> >>>>>>>
> >>>>>>> Thanks Gary for kicking off the discussion for 1.10 release.
> >>>>>>>
> >>>>>>> +1 for Gary and Yu as release managers. Thank you for you
> >>>>>>> effort.
> >>>>>>>
> >>>>>>> Best,
> >>>>>>> Jark
> >>>>>>>
> >>>>>>>
> >>>>>>>> 在 2019年9月7日，00:52，zhijiang
> >>>>>>>> <wangzhijiang...@aliyun.com.INVALID>
> >>> 写道：
> >>>>>>>>
> >>>>>>>> Hi Gary,
> >>>>>>>>
> >>>>>>>> Thanks for kicking off the features for next release 1.10.  I
> >>>>>>>> am
> >>>> very
> >>>>>> supportive of you and Yu Li to be the relaese managers.
> >>>>>>>>
> >>>>>>>> Just mention another two improvements which want to be covered
> >> in
> >>>>>> FLINK-1.10 and I already confirmed with Piotr to reach an
> >>>>>> agreement
> >>>>> before.
> >>>>>>>>
> >>>>>>>> 1. Data serialize and copy only once for broadcast partition
> >> [1]:
> >>> It
> >>>>>> would improve the throughput performance greatly in broadcast
> >>>>>> mode
> >>> and
> >>>>> was
> >>>>>> actually proposed in Flink-1.8. Most of works already done before
> >> and
> >>>>> only
> >>>>>> left the last critical jira/PR. It will not take much efforts to
> >> make
> >>>> it
> >>>>>> ready.
> >>>>>>>>
> >>>>>>>> 2. Let Netty use Flink's buffers directly in credit-based mode
> >>> [2] :
> >>>>> It
> >>>>>> could avoid memory copy from netty stack to flink managed network
> >>>> buffer.
> >>>>>> The obvious benefit is decreasing the direct memory overhead
> >> greatly
> >>> in
> >>>>>> large-scale jobs. I also heard of some user cases encounter
> >>>>>> direct
> >>> OOM
> >>>>>> caused by netty memory overhead. Actually this improvment was
> >>> proposed
> >>>> by
> >>>>>> nico in FLINK-1.7 and always no time to focus then. Yun Gao
> >>>>>> already
> >>>>>> submitted a PR half an year ago but have not been reviewed yet. I
> >>> could
> >>>>>> help review the deign and PR codes to make it ready.
> >>>>>>>>
> >>>>>>>> And you could make these two items as lowest priority if
> >> possible.
> >>>>>>>>
> >>>>>>>> [1] https://issues.apache.org/jira/browse/FLINK-10745
> >>>>>>>> [2] https://issues.apache.org/jira/browse/FLINK-10742
> >>>>>>>>
> >>>>>>>> Best,
> >>>>>>>> Zhijiang
> >>>>>>>>
> >> ------------------------------------------------------------------
> >>>>>>>> From:Gary Yao <g...@apache.org>
> >>>>>>>> Send Time:2019年9月6日(星期五) 17:06
> >>>>>>>> To:dev <dev@flink.apache.org>
> >>>>>>>> Cc:carp84 <car...@gmail.com>
> >>>>>>>> Subject:[DISCUSS] Features for Apache Flink 1.10
> >>>>>>>>
> >>>>>>>> Hi community,
> >>>>>>>>
> >>>>>>>> Since Apache Flink 1.9.0 has been released more than 2 weeks
> >> ago,
> >>> I
> >>>>>> want to
> >>>>>>>> start kicking off the discussion about what we want to achieve
> >> for
> >>>> the
> >>>>>> 1.10
> >>>>>>>> release.
> >>>>>>>>
> >>>>>>>> Based on discussions with various people as well as
> >>>>>>>> observations
> >>>> from
> >>>>>>>> mailing
> >>>>>>>> list threads, Yu Li and I have compiled a list of features that
> >> we
> >>>>> deem
> >>>>>>>> important to be included in the next release. Note that the
> >>> features
> >>>>>>>> presented
> >>>>>>>> here are not meant to be exhaustive. As always, I am sure that
> >>> there
> >>>>>> will be
> >>>>>>>> other contributions that will make it into the next release.
> >> This
> >>>>> email
> >>>>>>>> thread
> >>>>>>>> is merely to kick off a discussion, and to give users and
> >>>> contributors
> >>>>>> an
> >>>>>>>> understanding where the focus of the next release lies. If
> >>>>>>>> there
> >>> is
> >>>>>> anything
> >>>>>>>> we have missed that somebody is working on, please reply to
> >>>>>>>> this
> >>>>> thread.
> >>>>>>>>
> >>>>>>>>
> >>>>>>>> ** Proposed features and focus
> >>>>>>>>
> >>>>>>>> Following the contribution of Blink to Apache Flink, the
> >> community
> >>>>>> released
> >>>>>>>> a
> >>>>>>>> preview of the Blink SQL Query Processor, which offers better
> >> SQL
> >>>>>> coverage
> >>>>>>>> and
> >>>>>>>> improved performance for batch queries, in Flink 1.9.0.
> >>>>>>>> However,
> >>> the
> >>>>>>>> integration of the Blink query processor is not fully completed
> >>> yet
> >>>> as
> >>>>>> there
> >>>>>>>> are still pending tasks, such as implementing full TPC-DS
> >> support.
> >>>>> With
> >>>>>> the
> >>>>>>>> next Flink release, we aim at finishing the Blink integration.
> >>>>>>>>
> >>>>>>>> Furthermore, there are several ongoing work threads addressing
> >>>>>> long-standing
> >>>>>>>> issues reported by users, such as improving checkpointing under
> >>>>>>>> backpressure,
> >>>>>>>> and limiting RocksDBs native memory usage, which can be
> >> especially
> >>>>>>>> problematic
> >>>>>>>> in containerized Flink deployments.
> >>>>>>>>
> >>>>>>>> Notable features surrounding Flink’s ecosystem that are
> >>>>>>>> planned
> >>> for
> >>>>> the
> >>>>>> next
> >>>>>>>> release include active Kubernetes support (i.e., enabling
> >> Flink’s
> >>>>>>>> ResourceManager to launch new pods), improved Hive integration,
> >>> Java
> >>>>> 11
> >>>>>>>> support, and new algorithms for the Flink ML library.
> >>>>>>>>
> >>>>>>>> Below I have included the list of features that we compiled
> >>> ordered
> >>>> by
> >>>>>>>> priority – some of which already have ongoing mailing list
> >>> threads,
> >>>>>> JIRAs,
> >>>>>>>> or
> >>>>>>>> FLIPs.
> >>>>>>>>
> >>>>>>>> - Improving Flink’s build system & CI [1] [2]
> >>>>>>>> - Support Java 11 [3]
> >>>>>>>> - Table API improvements
> >>>>>>>>   - Configuration Evolution [4] [5]
> >>>>>>>>   - Finish type system: Expression Re-design [6] and UDF
> >> refactor
> >>>>>>>>   - Streaming DDL: Time attribute (watermark) and Changelog
> >>> support
> >>>>>>>>   - Full SQL partition support for both batch & streaming [7]
> >>>>>>>>   - New Java Expression DSL [8]
> >>>>>>>>   - SQL CLI with DDL and DML support
> >>>>>>>> - Hive compatibility completion (DDL/UDF) to support full Hive
> >>>>>> integration
> >>>>>>>>   - Partition/Function/View support
> >>>>>>>> - Remaining Blink planner/runtime merge
> >>>>>>>>   - Support all TPC-DS queries [9]
> >>>>>>>> - Finer grained resource management
> >>>>>>>>   - Unified TaskExecutor Memory Configuration [10]
> >>>>>>>>   - Fine Grained Operator Resource Management [11]
> >>>>>>>>   - Dynamic Slots Allocation [12]
> >>>>>>>> - Finish scheduler re-architecture [13]
> >>>>>>>>   - Allows implementing more sophisticated scheduling
> >>>>>>>> strategies
> >>>> such
> >>>>> as
> >>>>>>>> better batch scheduler or speculative execution.
> >>>>>>>> - New DataStream Source Interface [14]
> >>>>>>>>   - A new source connector architecture to unify the
> >>> implementation
> >>>> of
> >>>>>>>> source connectors and make it simpler to implement custom
> >>>>>>>> source
> >>>>>> connectors.
> >>>>>>>> - Add more source/system metrics
> >>>>>>>>   - For better flink job monitoring and facilitate customized
> >>>>> solutions
> >>>>>>>> like auto-scaling.
> >>>>>>>> - Executor Interface / Client API [15]
> >>>>>>>>   - Allow Flink downstream projects to easier and better
> >>>>>>>> monitor
> >>> and
> >>>>>>>> control flink jobs.
> >>>>>>>> - Interactive Programming [16]
> >>>>>>>>   - Allow users to cache the intermediate results in Table API
> >> for
> >>>>> later
> >>>>>>>> usage to avoid redundant computation when a Flink application
> >>>> contains
> >>>>>>>> multiple jobs.
> >>>>>>>> - Python User Defined Function [17]
> >>>>>>>>   - Support native user-defined functions in Flink Python,
> >>> including
> >>>>>>>> UDF/UDAF/UDTF in Table API and Python-Java mixed UDF.
> >>>>>>>> - Spillable heap backend [18]
> >>>>>>>>   - A new state backend supporting automatic data spill and
> >>>>>>>> load
> >>>> when
> >>>>>>>> memory exhausted/regained.
> >>>>>>>> - RocksDB backend memory control [19]
> >>>>>>>>   - Prevent excessive memory usage from RocksDB, especially in
> >>>>> container
> >>>>>>>> environment.
> >>>>>>>> - Unaligned checkpoints [20]
> >>>>>>>>   - Resolve the checkpoint timeout issue under backpressure.
> >>>>>>>> - Separate framework and user class loader in per-job mode
> >>>>>>>> - Active Kubernetes Integration [21]
> >>>>>>>>   - Allow ResourceManager talking to Kubernetes to launch new
> >> pods
> >>>>>>>> similar to Flink's Yarn/Mesos integration
> >>>>>>>> - ML pipeline/library
> >>>>>>>>   - Aims at delivering several core algorithms, including
> >> Logistic
> >>>>>>>> Regression, Native Bayes, Random Forest, KMeans, etc.
> >>>>>>>> - Add vertex subtask log url on WebUI [22]
> >>>>>>>>
> >>>>>>>>
> >>>>>>>> ** Suggested release timeline
> >>>>>>>>
> >>>>>>>> Based on our usual time-based release schedule [23], and
> >>> considering
> >>>>>> that
> >>>>>>>> several events, such as Flink Forward Europe and Asia, are
> >>>> overlapping
> >>>>>> with
> >>>>>>>> the current release cycle, we should aim at releasing 1.10
> >> around
> >>>> the
> >>>>>>>> beginning of January 2020. To give the community enough testing
> >>>> time,
> >>>>> I
> >>>>>>>> propose the feature freeze to be at the end of November. We
> >> should
> >>>>>> announce
> >>>>>>>> an
> >>>>>>>> exact date later in the release cycle.
> >>>>>>>>
> >>>>>>>> Lastly, I would like to use the opportunity to propose Yu Li
> >>>>>>>> and
> >>>>> myself
> >>>>>> as
> >>>>>>>> release managers for the upcoming release.
> >>>>>>>>
> >>>>>>>> What do you think?
> >>>>>>>>
> >>>>>>>>
> >>>>>>>> Best,
> >>>>>>>> Gary
> >>>>>>>>
> >>>>>>>> [1]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://lists.apache.org/thread.html/775447a187410727f5ba6f9cefd6406c58ca5cc5c580aecf30cf213e@%3Cdev.flink.apache.org%3E
> >>>>>>>> [2]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://lists.apache.org/thread.html/b90aa518fcabce94f8e1de4132f46120fae613db6e95a2705f1bd1ea@%3Cdev.flink.apache.org%3E
> >>>>>>>> [3] https://issues.apache.org/jira/browse/FLINK-10725
> >>>>>>>> [4]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-54%3A+Evolve+ConfigOption+and+Configuration
> >>>>>>>> [5]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-59%3A+Enable+execution+configuration+from+Configuration+object
> >>>>>>>> [6]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-51%3A+Rework+of+the+Expression+Design
> >>>>>>>> [7]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-63%3A+Rework+table+partition+support
> >>>>>>>> [8]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-55%3A+Introduction+of+a+Table+API+Java+Expression+DSL
> >>>>>>>> [9] https://issues.apache.org/jira/browse/FLINK-11491
> >>>>>>>> [10]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-49%3A+Unified+Memory+Configuration+for+TaskExecutors
> >>>>>>>> [11]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-53%3A+Fine+Grained+Operator+Resource+Management
> >>>>>>>> [12]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-56%3A+Dynamic+Slot+Allocation
> >>>>>>>> [13] https://issues.apache.org/jira/browse/FLINK-10429
> >>>>>>>> [14]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-27%3A+Refactor+Source+Interface
> >>>>>>>> [15]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://lists.apache.org/thread.html/498dd3e0277681cda356029582c1490299ae01df912e15942e11ae8e@%3Cdev.flink.apache.org%3E
> >>>>>>>> [16]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-36%3A+Support+Interactive+Programming+in+Flink
> >>>>>>>> [17]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-58%3A+Flink+Python+User-Defined+Stateless+Function+for+Table
> >>>>>>>> [18]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-50%3A+Spill-able+Heap+Keyed+State+Backend
> >>>>>>>> [19] https://issues.apache.org/jira/browse/FLINK-7289
> >>>>>>>> [20]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/Checkpointing-under-backpressure-td31616.html
> >>>>>>>> [21]
> >>>>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Best-practice-to-run-flink-on-kubernetes-td31532.html
> >>>>>>>> [22] https://issues.apache.org/jira/browse/FLINK-13894
> >>>>>>>> [23]
> >>>>>>
> >>> https://cwiki.apache.org/confluence/display/FLINK/Time-based+releases
> >>>>>>>>
> >>>>>>>
> >>>>>>
> >>>>>>
> >>>>>
> >>>>
> >>>
> >>
> >>
> >> --
> >> Xuefu Zhang
> >>
> >> "In Honey We Trust!"
> >>
>

Re: [DISCUSS] Features for Apache Flink 1.10

Reply via email to