+1 for the 1.9.0 feature list. Excited to see it is happening. Regards, Jark
On Thu, 2 May 2019 at 17:07, himansh1...@gmail.com <himansh1...@gmail.com> wrote: > +1 for Protobuf, Hive Metastore integration & features related to > savepoint. > > I was hoping if we could support for ORC File format can be added in > StreamingFileSink Writer, Currently only Parquet is supported with respect > to columnar file formats > > > On 2019/05/01 05:15:23, "Tzu-Li (Gordon) Tai" <tzuli...@apache.org> > wrote: > > Hi community, > > > > Apache Flink 1.8.0 has been released a few weeks ago, so naturally, it’s > > time to start thinking about what we want to aim for 1.9.0. > > > > Kurt and I had collected some features that would be reasonable to > consider > > including for the next release, based on talking with various people as > > well as observations from mailing list discussions and questions. > > > > Note that having specific features listed here does not mean that no > other > > pull requests or topics will be reviewed. I am sure that there are other > > ongoing efforts that we missed here and will likely make it as an > > improvement or new feature in the next release. This discussion is merely > > for bootstrapping a discussion for 1.9, as well as to give contributors > an > > idea of what the community is looking to focus on in the next couple of > > weeks. > > > > *Proposed features and focus* > > > > In the previous major release, Apache Flink 1.8.0, the community had > > prepared for some major Table & SQL additions from the Blink branch. With > > this in mind, for the next release, it would be great to wind up those > > efforts by merging in the Blink-based Table / SQL planner and runtime for > > 1.9. > > > > Following Stephan’s previous thread [1] in the mailing list about > features > > in Blink, we should also start focusing on preparing for Blink’s other > > several enhancements for batch execution. This includes resource > > optimization, fine-grained failover, pluggable shuffle service, adapting > > stream operators for batch execution, as well as better integration with > > commonly used systems by batch executions such as Apache Hive. > > > > Moreover, besides efforts related to the Blink merge, we would also like > us > > to work towards pushing forward some of the most discussed and > anticipated > > features by the community. Most of these had discussions in the mailing > > lists that span multiple releases, and are also frequently brought up in > > community events such as Flink Forward. This includes features such as > > source event-time alignment and the source interface rework, a savepoint > > connector that allows users to manipulate and query state in savepoints, > > interactive programming, as well as terminating a job with a final > > savepoint. > > > > Last but not least, we have several existing contributions or discussions > > for the ecosystem surrounding Flink, which we think is also very valuable > > to try to merge in for 1.9. This includes a web UI rework (recently > already > > merged), active K8s integration, Google PubSub connector, native support > > for the Protobuf format, Python support in the Table API, as well as > > reworking Flink’s support for machine learning. > > > > To wrap this up as a list of items, some of which already have JIRAs or > > mailing list threads to track them: > > > > - Merge Blink runner for Table & SQL [2] > > - > > > > Restructure flink-table to separate API from core runtime > > - > > > > Make table planners pluggable > > - > > > > Rework Table / SQL type system to integrate better with the SQL > > standard [3] > > - > > > > Merge Blink planner and runtime for Table / SQL > > - Further preparations for more batch execution optimization from > > Blink > > - > > > > Dedicated scheduler component [4] > > - > > > > Fine grained failover for batch [5] > > - > > > > Selectable input stream operator [6] > > - > > > > Pluggable Shuffle Service [7] > > - > > > > FLIP-30: Unified Catalog API & Hive metastore integration [8] > > - Heavily anticipated / discussed features in the community > > - > > > > FLIP-27: Source interface rework [9] > > - > > > > Savepoint connector [10] > > - > > > > FLIP-34: Terminate / Suspend job with savepoint [11] > > - > > > > FLIP-36: Interactive Programming [12] > > - Ecosystem > > - > > > > Web UI rework [13] > > - > > > > Active K8s integration [14] > > - > > > > Google PubSub connector [15] > > - > > > > First-class Protobuf support [16] > > - > > > > FLIP-38: Python support in Table API [17] > > - > > > > FLIP-39: Flink ML pipeline and libraries on top of Table API [18] > > > > *Suggested release timeline* > > > > Apache Flink 1.8.0 was released earlier this month, so based on our usual > > timely release schedule, we should aim for releasing 1.9.0 around mid to > > end July. > > > > Since it seems that this is going to be a fairly large release, to give > the > > community enough testing time, I propose that the feature freeze to be > near > > the end of June (8-9 weeks from now, probable June 28). This is of > course a > > ballpark estimation for now; we should follow-up with a separate thread > > later in the release cycle to prepare contributors with an official > feature > > freeze date. > > > > I’d also like to use this opportunity to propose myself and Kurt as the > > release managers for 1.9. > > AFAIK, we did not used to have 2 RMs for a single release in the past, > but > > 1.9.0 is definitely quite ambitious so it would not hurt to have one more > > on board :) Cheers, Gordon [1] > > > http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Flink-1-6-features-td22632.html > > > > [2] https://issues.apache.org/jira/browse/FLINK-11439 > > > > [3] https://issues.apache.org/jira/browse/FLINK-12251 > > > > [4] https://issues.apache.org/jira/browse/FLINK-10429 > > > > [5] > > > http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Backtracking-for-failover-regions-td28293.html > > > > [6] > > > http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Enhance-Operator-API-to-Support-Dynamically-Selective-Reading-and-EndOfInput-Event-td26753.html > > > > [7] https://issues.apache.org/jira/browse/FLINK-10653 > > > > [8] https://issues.apache.org/jira/browse/FLINK-11275 > > > > [9] > > > http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-27-Refactor-Source-Interface-td24952i20.html > > > > [10] https://issues.apache.org/jira/browse/FLINK-12047 > > > > [11] > > > http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-33-Terminate-Suspend-Job-with-Savepoint-td26927.html > > > > [12] > > > https://cwiki.apache.org/confluence/display/FLINK/FLIP-36%3A+Support+Interactive+Programming+in+Flink > > > > [13] https://issues.apache.org/jira/browse/FLINK-10705 > > > > [14] https://issues.apache.org/jira/browse/FLINK-9953 > > > > [15] https://issues.apache.org/jira/browse/FLINK-9311 > > > > [16] https://issues.apache.org/jira/browse/FLINK-11333 > > > > [17] > > > http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-38-Support-python-language-in-flink-TableAPI-td28061.html > > [18] > > > http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-FLIP-39-Flink-ML-pipeline-and-ML-libs-td28633.html > > >