Re: [DISCUSS] Support notifyOnMaster for notifyCheckpointComplete

2019-09-09 Thread shimin yang
Hi Jingsong, Although it would be nice if the accumulators in GlobalAggregateManager is fault-tolerant, we could still take advantage of managed state to guarantee the semantic and use the accumulators to implement distributed barrier or lock to solve the distributed access problem. Best, Shimin

[DISCUSS] Retrieval services in non-high-availability scenario

2019-09-09 Thread Zili Chen
Hi devs, I'd like to start a discussion thread on the topic how we provide retrieval services in non-high-availability scenario. To clarify terminology, non-high-availability scenario refers to StandaloneHaServices and EmbeddedHaServices. ***The problem*** We notice that retrieval services of cu

[jira] [Created] (FLINK-14008) Flink Scala 2.12 binary distribution contains incorrect NOTICE-binary file

2019-09-09 Thread Till Rohrmann (Jira)
Till Rohrmann created FLINK-14008: - Summary: Flink Scala 2.12 binary distribution contains incorrect NOTICE-binary file Key: FLINK-14008 URL: https://issues.apache.org/jira/browse/FLINK-14008 Project:

[jira] [Created] (FLINK-14009) Cron jobs broken due to verifying incorrect NOTICE-binary file

2019-09-09 Thread Till Rohrmann (Jira)
Till Rohrmann created FLINK-14009: - Summary: Cron jobs broken due to verifying incorrect NOTICE-binary file Key: FLINK-14009 URL: https://issues.apache.org/jira/browse/FLINK-14009 Project: Flink

Re: [ANNOUNCE] Kostas Kloudas joins the Flink PMC

2019-09-09 Thread Kostas Kloudas
Thanks a lot everyone for the warm welcome! Cheers, Kostas On Mon, Sep 9, 2019 at 4:54 AM Yun Gao wrote: > > Congratulations, Kostas! > > Best, > Yun‍‍‍ > > > > > -- > From:Becket Qin > Send Time:2019 Sep. 9 (Mon.)

Re: [ANNOUNCE] Kostas Kloudas joins the Flink PMC

2019-09-09 Thread JingsongLee
Congrats, Kostas! Well deserved. Best, Jingsong Lee -- From:Kostas Kloudas Send Time:2019年9月9日(星期一) 15:50 To:dev ; Yun Gao Subject:Re: [ANNOUNCE] Kostas Kloudas joins the Flink PMC Thanks a lot everyone for the warm welcome! Che

Re: [DISCUSS] Contribute Pulsar Flink connector back to Flink

2019-09-09 Thread Sijie Guo
Thank you Bowen and Becket. What's the take from Flink community? Shall we wait for FLIP-27 or shall we proceed to next steps? And what the next steps are? :-) Thanks, Sijie On Thu, Sep 5, 2019 at 2:43 PM Bowen Li wrote: > Hi, > > I think having a Pulsar connector in Flink can be a good mutual

Re: Checkpointing clarification

2019-09-09 Thread Till Rohrmann
Yes you are correct Dominik. The committed Kafka offsets tell you what the program has read as input from the Kafka topic. But depending on the actual program logic this does not mean that you have output the results of processing these input events up to this point. As you have said, there are Fli

[jira] [Created] (FLINK-14010) Dispatcher & JobManagers don't give up leadership when AM is shut down

2019-09-09 Thread TisonKun (Jira)
TisonKun created FLINK-14010: Summary: Dispatcher & JobManagers don't give up leadership when AM is shut down Key: FLINK-14010 URL: https://issues.apache.org/jira/browse/FLINK-14010 Project: Flink

Call for approving Elasticsearch 7.x connector

2019-09-09 Thread vino yang
Hi guys, There is an issue about supporting Elasticsearch 7.x.[1] Based on our validation and discussion. We found that Elasticsearch 7.x does not guarantee API compatibility. Therefore, it does not have the ability to provide a universal connector like Kafka. It seems that we have to provide a ne

[jira] [Created] (FLINK-14011) Make some fields final and initialize them during construction in AsyncWaitOperator

2019-09-09 Thread Alex (Jira)
Alex created FLINK-14011: Summary: Make some fields final and initialize them during construction in AsyncWaitOperator Key: FLINK-14011 URL: https://issues.apache.org/jira/browse/FLINK-14011 Project: Flink

Re: [DISCUSS] StreamingFile with ParquetBulkWriter bucketing limitations

2019-09-09 Thread Kostas Kloudas
Hi Enrico, Sorry for the late reply. I think your understanding is correct. The best way to do it is to write your own ParquetBulkWriter and the corresponding factory. Out of curiosity, I guess that in the BucketingSink you were using the AvroKeyValueSinkWriter, right? Cheers, Kostas On Fri, Au

Re: [DISCUSS] Retrieval services in non-high-availability scenario

2019-09-09 Thread Till Rohrmann
Hi Tison, thanks for starting this discussion. I think your mail includes multiple points which are worth being treated separately (might even make sense to have separate discussion threads). Please correct me if I understood things wrongly: 1. Adding new non-ha HAServices: Based on your descrip

[jira] [Created] (FLINK-14012) Failed to start job for consuming Secure Kafka after the job cancel

2019-09-09 Thread Daebeom Lee (Jira)
Daebeom Lee created FLINK-14012: --- Summary: Failed to start job for consuming Secure Kafka after the job cancel Key: FLINK-14012 URL: https://issues.apache.org/jira/browse/FLINK-14012 Project: Flink

Re: [DISCUSS] StreamingFile with ParquetBulkWriter bucketing limitations

2019-09-09 Thread Enrico Agnoli
Thanks for confirming. We have a ``` public class ParquetSinkWriter implements Writer ``` that handles the serialization of the data. We implemented it starting from: https://medium.com/hadoop-noob/flink-parquet-writer-d127f745b519 https://stackoverflow.com/questions/48098011/how-to-use-apache-fli

Re: [DISCUSS] FLIP-66: Support time attribute in SQL DDL

2019-09-09 Thread Jark Wu
Hi all, Thanks all for so much feedbacks received in the doc so far. I saw a general agreement on using computed column to support proctime attribute and extract timestamps. So we will prepare a computed column FLIP and share in the dev ML soon. Feel free to leave more comments! Best, Jark On

Re: [VOTE] Release 1.8.2, release candidate #1

2019-09-09 Thread Dian Fu
+1 (non-binding) - built from source successfully (mvn clean install -DskipTests) - checked gpg signature and hashes of the source release and binary release packages - All artifacts have been deployed to the maven central repository - no new dependencies were added since 1.8.1 - run a couple of

Re: [VOTE] FLIP-58: Flink Python User-Defined Function for Table API

2019-09-09 Thread Dian Fu
Thanks Jincheng a lot for the remind and thanks all for the voting. I'm closing the vote now. So far, the vote has received: - 5 binding +1 votes (Jincheng, Hequn, Jark, Shaoxuan, Becket) - 5 non-binding +1 votes (Wei, Xingbo, Terry, Yu, Jeff) - No 0/-1 votes There are more than 3 binding +

[jira] [Created] (FLINK-14013) Support Flink Python User-Defined Stateless Function for Table

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14013: --- Summary: Support Flink Python User-Defined Stateless Function for Table Key: FLINK-14013 URL: https://issues.apache.org/jira/browse/FLINK-14013 Project: Flink Issue T

[jira] [Created] (FLINK-14014) Introduce PythonScalarFunctionRunner to handle the communication with Python worker for Python ScalarFunction execution

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14014: --- Summary: Introduce PythonScalarFunctionRunner to handle the communication with Python worker for Python ScalarFunction execution Key: FLINK-14014 URL: https://issues.apache.org/jira/browse/

[jira] [Created] (FLINK-14015) Introduce PythonScalarFunctionOperator as a standalone StreamOperator for Python ScalarFunction execution

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14015: --- Summary: Introduce PythonScalarFunctionOperator as a standalone StreamOperator for Python ScalarFunction execution Key: FLINK-14015 URL: https://issues.apache.org/jira/browse/FLINK-14015

[jira] [Created] (FLINK-14016) Introduce RelNodes FlinkLogicalPythonScalarFunctionExec and DataStreamPythonScalarFunctionExec which are containers for Python PythonScalarFunctions

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14016: --- Summary: Introduce RelNodes FlinkLogicalPythonScalarFunctionExec and DataStreamPythonScalarFunctionExec which are containers for Python PythonScalarFunctions Key: FLINK-14016 URL: https://

[jira] [Created] (FLINK-14017) Support to start up Python worker in process mode

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14017: --- Summary: Support to start up Python worker in process mode Key: FLINK-14017 URL: https://issues.apache.org/jira/browse/FLINK-14017 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-14018) Add Python building blocks to make sure the basic functionality of Python ScalarFunction could work

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14018: --- Summary: Add Python building blocks to make sure the basic functionality of Python ScalarFunction could work Key: FLINK-14018 URL: https://issues.apache.org/jira/browse/FLINK-14018

[jira] [Created] (FLINK-14019) Python environment and dependency management

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14019: --- Summary: Python environment and dependency management Key: FLINK-14019 URL: https://issues.apache.org/jira/browse/FLINK-14019 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-14020) User Apache Arrow as the serializer for data transmission between Java operator and Python harness

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14020: --- Summary: User Apache Arrow as the serializer for data transmission between Java operator and Python harness Key: FLINK-14020 URL: https://issues.apache.org/jira/browse/FLINK-14020

[jira] [Created] (FLINK-14021) Add rules to push down the Python ScalarFunctions contained in the join condition of Correlate node

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14021: --- Summary: Add rules to push down the Python ScalarFunctions contained in the join condition of Correlate node Key: FLINK-14021 URL: https://issues.apache.org/jira/browse/FLINK-14021

[jira] [Created] (FLINK-14022) Add validation check for places where Python ScalarFunction cannot be used

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14022: --- Summary: Add validation check for places where Python ScalarFunction cannot be used Key: FLINK-14022 URL: https://issues.apache.org/jira/browse/FLINK-14022 Project: Flink

[jira] [Created] (FLINK-14023) Support accessing job parameters in Python user-defined functions

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14023: --- Summary: Support accessing job parameters in Python user-defined functions Key: FLINK-14023 URL: https://issues.apache.org/jira/browse/FLINK-14023 Project: Flink Issu

[jira] [Created] (FLINK-14024) Support use-defined metrics in Python user-defined functions

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14024: --- Summary: Support use-defined metrics in Python user-defined functions Key: FLINK-14024 URL: https://issues.apache.org/jira/browse/FLINK-14024 Project: Flink Issue Typ

[jira] [Created] (FLINK-14025) Support to run the Python worker in docker mode

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14025: --- Summary: Support to run the Python worker in docker mode Key: FLINK-14025 URL: https://issues.apache.org/jira/browse/FLINK-14025 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-14026) Manage the resource of Python worker properly

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14026: --- Summary: Manage the resource of Python worker properly Key: FLINK-14026 URL: https://issues.apache.org/jira/browse/FLINK-14026 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-14027) Add documentation for Python user-defined functions

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14027: --- Summary: Add documentation for Python user-defined functions Key: FLINK-14027 URL: https://issues.apache.org/jira/browse/FLINK-14027 Project: Flink Issue Type: Sub-tas

[jira] [Created] (FLINK-14028) Support logging aggregation in Python user-defined functions

2019-09-09 Thread Dian Fu (Jira)
Dian Fu created FLINK-14028: --- Summary: Support logging aggregation in Python user-defined functions Key: FLINK-14028 URL: https://issues.apache.org/jira/browse/FLINK-14028 Project: Flink Issue Typ

[jira] [Created] (FLINK-14029) Update Flink's Mesos scheduling behavior to reject all expired offers

2019-09-09 Thread Piyush Narang (Jira)
Piyush Narang created FLINK-14029: - Summary: Update Flink's Mesos scheduling behavior to reject all expired offers Key: FLINK-14029 URL: https://issues.apache.org/jira/browse/FLINK-14029 Project: Flin

Re: [ANNOUNCE] Kostas Kloudas joins the Flink PMC

2019-09-09 Thread Biao Liu
Congrats, Kostas! Thanks, Biao /'bɪ.aʊ/ On Mon, 9 Sep 2019 at 16:07, JingsongLee wrote: > Congrats, Kostas! Well deserved. > > Best, > Jingsong Lee > > > -- > From:Kostas Kloudas > Send Time:2019年9月9日(星期一) 15:50 > To:dev ; Yun G

[jira] [Created] (FLINK-14030) Nonequivalent conversion happens in Table planner

2019-09-09 Thread Leonard Xu (Jira)
Leonard Xu created FLINK-14030: -- Summary: Nonequivalent conversion happens in Table planner Key: FLINK-14030 URL: https://issues.apache.org/jira/browse/FLINK-14030 Project: Flink Issue Type: Bu

[jira] [Created] (FLINK-14031) flink-examples should add blink dependency on flink-examples-table

2019-09-09 Thread Jimmy Wong (Jira)
Jimmy Wong created FLINK-14031: -- Summary: flink-examples should add blink dependency on flink-examples-table Key: FLINK-14031 URL: https://issues.apache.org/jira/browse/FLINK-14031 Project: Flink

Re: [VOTE] FLIP-49: Unified Memory Configuration for TaskExecutors

2019-09-09 Thread Stephan Ewen
One thing that I just came across: Some of these options should also have a corresponding value for the JobManager, like JVM overhead, metaspace, direct memory. On Fri, Sep 6, 2019 at 4:34 AM Xintong Song wrote: > Thanks all for the votes. > So far, we have > >- 4 binding +1 votes (Stephan,

[jira] [Created] (FLINK-14032) Make the cache size of RocksDBPriorityQueueSetFactory configurable

2019-09-09 Thread Yun Tang (Jira)
Yun Tang created FLINK-14032: Summary: Make the cache size of RocksDBPriorityQueueSetFactory configurable Key: FLINK-14032 URL: https://issues.apache.org/jira/browse/FLINK-14032 Project: Flink I

[jira] [Created] (FLINK-14033) Distributed caches are not registered in Yarn Per Job Cluster Mode

2019-09-09 Thread Zhenqiu Huang (Jira)
Zhenqiu Huang created FLINK-14033: - Summary: Distributed caches are not registered in Yarn Per Job Cluster Mode Key: FLINK-14033 URL: https://issues.apache.org/jira/browse/FLINK-14033 Project: Flink

[jira] [Created] (FLINK-14034) In FlinkKafkaProducer, KafkaTransactionState should be made public or invoke should be made final

2019-09-09 Thread Niels van Kaam (Jira)
Niels van Kaam created FLINK-14034: -- Summary: In FlinkKafkaProducer, KafkaTransactionState should be made public or invoke should be made final Key: FLINK-14034 URL: https://issues.apache.org/jira/browse/FLINK-14

[DISCUSS] modular built-in functions

2019-09-09 Thread Bowen Li
Hi all, During the discussion of how to support Hive built-in functions in Flink in FLIP-57 [1], an idea of "modular built-in functions" was brought up with examples of "Extension" in Postgres [2] and "Plugin" in Presto [3]. Thus I'd like to kick off a discussion to see if we should adopt such an

[jira] [Created] (FLINK-14035) Introduce/Change some log for snapshot to better analysis checkpoint problem

2019-09-09 Thread Congxian Qiu(klion26) (Jira)
Congxian Qiu(klion26) created FLINK-14035: - Summary: Introduce/Change some log for snapshot to better analysis checkpoint problem Key: FLINK-14035 URL: https://issues.apache.org/jira/browse/FLINK-14035

Re: [DISCUSS] Contribute Pulsar Flink connector back to Flink

2019-09-09 Thread Becket Qin
Hi Sijie, If we agree that the goal is to have Pulsar connector in 1.10, how about we do the following: 0. Start a FLIP to add Pulsar connector to Flink main repo as it is a new public interface to Flink main repo. 1. Start to review the Pulsar sink right away as there is no change to the sink in

[jira] [Created] (FLINK-14036) function log(f0,f1) in Table API do not support decimal type

2019-09-09 Thread Leonard Xu (Jira)
Leonard Xu created FLINK-14036: -- Summary: function log(f0,f1) in Table API do not support decimal type Key: FLINK-14036 URL: https://issues.apache.org/jira/browse/FLINK-14036 Project: Flink

[jira] [Created] (FLINK-14037) Deserializing the input/output formats failed: unread block data

2019-09-09 Thread liupengcheng (Jira)
liupengcheng created FLINK-14037: Summary: Deserializing the input/output formats failed: unread block data Key: FLINK-14037 URL: https://issues.apache.org/jira/browse/FLINK-14037 Project: Flink

[jira] [Created] (FLINK-14038) ExecutionGraph deploy failed due to akka timeout

2019-09-09 Thread liupengcheng (Jira)
liupengcheng created FLINK-14038: Summary: ExecutionGraph deploy failed due to akka timeout Key: FLINK-14038 URL: https://issues.apache.org/jira/browse/FLINK-14038 Project: Flink Issue Type:

Re: [DISCUSS] Support notifyOnMaster for notifyCheckpointComplete

2019-09-09 Thread Dian Fu
Hi Jingsong, Good point! 1. If it doesn't matter which task performs the finalize work, then I think task-0 suggested by Jark is a very good solution. 2. If it requires the last finished task to perform the finalize work, then we have to consider other solutions. WRT fault-tolerant of Stream

Re: [VOTE] Release 1.8.2, release candidate #1

2019-09-09 Thread Kurt Young
+1 (binding) - build from source and passed all tests locally - checked the difference between 1.8.1 and 1.8.2, no legal risk found - went through all commits checked in between 1.8.1 and 1.8.2, make sure all the issues set the proper "fixVersion" property Best, Kurt On Mon, Sep 9, 2019 at 8:45

Re: [DISCUSS] FLIP-63: Rework table partition support

2019-09-09 Thread Biao Liu
Hi Jingsong, Thank you for bringing this discussion. Since I don't have much experience of Flink table/SQL, I'll ask some questions from runtime or engine perspective. > ... where we describe how to partition support in flink and how to integrate to hive partition. FLIP-27 [1] introduces "partit