[jira] [Created] (FLINK-22897) FlinkSQL1.12 Sink to Hive with diffrent parallelism will due to produce many small files

2021-06-07 Thread zhengjiewen (Jira)
zhengjiewen created FLINK-22897: --- Summary: FlinkSQL1.12 Sink to Hive with diffrent parallelism will due to produce many small files Key: FLINK-22897 URL: https://issues.apache.org/jira/browse/FLINK-22897

[jira] [Created] (FLINK-22898) HiveParallelismInference limit return wrong parallelism

2021-06-07 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-22898: Summary: HiveParallelismInference limit return wrong parallelism Key: FLINK-22898 URL: https://issues.apache.org/jira/browse/FLINK-22898 Project: Flink Issue

[jira] [Created] (FLINK-22899) ValuesUpsertSinkFunction needs to use global upsert

2021-06-07 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-22899: Summary: ValuesUpsertSinkFunction needs to use global upsert Key: FLINK-22899 URL: https://issues.apache.org/jira/browse/FLINK-22899 Project: Flink Issue Typ

Re: [DISCUSS] Support specifying custom transactional.id prefix in FlinkKafkaProducer

2021-06-07 Thread Till Rohrmann
Thanks for starting this discussion Wenhao. I've given you permission to create a FLIP. Cheers, Till On Sat, Jun 5, 2021 at 9:48 AM Wenhao Ji wrote: > Hi everyone, > > Currently, the "transactional.id"s of the Kafka producers in > FlinkKafkaProducer are generated based on the task name. This me

[jira] [Created] (FLINK-22900) flink 1.11.2 fileSystem source table read fileSystem sink table path multi-partition error

2021-06-07 Thread bigdataf (Jira)
bigdataf created FLINK-22900: Summary: flink 1.11.2 fileSystem source table read fileSystem sink table path multi-partition error Key: FLINK-22900 URL: https://issues.apache.org/jira/browse/FLINK-22900 P

Re: [DISCUSS] Watermark propagation with Sink API

2021-06-07 Thread Arvid Heise
Hi Eron, you either have very specific use cases in mind or have a misconception about idleness in Flink with the new sources. The basic idea is that you have watermark generators only at the sources and the user supplies them. As a source author, you have no option to limit that. Here a bit of ba

Re: [DISCUSS] Feedback Collection Jira Bot

2021-06-07 Thread Arvid Heise
One more idea for the bot. Could we have a label to exclude certain tickets from the life-cycle? I'm thinking about long-term tickets such as improving DataStream to eventually replace DataSet. We would collect ideas over the next couple of weeks without any visible progress on the implementation.

Re: Add control mode for flink

2021-06-07 Thread Jark Wu
Thanks Xintong for the summary, I'm big +1 for this feature. Xintong's summary for Table/SQL's needs is correct. The "custom (broadcast) event" feature is important to us and even blocks further awesome features and optimizations in Table/SQL. I also discussed offline with @Yun Gao several times

[jira] [Created] (FLINK-22901) Introduce getChangeLogUpsertKeys in FlinkRelMetadataQuery

2021-06-07 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-22901: Summary: Introduce getChangeLogUpsertKeys in FlinkRelMetadataQuery Key: FLINK-22901 URL: https://issues.apache.org/jira/browse/FLINK-22901 Project: Flink Iss

[jira] [Created] (FLINK-22902) Port KafkaSink to FLIP-143

2021-06-07 Thread Arvid Heise (Jira)
Arvid Heise created FLINK-22902: --- Summary: Port KafkaSink to FLIP-143 Key: FLINK-22902 URL: https://issues.apache.org/jira/browse/FLINK-22902 Project: Flink Issue Type: Improvement Co

Re: recover from svaepoint

2021-06-07 Thread Piotr Nowojski
Hi, Thanks Tianxin and 周瑞' for reporting and tracking down the problem. Indeed that could be the reason behind it. Have either of you already created a JIRA ticket for this bug? > Concerning the required changing of the UID of an operator Piotr, is this a known issue and documented somewhere? I f

[jira] [Created] (FLINK-22903) Code of method xxx of class "StreamExecCalc$1248" grows beyond 64 KB

2021-06-07 Thread smith jayden (Jira)
smith jayden created FLINK-22903: Summary: Code of method xxx of class "StreamExecCalc$1248" grows beyond 64 KB Key: FLINK-22903 URL: https://issues.apache.org/jira/browse/FLINK-22903 Project: Flink

[jira] [Created] (FLINK-22904) Performance regression on 25.05.2020 in mapRebalanceMapSink

2021-06-07 Thread Piotr Nowojski (Jira)
Piotr Nowojski created FLINK-22904: -- Summary: Performance regression on 25.05.2020 in mapRebalanceMapSink Key: FLINK-22904 URL: https://issues.apache.org/jira/browse/FLINK-22904 Project: Flink

[jira] [Created] (FLINK-22905) Versioned Table's SQL Script was missing a "," at Line 7 which yields Could not execute SQL statement ERROR

2021-06-07 Thread liuyan (Jira)
liuyan created FLINK-22905: -- Summary: Versioned Table's SQL Script was missing a "," at Line 7 which yields Could not execute SQL statement ERROR Key: FLINK-22905 URL: https://issues.apache.org/jira/browse/FLINK-22905

[jira] [Created] (FLINK-22906) Add build time to Flink documentation

2021-06-07 Thread Seth Wiesman (Jira)
Seth Wiesman created FLINK-22906: Summary: Add build time to Flink documentation Key: FLINK-22906 URL: https://issues.apache.org/jira/browse/FLINK-22906 Project: Flink Issue Type: Improvement

Re: [DISCUSS]FLIP-150: Introduce Hybrid Source

2021-06-07 Thread Arvid Heise
Sorry for joining the party so late, but it's such an interesting FLIP with a huge impact that I wanted to add my 2 cents. [1] I'm mirroring some basic question from the PR review to this thread because it's about the name: We could rename the thing to ConcatenatedSource(s), SourceSequence, or sim

Re: [DISCUSS] Watermark propagation with Sink API

2021-06-07 Thread Eron Wright
Piotr, David, and Arvid, we've had an expansive discussion but ultimately the proposal is narrow. It is: 1. When a watermark arrives at the sink operator, tell the sink function. 2. When the sink operator is idled, tell the sink function. With these enhancements, we will significantly improve cor

[jira] [Created] (FLINK-22907) SQL Client queries fails on select statement

2021-06-07 Thread Ryan Darling (Jira)
Ryan Darling created FLINK-22907: Summary: SQL Client queries fails on select statement Key: FLINK-22907 URL: https://issues.apache.org/jira/browse/FLINK-22907 Project: Flink Issue Type: Bug

[DISCUSS] Limit size of already processed files in File Source SplitEnumerator

2021-06-07 Thread Tianxin Zhao
Hi! Currently Flink File Source relies on a Set pathsAlreadyProcessed in SplitEnumerator to decide which file has been processed and avoids reprocessing files if a file is already in this set. However this set could be ever growing and ultimately exceed memory size if there are new files continuou

Re: Add control mode for flink

2021-06-07 Thread 刘建刚
Thanks Xintong Song for the detailed supplement. Since flink is long-running, it is similar to many services. So interacting with it or controlling it is a common desire. This was our initial thought when implementing the feature. In our inner flink, many configs used in yaml can be adjusted by dyn

[jira] [Created] (FLINK-22908) FileExecutionGraphInfoStoreTest.testPutSuspendedJobOnClusterShutdown fails on azure

2021-06-07 Thread Xintong Song (Jira)
Xintong Song created FLINK-22908: Summary: FileExecutionGraphInfoStoreTest.testPutSuspendedJobOnClusterShutdown fails on azure Key: FLINK-22908 URL: https://issues.apache.org/jira/browse/FLINK-22908

Re: [DISCUSS] FLIP-169: DataStream API for Fine-Grained Resource Requirements

2021-06-07 Thread wenlong.lwl
Thanks Yangze for the flip, it is great for users to be able to declare the fine-grained resource requirements for the job. I have one minor suggestion: can we support setting resource requirements by configuration? Currently most of the config options in execution config can be configured by conf

[jira] [Created] (FLINK-22909) Supports change log inputs for event time operators

2021-06-07 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-22909: Summary: Supports change log inputs for event time operators Key: FLINK-22909 URL: https://issues.apache.org/jira/browse/FLINK-22909 Project: Flink Issue Typ

Re: Add control mode for flink

2021-06-07 Thread JING ZHANG
Thanks Jiangang for bringing this up. As mentioned in Jiangang's email, `dynamic configuration framework` provides many useful functions in Kuaishou, because it could update job behavior without relaunching the job. The functions are very popular in Kuaishou, we also see similar demands in maillist

Apply for permission to edit wiki

2021-06-07 Thread Senhong Liu
Hello, My username is Senhong Liu (senhong...@gmail.com) and I want to apply for permission to propose a FLIP. Anyone who can help me? THX! Best, Senhong

Re: Add control mode for flink

2021-06-07 Thread kai wang
I'm big +1 for this feature. 1. Limit the input qps. 2. Change log level for debug. in my team, the two examples above are needed JING ZHANG 于2021年6月8日周二 上午11:18写道: > Thanks Jiangang for bringing this up. > As mentioned in Jiangang's email, `dynamic configuration framework` > provides ma

[jira] [Created] (FLINK-22910) ShuffleMaster enhancement for pluggable shuffle service framework

2021-06-07 Thread Yingjie Cao (Jira)
Yingjie Cao created FLINK-22910: --- Summary: ShuffleMaster enhancement for pluggable shuffle service framework Key: FLINK-22910 URL: https://issues.apache.org/jira/browse/FLINK-22910 Project: Flink

Re: [DISCUSS] FLIP-169: DataStream API for Fine-Grained Resource Requirements

2021-06-07 Thread Yangze Guo
Thanks for the feedbacks, Xintong and Wenlong! @Wenlong I think that is a good idea, adjust the resource without re-compiling the job will facilitate the tuning process. We can define a pattern "slot-sharing-group.resource.{ssg name}" (welcome any proposal for the prefix naming) for the resource s

Re: [DISCUSS] FLIP-169: DataStream API for Fine-Grained Resource Requirements

2021-06-07 Thread Yangze Guo
@Wenlong After another consideration, the config option approach I mentioned above might not be appropriate. The resource requirements for SSG should be a job level configuration and should no be set in the flink-conf. I think we can define a JSON format, which would be the ResourceSpecs mapped by

Re: Re: Add control mode for flink

2021-06-07 Thread Yun Gao
Very thanks Jiangang for bringing this up and very thanks for the discussion! I also agree with the summarization by Xintong and Jing that control flow seems to be a common buidling block for many functionalities and dynamic configuration framework is a representative application that frequentl

Re: [DISCUSS] FLIP-169: DataStream API for Fine-Grained Resource Requirements

2021-06-07 Thread Xintong Song
I think being able to specify fine grained resource requirements without having to change the codes and recompile the job is indeed a good idea. It definitely improves the usability. However, this requires more careful designs, which probably deserves a separate thread. I'd be good to have that di

[jira] [Created] (FLINK-22911) Align FLIP-136 (Improve interoperability between DataStream and Table API) in PyFlink Table API

2021-06-07 Thread Dian Fu (Jira)
Dian Fu created FLINK-22911: --- Summary: Align FLIP-136 (Improve interoperability between DataStream and Table API) in PyFlink Table API Key: FLINK-22911 URL: https://issues.apache.org/jira/browse/FLINK-22911

[jira] [Created] (FLINK-22912) Support state ttl in Python DataStream API

2021-06-07 Thread Dian Fu (Jira)
Dian Fu created FLINK-22912: --- Summary: Support state ttl in Python DataStream API Key: FLINK-22912 URL: https://issues.apache.org/jira/browse/FLINK-22912 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-22913) Support Python UDF chaining in Python DataStream API

2021-06-07 Thread Dian Fu (Jira)
Dian Fu created FLINK-22913: --- Summary: Support Python UDF chaining in Python DataStream API Key: FLINK-22913 URL: https://issues.apache.org/jira/browse/FLINK-22913 Project: Flink Issue Type: Improv

Re: Re: Add control mode for flink

2021-06-07 Thread Xintong Song
+1 on separating the effort into two steps: 1. Introduce a common control flow framework, with flexible interfaces for generating / reacting to control messages for various purposes. 2. Features that leverating the control flow can be worked on concurrently Meantime, keeping collectin

Re: Re: Add control mode for flink

2021-06-07 Thread Steven Wu
I can see the benefits of control flow. E.g., it might help the old (and inactive) FLIP-17 side input. I would suggest that we add more details of some of the potential use cases. Here is one mismatch with using control flow for dynamic config. Dynamic config is typically targeted/loaded by one sp

Re: [DISCUSS]FLIP-150: Introduce Hybrid Source

2021-06-07 Thread Steven Wu
> hybrid sounds to me more like the source would constantly switch back and forth Initially, the focus of hybrid source is more like a sequenced chain. But in the future it would be cool that hybrid sources can intelligently switch back and forth between historical data source (like Iceberg) and

[jira] [Created] (FLINK-22914) Use Kafka New Source in Table/SQL connector

2021-06-07 Thread Qingsheng Ren (Jira)
Qingsheng Ren created FLINK-22914: - Summary: Use Kafka New Source in Table/SQL connector Key: FLINK-22914 URL: https://issues.apache.org/jira/browse/FLINK-22914 Project: Flink Issue Type: New

[jira] [Created] (FLINK-22915) Extend Flink ML API to support Estimator/Transformer DAG

2021-06-07 Thread Dong Lin (Jira)
Dong Lin created FLINK-22915: Summary: Extend Flink ML API to support Estimator/Transformer DAG Key: FLINK-22915 URL: https://issues.apache.org/jira/browse/FLINK-22915 Project: Flink Issue Type: