Re: How to handle Flink Job with 400MB+ Uberjar with 800+ containers ?

2019-08-30 Thread SHI Xiaogang
Hi Dadashov, You may have a look at method YarnResourceManager#onContainersAllocated which will launch containers (via NMClient#startContainer) after containers are allocated. The launching is performed in the main thread of YarnResourceManager and the launching is synchronous/blocking. Consequent

Re: [DISCUSS] Flink client api enhancement for downstream project

2019-08-30 Thread Zili Chen
Great Kostas! Looking forward to your POC! Best, tison. Jeff Zhang 于2019年8月30日周五 下午11:07写道: > Awesome, @Kostas Looking forward your POC. > > Kostas Kloudas 于2019年8月30日周五 下午8:33写道: > > > Hi all, > > > > I am just writing here to let you know that I am working on a POC that > > tries to refacto

Re: [SURVEY] Is the default restart delay of 0s causing problems?

2019-08-30 Thread Zhu Zhu
In our production, we usually override the restart delay to be 10 s. We once encountered cases that external services are overwhelmed by reconnections from frequent restarted tasks. As a safer though not optimized option, a default delay larger than 0 s is better in my opinion. 未来阳光 <2217232...@q

[ANNOUNCE] Kinesis connector becomes part of Flink releases

2019-08-30 Thread Bowen Li
Hi all, I'm glad to announce that, as #9494 was merged today, flink-connector-kinesis is officially of Apache 2.0 license now in master branch and its artifact will be deployed to Maven central as part of Flink releases starting from Flink 1.10.0. Users c

[jira] [Created] (FLINK-13932) PyTest ExecutionConfigTests.test_equals_and_hash fail

2019-08-30 Thread TisonKun (Jira)
TisonKun created FLINK-13932: Summary: PyTest ExecutionConfigTests.test_equals_and_hash fail Key: FLINK-13932 URL: https://issues.apache.org/jira/browse/FLINK-13932 Project: Flink Issue Type: Bug

Re: [DISCUSS] FLIP-54: Evolve ConfigOption and Configuration

2019-08-30 Thread Becket Qin
Hi Timo, Thanks for the reply. I am still a little concerned over the mutability of the Configurable which could be the value in Configuration. Re: 1 > But in general, people should not use any internal fields. > Configurable objects are meant for simple little helper POJOs, not > complex arbitr

Re: [ANNOUNCE] Apache Flink-shaded 8.0 released

2019-08-30 Thread Hequn Cheng
Thanks a lot to Chesney! Also thanks to everyone who helped to make this release possible. Best, Hequn On Fri, Aug 30, 2019 at 7:17 PM jincheng sun wrote: > Thanks a lot Chesnay and to the community for making this release possible > ! > > Cheers, > Jincheng > > Chesnay Schepler 于2019年8月30日周五

[jira] [Created] (FLINK-13931) Support Hive version 2.0.x

2019-08-30 Thread Xuefu Zhang (Jira)
Xuefu Zhang created FLINK-13931: --- Summary: Support Hive version 2.0.x Key: FLINK-13931 URL: https://issues.apache.org/jira/browse/FLINK-13931 Project: Flink Issue Type: Improvement Co

Re: Potential block size issue with S3 binary files

2019-08-30 Thread Ken Krugler
Hi Stephan (switching to dev list), > On Aug 29, 2019, at 2:52 AM, Stephan Ewen wrote: > > That is a good point. > > Which way would you suggest to go? Not relying on the FS block size at all, > but using a fix (configurable) block size? There’s value to not requiring a fixed block size, as t

[ANNOUNCEMENT] September 2019 Bay Area Apache Flink Meetup

2019-08-30 Thread Xuefu Zhang
Hi all, As promised, we planned to have quarterly Flink meetup and now it's about the time. Thus, I'm happy to announce that the next Bay Area Apache Flink Meetup [1] is scheduled on Sept. 24 at Yelp, 140 New Montgomery in San Francisco. Schedule: 6:30 - Door open 6:30 - 7:00 PM Networking and R

Re: How to handle Flink Job with 400MB+ Uberjar with 800+ containers ?

2019-08-30 Thread Elkhan Dadashov
Thanks everyone for valuable input and sharing your experience for tackling the issue. Regarding suggestions : - We provision some common jars in all cluster nodes *-->* but this requires dependence on Infra Team schedule for handling common jars/updating - Making Uberjar slimmer *-->* tried e

[jira] [Created] (FLINK-13930) Support Hive version 3.1.x

2019-08-30 Thread Xuefu Zhang (Jira)
Xuefu Zhang created FLINK-13930: --- Summary: Support Hive version 3.1.x Key: FLINK-13930 URL: https://issues.apache.org/jira/browse/FLINK-13930 Project: Flink Issue Type: Improvement Co

Re: [DISCUSS] Releasing Flink 1.8.2

2019-08-30 Thread Till Rohrmann
+1 for a 1.8.2 bug fix release. Thanks for kicking this discussion off Jincheng. Cheers, Till On Fri, Aug 30, 2019 at 6:45 PM Jark Wu wrote: > Thanks Jincheng for bringing this up. > > +1 to the 1.8.2 release, because it already contains a couple of important > fixes and it has been a long time

[jira] [Created] (FLINK-13929) Revisit REST & JM URL

2019-08-30 Thread TisonKun (Jira)
TisonKun created FLINK-13929: Summary: Revisit REST & JM URL Key: FLINK-13929 URL: https://issues.apache.org/jira/browse/FLINK-13929 Project: Flink Issue Type: Improvement Components:

Re: [DISCUSS] Releasing Flink 1.8.2

2019-08-30 Thread Jark Wu
Thanks Jincheng for bringing this up. +1 to the 1.8.2 release, because it already contains a couple of important fixes and it has been a long time since 1.8.1 came out. I'm willing to help the community as much as possible. I'm wondering if I can be the release manager of 1.8.2 or work with you to

[jira] [Created] (FLINK-13928) Make windows api more extendable

2019-08-30 Thread zhihao zhang (Jira)
zhihao zhang created FLINK-13928: Summary: Make windows api more extendable Key: FLINK-13928 URL: https://issues.apache.org/jira/browse/FLINK-13928 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-13927) Add note about hadoop dependencies for local debug

2019-08-30 Thread Andrey Zagrebin (Jira)
Andrey Zagrebin created FLINK-13927: --- Summary: Add note about hadoop dependencies for local debug Key: FLINK-13927 URL: https://issues.apache.org/jira/browse/FLINK-13927 Project: Flink Issu

[jira] [Created] (FLINK-13926) `ProcessingTimeSessionWindows` and `EventTimeSessionWindows` should be generic

2019-08-30 Thread zhihao zhang (Jira)
zhihao zhang created FLINK-13926: Summary: `ProcessingTimeSessionWindows` and `EventTimeSessionWindows` should be generic Key: FLINK-13926 URL: https://issues.apache.org/jira/browse/FLINK-13926 Proje

[jira] [Created] (FLINK-13925) ClassLoader in BlobLibraryCacheManager is not using context class loader

2019-08-30 Thread Jira
Jan Lukavský created FLINK-13925: Summary: ClassLoader in BlobLibraryCacheManager is not using context class loader Key: FLINK-13925 URL: https://issues.apache.org/jira/browse/FLINK-13925 Project: Fli

Re: [DISCUSS] Flink client api enhancement for downstream project

2019-08-30 Thread Jeff Zhang
Awesome, @Kostas Looking forward your POC. Kostas Kloudas 于2019年8月30日周五 下午8:33写道: > Hi all, > > I am just writing here to let you know that I am working on a POC that > tries to refactor the current state of job submission in Flink. > I want to stress out that it introduces NO CHANGES to the cur

[jira] [Created] (FLINK-13924) Add summarizer and summary for sparse vector and dense vector.

2019-08-30 Thread Xu Yang (Jira)
Xu Yang created FLINK-13924: --- Summary: Add summarizer and summary for sparse vector and dense vector. Key: FLINK-13924 URL: https://issues.apache.org/jira/browse/FLINK-13924 Project: Flink Issue T

[jira] [Created] (FLINK-13923) Add summarizer and summary for table.

2019-08-30 Thread Xu Yang (Jira)
Xu Yang created FLINK-13923: --- Summary: Add summarizer and summary for table. Key: FLINK-13923 URL: https://issues.apache.org/jira/browse/FLINK-13923 Project: Flink Issue Type: Sub-task Co

Re: State of FLIPs

2019-08-30 Thread Dian Fu
Hi Chesnay, Thanks a lot for the remind. FLIP-38 has been released in 1.9 and I have updated the status in the wiki page. Regards, Dian On Fri, Aug 30, 2019 at 9:38 PM Becket Qin wrote: > Hi Chesnay, > > You are right. FLIP-36 actually has not passed the vote yet. In fact some > of the key des

Re: ClassLoader created by BlobLibraryCacheManager is not using context classloader

2019-08-30 Thread Till Rohrmann
Hi Jan, this looks to me like a bug for which you could create a JIRA and PR to fix it. Just to make sure, I've pulled in Aljoscha who is the author of this change to check with him whether we are forgetting something. Cheers, Till On Fri, Aug 30, 2019 at 3:44 PM Jan Lukavský wrote: > Hi, > >

??????[SURVEY] Is the default restart delay of 0s causing problems?

2019-08-30 Thread ????????
Hi, I thinks it's better to increase the default value. +1 Best. -- -- ??: "Till Rohrmann"; : 2019??8??30??(??) 10:07 ??: "dev"; "user"; : [SURVEY] Is the default restart delay of 0s causing problems? Hi everyone, I

Re: [DISCUSS] Simplify Flink's cluster level RestartStrategy configuration

2019-08-30 Thread Till Rohrmann
After an offline discussion with Stephan, we concluded that changing the default restart strategy for batch jobs is not that easy because the cluster level restart configuration does not necessarily know about the type of job which is submitted. We concluded that we would like to keep the batch beh

[SURVEY] Is the default restart delay of 0s causing problems?

2019-08-30 Thread Till Rohrmann
Hi everyone, I wanted to reach out to you and ask whether decreasing the default delay to `0 s` for the fixed delay restart strategy [1] is causing trouble. A user reported that he would like to increase the default value because it can cause restart storms in case of systematic faults [2]. The d

Re: [DISCUSS] FLIP-54: Evolve ConfigOption and Configuration

2019-08-30 Thread Timo Walther
Hi Becket, 1. First of all, you are totally right. The FLIP contains a bug due to the last minute changes that Dawid suggested: by having immutable objects created by a factory we loose the serializability of the Configuration because the factory itself is not stored in the Configuration. I w

ClassLoader created by BlobLibraryCacheManager is not using context classloader

2019-08-30 Thread Jan Lukavský
Hi, I have come across an issue with classloading in Flink's MiniCluster. The issue is that when I run local flink job from a thread, that has a non-default context classloader (for whatever reason), this classloader is not taken into account when classloading user defined functions. This is

Re: State of FLIPs

2019-08-30 Thread Becket Qin
Hi Chesnay, You are right. FLIP-36 actually has not passed the vote yet. In fact some of the key designs may have to change due to the later code changes. I'll update the wiki and start a new vote. Thanks, Jiangjie (Becket) Qin On Fri, Aug 30, 2019 at 8:44 PM Chesnay Schepler wrote: > The fol

Re: [DISCUSS] FLIP-54: Evolve ConfigOption and Configuration

2019-08-30 Thread Becket Qin
Hi Timo, Thanks again for the clarification. Please see a few more questions below. Re: 1 > Please also keep in mind that Configuration must not consist of only > strings, it manages a Map for efficient access. Every > map entry can have a string representation for persistence, but in most > cas

Re: [DISCUSS] Simplify Flink's cluster level RestartStrategy configuration

2019-08-30 Thread Till Rohrmann
The current default behaviour for batch is `NoRestartStrategy` if nothing is configured. We could say that we set the default value of `restart-strategy` to `FixedDelayRestartStrategy(Integer.MAX_VALUE, "0 s")` independent of the checkpointing. The only downside I could see is that some faulty batc

Re: [DISCUSS] Simplify Flink's cluster level RestartStrategy configuration

2019-08-30 Thread Dawid Wysakowicz
Also +1 in general. I have a few questions though: - does it only apply to the logic in org.apache.flink.runtime.executiongraph.restart.RestartStrategyFactory#createRestartStrategyFactory, which is only the cluster side configuration? Or do you want to change the logic also on the job side in Exe

Re: [DISCUSS] FLIP-59: Enable execution configuration from Configuration object

2019-08-30 Thread Gyula Fóra
Hi Dawid, Sorry I misread one of the interfaces a little (Configuration instead of ConfigurationReader), you are right. I was referring to: - void StreamExecutionEnvironment.configure(ConfigurationReader) This might be slightly orthogonal to the changes that you made here but what I mea

Re: instable checkpointing after migration to flink 1.8

2019-08-30 Thread Stephan Ewen
Hi all! A thought would be that this has something to do with timers. Does the task with that behavior use timers (windows, or process function)? If that is the case, some theories to check: - Could it be a timer firing storm coinciding with a checkpoint? Currently, that storm synchronously fir

[jira] [Created] (FLINK-13922) Support anchors for restart strategies in Chinese documentation

2019-08-30 Thread Till Rohrmann (Jira)
Till Rohrmann created FLINK-13922: - Summary: Support anchors for restart strategies in Chinese documentation Key: FLINK-13922 URL: https://issues.apache.org/jira/browse/FLINK-13922 Project: Flink

Re: [DISCUSS] Simplify Flink's cluster level RestartStrategy configuration

2019-08-30 Thread Stephan Ewen
+1 in general What is the default in batch, though? No restarts? I always found that somewhat uncommon. Should we also change that part, if we are changing the default anyways? On Fri, Aug 30, 2019 at 2:35 PM Till Rohrmann wrote: > Hi everyone, > > I wanted to discuss how to simplify Flink's c

Re: instable checkpointing after migration to flink 1.8

2019-08-30 Thread Congxian Qiu
CC flink dev mail list Update for those who may be interested in this issue, we'are still diagnosing this problem currently. Best, Congxian Congxian Qiu 于2019年8月29日周四 下午8:58写道: > Hi Bekir > > Currently, from what we have diagnosed, there is some task complete its > checkpoint too late (maybe 1

State of FLIPs

2019-08-30 Thread Chesnay Schepler
The following FLIPs are marked as "Under discussion" in the wiki , but actually seem to be in progress (i.e. have open pull requests) and some even  have code merged to master: * FLIP-36 (Interactive Programming)

[DISCUSS] Simplify Flink's cluster level RestartStrategy configuration

2019-08-30 Thread Till Rohrmann
Hi everyone, I wanted to discuss how to simplify Flink's cluster level RestartStrategy configuration [1]. Currently, Flink's behaviour with respect to configuring the {{RestartStrategies}} is quite complicated and convoluted. The reason for this is that we evolved the way it has been configured an

Re: [DISCUSS] Flink client api enhancement for downstream project

2019-08-30 Thread Kostas Kloudas
Hi all, I am just writing here to let you know that I am working on a POC that tries to refactor the current state of job submission in Flink. I want to stress out that it introduces NO CHANGES to the current behaviour of Flink. It just re-arranges things and introduces the notion of an Executor,

[jira] [Created] (FLINK-13921) Simplify cluster level RestartStrategy configuration

2019-08-30 Thread Till Rohrmann (Jira)
Till Rohrmann created FLINK-13921: - Summary: Simplify cluster level RestartStrategy configuration Key: FLINK-13921 URL: https://issues.apache.org/jira/browse/FLINK-13921 Project: Flink Issue

Re: [DISCUSS] FLIP-60: Restructure the Table API & SQL documentation

2019-08-30 Thread Dawid Wysakowicz
+1 to the idea of restructuring the docs. My only suggestion to consider is how about moving the User-Defined-Extensions subpages to corresponding broader topics? Sources & Sinks >> Connect to external systems Catalogs >> Connect to external systems and then have a Functions sections with subse

Re: [VOTE] FLIP-54: Evolve ConfigOption and Configuration

2019-08-30 Thread Dawid Wysakowicz
+1 to the design On 29/08/2019 15:53, Timo Walther wrote: > I converted the mentioned Google doc into a wiki page: > > https://cwiki.apache.org/confluence/display/FLINK/FLIP-54%3A+Evolve+ConfigOption+and+Configuration > > > The core semantics have not changed. > > Happy voting, > Timo > > On 29.08

Re: [ANNOUNCE] Apache Flink-shaded 8.0 released

2019-08-30 Thread jincheng sun
Thanks a lot Chesnay and to the community for making this release possible ! Cheers, Jincheng Chesnay Schepler 于2019年8月30日周五 下午6:56写道: > The Apache Flink community is very happy to announce the release of > Apache Flink-shaded 8.0. > > The flink-shaded project contains a number of shaded depend

[jira] [Created] (FLINK-13920) Move list of old releases into _config.yml

2019-08-30 Thread Chesnay Schepler (Jira)
Chesnay Schepler created FLINK-13920: Summary: Move list of old releases into _config.yml Key: FLINK-13920 URL: https://issues.apache.org/jira/browse/FLINK-13920 Project: Flink Issue Type

[jira] [Created] (FLINK-13919) Remove 1.6.4 from downloads page and SVN

2019-08-30 Thread Chesnay Schepler (Jira)
Chesnay Schepler created FLINK-13919: Summary: Remove 1.6.4 from downloads page and SVN Key: FLINK-13919 URL: https://issues.apache.org/jira/browse/FLINK-13919 Project: Flink Issue Type:

Re: [DISCUSS] Releasing Flink 1.8.2

2019-08-30 Thread Hequn Cheng
Hi Jincheng, +1 for a 1.8.2 release. Thanks a lot for raising the discussion. It would be nice to have these critical fixes. Best, Hequn On Fri, Aug 30, 2019 at 6:31 PM Maximilian Michels wrote: > Hi Jincheng, > > +1 I would be for a 1.8.2 release such that we can fix the problems with > the

[ANNOUNCE] Apache Flink-shaded 8.0 released

2019-08-30 Thread Chesnay Schepler
The Apache Flink community is very happy to announce the release of Apache Flink-shaded 8.0. The flink-shaded project contains a number of shaded dependencies for Apache Flink. Apache Flink® is an open-source stream processing framework for distributed, high-performing, always-available, and

Re: [DISCUSS] Releasing Flink 1.8.2

2019-08-30 Thread Dian Fu
Hi Jincheng, Thanks a lot for bring up this discussion. +1 for this release. Regards, Dian > 在 2019年8月30日,下午6:31,Maximilian Michels 写道: > > Hi Jincheng, > > +1 I would be for a 1.8.2 release such that we can fix the problems with the > nested closure cleaner which currently block 1.8.1 users

Re: [DISCUSS] Releasing Flink 1.8.2

2019-08-30 Thread Maximilian Michels
Hi Jincheng, +1 I would be for a 1.8.2 release such that we can fix the problems with the nested closure cleaner which currently block 1.8.1 users with Beam: https://issues.apache.org/jira/browse/FLINK-13367 Thanks, Max On 30.08.19 11:25, jincheng sun wrote: Hi Flink devs, It has been near

[jira] [Created] (FLINK-13918) Add class for TokenizerMapper.

2019-08-30 Thread Xu Yang (Jira)
Xu Yang created FLINK-13918: --- Summary: Add class for TokenizerMapper. Key: FLINK-13918 URL: https://issues.apache.org/jira/browse/FLINK-13918 Project: Flink Issue Type: Sub-task Component

[jira] [Created] (FLINK-13917) Add class for RegexTokenizerMapper.

2019-08-30 Thread Xu Yang (Jira)
Xu Yang created FLINK-13917: --- Summary: Add class for RegexTokenizerMapper. Key: FLINK-13917 URL: https://issues.apache.org/jira/browse/FLINK-13917 Project: Flink Issue Type: Sub-task C

[jira] [Created] (FLINK-13916) Add class for NGramMapper.

2019-08-30 Thread Xu Yang (Jira)
Xu Yang created FLINK-13916: --- Summary: Add class for NGramMapper. Key: FLINK-13916 URL: https://issues.apache.org/jira/browse/FLINK-13916 Project: Flink Issue Type: Sub-task Components

Re: [DISCUSS] FLIP-59: Enable execution configuration from Configuration object

2019-08-30 Thread Dawid Wysakowicz
Hi Gyula, Thank you for the support on those changes. I am not sure if I understood your idea for the "reconfiguration" logic. The configure method on those objects would take ConfigurationReader. So user can provide a thin wrapper around Configuration for e.g. filtering certain logic, changing

[jira] [Created] (FLINK-13915) Add class for DocWordSplitCount.

2019-08-30 Thread Xu Yang (Jira)
Xu Yang created FLINK-13915: --- Summary: Add class for DocWordSplitCount. Key: FLINK-13915 URL: https://issues.apache.org/jira/browse/FLINK-13915 Project: Flink Issue Type: Sub-task Comp

[jira] [Created] (FLINK-13914) Add class for DocHashTFVectorizerMapper.

2019-08-30 Thread Xu Yang (Jira)
Xu Yang created FLINK-13914: --- Summary: Add class for DocHashTFVectorizerMapper. Key: FLINK-13914 URL: https://issues.apache.org/jira/browse/FLINK-13914 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-13913) Add class for NLPConstant.

2019-08-30 Thread Xu Yang (Jira)
Xu Yang created FLINK-13913: --- Summary: Add class for NLPConstant. Key: FLINK-13913 URL: https://issues.apache.org/jira/browse/FLINK-13913 Project: Flink Issue Type: Sub-task Components

[jira] [Created] (FLINK-13912) Remove ClusterClient#getClusterConnectionInfo

2019-08-30 Thread TisonKun (Jira)
TisonKun created FLINK-13912: Summary: Remove ClusterClient#getClusterConnectionInfo Key: FLINK-13912 URL: https://issues.apache.org/jira/browse/FLINK-13912 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-13911) Add the interface of ModelDataConverter, and several base classes that implement this interface.

2019-08-30 Thread Xu Yang (Jira)
Xu Yang created FLINK-13911: --- Summary: Add the interface of ModelDataConverter, and several base classes that implement this interface. Key: FLINK-13911 URL: https://issues.apache.org/jira/browse/FLINK-13911

[DISCUSS] Releasing Flink 1.8.2

2019-08-30 Thread jincheng sun
Hi Flink devs, It has been nearly 2 months since the 1.8.1 released. So, what do you think about releasing Flink 1.8.2 soon? We already have some blocker and critical fixes in the release-1.8 branch: [Blocker] - FLINK-13159 java.lang.ClassNotFoundException when restore job - FLINK-10368 'Kerberi

[DISCUSS] FLIP-60: Restructure the Table API & SQL documentation

2019-08-30 Thread Timo Walther
Hi everyone, the Table API & SQL documentation was already in a very good shape in Flink 1.8. However, in the past it was mostly presented as an addition to DataStream API. As the Table and SQL world is growing quickly, stabilizes in its concepts, and is considered as another top-level API an

[jira] [Created] (FLINK-13910) Many serializable classes have no explicit 'serialVersionUID'

2019-08-30 Thread Yun Tang (Jira)
Yun Tang created FLINK-13910: Summary: Many serializable classes have no explicit 'serialVersionUID' Key: FLINK-13910 URL: https://issues.apache.org/jira/browse/FLINK-13910 Project: Flink Issue

Re: [PROPOSAL] Force rebase on master before merge

2019-08-30 Thread Chesnay Schepler
I think this is a non-issue; every committer I know checks beforehand if the build passes. Piotr has provided good arguments for why this approach isn't practical. Additionally, there are simply technical limitations that prevent this from working as expected. a) we cannot attach Travis check

Re: [PROPOSAL] Force rebase on master before merge

2019-08-30 Thread Piotr Nowojski
Hi, Thanks for the proposal. I have similar concerns as Kurt. If we enforced such rule I would be afraid that everybody would be waiting for tests on his PR to complete, racing others committers to be “the first guy that clicks the merge button”, then forcing all of the others to rebase manual

[jira] [Created] (FLINK-13909) LinkElement does not support different anchors required for localization

2019-08-30 Thread Till Rohrmann (Jira)
Till Rohrmann created FLINK-13909: - Summary: LinkElement does not support different anchors required for localization Key: FLINK-13909 URL: https://issues.apache.org/jira/browse/FLINK-13909 Project: F

[jira] [Created] (FLINK-13907) Master documentation cannot be built

2019-08-30 Thread Chesnay Schepler (Jira)
Chesnay Schepler created FLINK-13907: Summary: Master documentation cannot be built Key: FLINK-13907 URL: https://issues.apache.org/jira/browse/FLINK-13907 Project: Flink Issue Type: Bug

[jira] [Created] (FLINK-13908) Broken markdown of "Breaking the lines of too long statements" section

2019-08-30 Thread Dawid Wysakowicz (Jira)
Dawid Wysakowicz created FLINK-13908: Summary: Broken markdown of "Breaking the lines of too long statements" section Key: FLINK-13908 URL: https://issues.apache.org/jira/browse/FLINK-13908 Projec

[DISCUSS] StreamingFile with ParquetBulkWriter bucketing limitations

2019-08-30 Thread Enrico Agnoli
StreamingFile limitations Hi community, I'm working toward the porting of our code from `BucketingSink<>` to `StreamingFileSink`. In this case we use the sink to write AVRO via Parquet and the suggested implementation of the Sink should be something like: ``` val parquetWriterFactory = Parquet

[DISCUSS] Rework Behavior of "within" In CEP Library

2019-08-30 Thread Yufei Liu
Hi all, I‘ve got several troubles when I use library CEP. 1. The funtion "within" in PatternAPI is kind of misleading. I can set within time in each part of pattern, but only the smallest one is functional. Pattern.begin("begin").where(...) .followBy("middle0").where(...).within(Time.second(1)

Re: [DISCUSS] FLIP-54: Evolve ConfigOption and Configuration

2019-08-30 Thread Dawid Wysakowicz
Hi, Ad. 1 The advantage of our approach is that you have the type definition close to the option definition. The only difference is that it enables expressing simple pojos with the primitives like ConfigOption, ConfigOption etc. Otherwise as Timo said you will start having the parsing logic scat

Re: How to handle Flink Job with 400MB+ Uberjar with 800+ containers ?

2019-08-30 Thread Zhu Zhu
One optimization that we take is letting yarn to reuse the flink-dist jar which was localized when running previous jobs. Thanks, Zhu Zhu Jörn Franke 于2019年8月30日周五 下午4:02写道: > Increase replication factor and/or use HDFS cache > https://hadoop.apache.org/docs/r2.4.1/hadoop-project-dist/hadoop-hd

Re: How to handle Flink Job with 400MB+ Uberjar with 800+ containers ?

2019-08-30 Thread Jörn Franke
Increase replication factor and/or use HDFS cache https://hadoop.apache.org/docs/r2.4.1/hadoop-project-dist/hadoop-hdfs/CentralizedCacheManagement.html Try to reduce the size of the Jar, eg the Flink libraries do not need to be included. > Am 30.08.2019 um 01:09 schrieb Elkhan Dadashov : > > De

[jira] [Created] (FLINK-13906) ExecutionConfigTests.test_equals_and_hash failed on Travis

2019-08-30 Thread Till Rohrmann (Jira)
Till Rohrmann created FLINK-13906: - Summary: ExecutionConfigTests.test_equals_and_hash failed on Travis Key: FLINK-13906 URL: https://issues.apache.org/jira/browse/FLINK-13906 Project: Flink

Re: How to handle Flink Job with 400MB+ Uberjar with 800+ containers ?

2019-08-30 Thread Jeff Zhang
I can think of 2 approaches: 1. Allow flink to specify the replication of the submitted uber jar. 2. Allow flink to specify config flink.yarn.lib which is all the flink related jars that are hosted on hdfs. This way users don't need to build and submit a fat uber jar every time. And those flink ja

Re: Using Avro SpecficRecord serialization instead of slower ReflectDatumWriter/GenericDatumWriter

2019-08-30 Thread Till Rohrmann
Hi Roshan, these kind of questions should be posted to Flink's user mailing list. I've cross posted it now. If you are using Flink's latest version and your type extends `SpecificRecord`, then Flink's AvroSerializer should use the `SpecificDatumWriter`. If this is not the case, then this sounds l

Re: [DISCUSS] FLIP-54: Evolve ConfigOption and Configuration

2019-08-30 Thread Timo Walther
Hi Becket, thanks for the discussion. 1. ConfigOptions in their current design are bound to classes. Regarding, the option is "creating some Configurable objects instead of defining the config to create those Configurable"? We just moved this logic to a factory, this factory can then also be

Re: [PROPOSAL] Force rebase on master before merge

2019-08-30 Thread Till Rohrmann
Hi Tison, thanks for starting this discussion. In general, I'm in favour of automations which remove human mistakes out of the equation. Do you know how these status checks work concretely? Will Github reject commits for which there is no passed Travis run? How would hotfix commits being distingu

Re: How to handle Flink Job with 400MB+ Uberjar with 800+ containers ?

2019-08-30 Thread Till Rohrmann
For point 2. there exists already a JIRA issue [1] and a PR. I hope that we can merge it during this release cycle. [1] https://issues.apache.org/jira/browse/FLINK-13184 Cheers, Till On Fri, Aug 30, 2019 at 4:06 AM SHI Xiaogang wrote: > Hi Datashov, > > We faced similar problems in our product

Re: [DISCUSS] FLIP-50: Spill-able Heap Keyed State Backend

2019-08-30 Thread Yu Li
Done. Thanks for the reminder Tison! Best Regards, Yu On Thu, 29 Aug 2019 at 21:03, Zili Chen wrote: > Hi Yu, > > Notice that the wiki is still marked as "*Under Discussion*" state. > > I think you can update it correspondingly. > > Best, > tison. > > > Yu Li 于2019年8月20日周二 下午10:28写道: > > > So