[jira] [Created] (FLINK-14703) Port the Kafka SQL related tests.

2019-11-11 Thread Zheng Hu (Jira)
Zheng Hu created FLINK-14703:


 Summary: Port the Kafka SQL related tests.
 Key: FLINK-14703
 URL: https://issues.apache.org/jira/browse/FLINK-14703
 Project: Flink
  Issue Type: Sub-task
Reporter: Zheng Hu


Port the three Bash SQL Kafka test cases:
1. test_sql_client_kafka.sh
2. test_sql_client_kafka010.sh
3. test_sql_client_kafka011.sh

The rough patch of Kafka SQL e2e tests is here: 
https://github.com/openinx/flink/commit/24b7a6e34d1e504bc7a411781b43c996644c0910

Once the framework pull request (https://github.com/apache/flink/pull/10106)
gets merged, I will create the PR.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (FLINK-14704) Remove useless SynchronousSavepointLatch

2019-11-11 Thread Yun Tang (Jira)
Yun Tang created FLINK-14704:


 Summary: Remove useless SynchronousSavepointLatch
 Key: FLINK-14704
 URL: https://issues.apache.org/jira/browse/FLINK-14704
 Project: Flink
  Issue Type: Improvement
Reporter: Yun Tang
 Fix For: 1.11.0


After FLINK-12482, checkpoints are triggered and run via the mailbox on the
task side, which means the previous {{SynchronousSavepointLatch}} is no
longer useful.

We should remove {{SynchronousSavepointLatch}} and its test 
{{SynchronousSavepointSyncLatchTest}}.





Re: Flink Read thousands of files with batch

2019-11-11 Thread Zhu Zhu
Hi Dominik,

Would you check the JM GC status?
One possible cause is that the large number of file metas
in HadoopInputFormat is burdening the JM memory.

`akka.ask.timeout` is the default RPC timeout, while some RPCs may override
this timeout for their own purpose. e.g. the RPCs from web usually use
`web.timeout`.
Providing the detailed call stack of the AskTimeoutException may help to
identify where this timeout happened.

Thanks,
Zhu Zhu

Dominik Wosiński wrote on Mon, Nov 11, 2019 at 3:35 AM:

> Hey,
> I have a very specific use case. I have a history of records stored as
> Parquet in S3. I would like to read and process them with Flink. The issue
> is that the number of files is quite large (>100k). If I provide the full
> list of files to the HadoopInputFormat that I am using, it fails with an
> AskTimeoutException, which is weird since I am using YARN and setting
> -yD akka.ask.timeout=600s. Even though, according to the logs, the setting
> is processed properly, the job execution still fails with an
> AskTimeoutException after 10s. I have managed to work around this by
> grouping files and reading them in a loop, so that I end up with a
> Seq[DataSet]. But if I try to union those datasets, I
> receive the AskTimeoutException again. So my question is: what can be the
> reason behind this exception being thrown, and why is the setting ignored
> even though it is parsed properly?
>
> I will be glad for any help.
>
> Best Regards,
> Dom.
>
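
The workaround described in the quoted message (grouping the file list and
reading each group separately) can be sketched roughly as below. This is only
an illustration in Python, not the poster's Scala code: `chunk_paths`, the
group size, and the `s3://example-bucket/...` paths are assumptions made up
for the example, and each group would still have to be handed to its own
input format instance.

```python
from typing import Iterator, List, Sequence


def chunk_paths(paths: Sequence[str], group_size: int) -> Iterator[List[str]]:
    """Yield consecutive groups of at most `group_size` paths."""
    if group_size <= 0:
        raise ValueError("group_size must be positive")
    for start in range(0, len(paths), group_size):
        yield list(paths[start:start + group_size])


# Hypothetical file list standing in for the >100k Parquet files on S3.
paths = [f"s3://example-bucket/history/part-{i:06d}.parquet" for i in range(10)]

# Each group would be read separately instead of passing one huge list.
groups = list(chunk_paths(paths, group_size=4))
```

Note that, per the quoted message, unioning the per-group datasets triggered
the timeout again, so this only illustrates the grouping step and is not a
confirmed fix.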


Re: [ANNOUNCE] Jark Wu is now part of the Flink PMC

2019-11-11 Thread Rui Li
Congrats Jark!

On Mon, Nov 11, 2019 at 2:49 PM Yadong Xie  wrote:

> Congratulations Jark!
>
> Danny Chan wrote on Mon, Nov 11, 2019 at 10:41 AM:
>
> > Congrats Jark! Well deserved !
> >
> > Best,
> > Danny Chan
> > On Nov 11, 2019 at 9:59 AM +0800, dev@flink.apache.org wrote:
> > >
> > > Congrats Jark!
> >
>


-- 
Best regards!
Rui Li


[jira] [Created] (FLINK-14705) Remove nullables argument of BatchTestBase.registerCollection in blink planner

2019-11-11 Thread Danny Chen (Jira)
Danny Chen created FLINK-14705:

 Summary: Remove nullables argument of 
BatchTestBase.registerCollection in blink planner
 Key: FLINK-14705
 URL: https://issues.apache.org/jira/browse/FLINK-14705
 Project: Flink
  Issue Type: Improvement
Reporter: Danny Chen


The registerCollection method uses TypeInformation to register collections.
TypeInformation does not carry a nullability attribute, so we expect to use the
new DataType instead.

So for these test cases, 





[jira] [Created] (FLINK-14706) Remove legacy KvStateMessage

2019-11-11 Thread Yun Tang (Jira)
Yun Tang created FLINK-14706:


 Summary: Remove legacy KvStateMessage
 Key: FLINK-14706
 URL: https://issues.apache.org/jira/browse/FLINK-14706
 Project: Flink
  Issue Type: Sub-task
Reporter: Yun Tang
 Fix For: 1.10.0


All {{KvStateMessage}} classes are actor messages for {{InternalKvState}} lookup
and registration. Since the actor gateway was removed in FLINK-12323, we should
also remove these legacy actor messages.





[jira] [Created] (FLINK-14707) Refactor checkpoint related methods within Environment

2019-11-11 Thread Yun Tang (Jira)
Yun Tang created FLINK-14707:


 Summary: Refactor checkpoint related methods within Environment
 Key: FLINK-14707
 URL: https://issues.apache.org/jira/browse/FLINK-14707
 Project: Flink
  Issue Type: Improvement
Reporter: Yun Tang
 Fix For: 1.10.0


After FLINK-7720 was fixed, the two {{Environment#acknowledgeCheckpoint}}
methods became effectively useless, and their usage has been superseded by
{{TaskStateManager#reportTaskStateSnapshots}}.
More generally, we should treat the {{Environment}} interface as a getter for
accessing components rather than a place for real actions. I prefer to remove
the {{acknowledgeCheckpoint}} and {{declineCheckpoint}} methods, or at least
deprecate them.





Re: [DISCUSS] Stateful Functions - Contribution Details

2019-11-11 Thread Stephan Ewen
Thanks, all, for the discussion!

About the name:

  - Like Igal mentioned, the name "Stateful Functions" and the abbreviation
"statefun" underwent some iterations and testing with a small sample of
developers from a few companies.
If anyone has an amazing suggestion for another name, please share.
Would be great to also test it with a small sample of developers from a few
companies, just to make sure we have at least a bit of outside feedback.

  - fun vs. fn vs. func: I think these are more or less equivalent, there
are examples of each one in some language. Working with the code over the
last months, we found "statefun" to be somehow appealing.
Maybe as a datapoint, Beam uses "DoFn" but pronounces it "doo-fun". So,
why not go with "fun" directly?

About mailing lists:

  - There are pros and cons to separating the mailing lists.
  - Having the same mailing lists gives synergies around questions for
operating the system.
  - Having the same mailing lists can create confusion. For example,
statefun uses a simpler, more restrictive, easier to understand
serialization scheme. Answers coming from serialization in Flink core can
easily be confusing there.
  - Having the same mailing lists can be overwhelming for developers that
are new and only interested in that particular angle.
  - Having a different dev mailing list only makes sense if we use a
different Jira project, because FLINK-X issue creation is linked to the
mailing list.

  => I think it is fine to start with the same mailing list and observe
first. If we find it problematic, we can separate the mailing lists.

About the repository name:

  - The project is still called "Stateful Functions", but that is a
mouthful, so it would be nice to have something more concise for the repo name,
hence the suggestion for "statefun".
  - @Chesnay - Are you concerned about the project name (Stateful
Functions) or the abbreviation (statefun) ?

Best,
Stephan




On Mon, Nov 11, 2019 at 6:21 AM tison  wrote:

> I second Chesnay's opinions. The one I'd like to pick up is that I highly
> recommend reusing the existing mailing lists. We can always create a separate
> list when the specific community grows, but it is hard to go in the opposite
> direction.
>
> I'm not attached to the name, but I cast my vote for "statefun". Playing with
> statefun will be fun, I think :-) (Generally, Erlang uses "fun", Go uses "func"
> and Rust uses "fn"; I don't find a strong reason that "func" is an objectively
> better choice.)
>
> Best,
> tison.
>
>
> Xuefu Z wrote on Sat, Nov 9, 2019 at 4:16 AM:
>
> > Regarding the package name, etc:
> >
> > statefun certainly sounds more interesting, but it's confusing in my
> > opinion and doesn't reflect its true nature. A letter "c" at the end may
> > help, as "func" is more commonly used as short for "function" in CS.
> >
> > Thanks,
> > Xuefu
> >
> > On Fri, Nov 8, 2019 at 3:52 AM Igal Shilman  wrote:
> >
> > > Hi Chesnay,
> > >
> > > The correct link for [1] is:
> > >
> > > http://mail-archives.apache.org/mod_mbox/flink-dev/201911.mbox/%3CCANC1h_vicBWQSGws6Q%2BTXJXde0K%2BAMoVN4VqGU_Hykb1N7J8ng%40mail.gmail.com%3E
> > >
> > > 1) There is no relevant post; this is the name that is currently used both
> > > for the website and internally.
> > > The name is not the original name; it evolved out of internal
> > > discussions and A/B testing with a few early users. This name
> > > was able to "position" the project at the correct place better than others.
> > > If more people feel unconvinced, or you strongly oppose it,
> > > then we can create a separate discussion thread.
> > >
> > > 4)  Ok, I will change the proposal to option (b).
> > >
> > > Kind regards,
> > > Igal.
> > >
> > > On Thu, Nov 7, 2019 at 5:29 PM Chesnay Schepler 
> > > wrote:
> > >
> > > > [1] Does not directly link to the voting thread.
> > > >
> > > > 1) I skimmed all 3 threads about the stateful functions proposal and
> > > > could not find a rationale for the repository name, I'd appreciate a
> > > > direct link to the relevant post.
> > > >
> > > > 2.1) +1 as we use o.a.f also for flink-shaded
> > > >
> > > > 3) +1 as it follows the existing package conventions for libraries.
> > > >
> > > > 4) b; I see no reason why we would isolate mailing lists when we haven't
> > > > done so for the myriad of other components that are largely independent
> > > > from each other (like SQL).
> > > > There are some practical issues here with having a separate dev ML, for
> > > > example where to send FLIPs or release threads and ensuring they reach a
> > > > large enough audience, which a dedicated ML would likely hinder.
> > > > I'm currently also assuming that builds/commits also go to the general
> > > > flink MLs, making it even weirder if just dev were spliced out.
> > > >
> > > > 5) separate component, like "API / Statefun"
> > > >
> > > > Personally I'm not sold on the "statefun" name, has this been a
> > > > discussion item in one o

[jira] [Created] (FLINK-14708) Introduce full restarts failover strategy for NG scheduler

2019-11-11 Thread Zhu Zhu (Jira)
Zhu Zhu created FLINK-14708:

 Summary: Introduce full restarts failover strategy for NG scheduler
 Key: FLINK-14708
 URL: https://issues.apache.org/jira/browse/FLINK-14708
 Project: Flink
  Issue Type: Sub-task
  Components: Runtime / Coordination
Affects Versions: 1.10.0
Reporter: Zhu Zhu
 Fix For: 1.10.0


This strategy allows users to force all the tasks to be restarted if any task
fails.
It may help in cases where it makes no sense, or is even harmful, for only part
of the tasks to acquire resources and process data. In such cases, checkpoints
cannot be completed and progress cannot be persisted.

Supporting this strategy also enables the NG scheduler to be backward
compatible with the existing failover strategy configuration.
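
As a rough illustration of the idea (not Flink's actual scheduler code), a
full-restart strategy can be thought of as a function that ignores which task
failed and returns every task in the job. The sketch below is in Python; the
names `restart_all` and `restart_region`, the task names, and the region
layout are all hypothetical and exist only to contrast the two behaviours.

```python
from typing import Dict, FrozenSet, Set


def restart_all(failed_task: str, all_tasks: Set[str]) -> Set[str]:
    """Full-restart strategy: a single failure restarts every task."""
    if failed_task not in all_tasks:
        raise KeyError(f"unknown task: {failed_task}")
    return set(all_tasks)


def restart_region(failed_task: str, regions: Dict[str, FrozenSet[str]]) -> Set[str]:
    """Contrast case (assumed): restart only the failed task's region."""
    return set(regions[failed_task])


# Hypothetical job with two independent regions.
region_a = frozenset({"source-a", "map-a"})
region_b = frozenset({"source-b", "map-b"})
regions = {task: region for region in (region_a, region_b) for task in region}
all_tasks = set(region_a | region_b)

full = restart_all("map-a", all_tasks)        # every task in the job
partial = restart_region("map-a", regions)    # only region_a
```

With the full-restart variant, no task keeps running while the rest of the job
is unable to make progress, which is the situation the description above is
concerned with.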





[jira] [Created] (FLINK-14709) Allow outputting elements in close method of chained drivers.

2019-11-11 Thread David Moravek (Jira)
David Moravek created FLINK-14709:

 Summary: Allow outputting elements in close method of chained 
drivers.
 Key: FLINK-14709
 URL: https://issues.apache.org/jira/browse/FLINK-14709
 Project: Flink
  Issue Type: Improvement
  Components: Runtime / Task
Affects Versions: 1.9.1, 1.8.1, 1.7.2
Reporter: David Moravek


Currently, BatchTask and DataSourceTask only allow outputting elements in the
close method of "rich" operators that they execute directly.

Task workflow is as follows:
1) open "head" driver  (calls "open" method on udf)
2) open chained drivers
3) run "head" driver
4) close "head" driver (calls "close" method on udf)
5) close output collector (no elements can be collected after this point)
6) close chained drivers

In order to properly support outputs from the close method, we want to swap
steps 5) and 6). We also need to tweak the implementation of the Reduce /
Combine chained drivers, because they dispose sorters in the closeTask method
(this should be done in the close method).

This would bring a huge performance improvement for Beam users, because we could
properly implement bundling on batch (whole partition = single bundle).
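
The effect of the step order can be illustrated with a small simulation. This
is a sketch in Python under stated assumptions, not Flink code: the classes
`OutputCollector` and `ChainedDriver`, the function `run_task`, and the choice
to raise an error on collecting after close are all made up for the example.
The chained driver buffers its input and only emits it in `close`, which is
the bundling pattern mentioned above.

```python
from typing import List


class OutputCollector:
    """Toy collector that rejects elements once it has been closed."""

    def __init__(self) -> None:
        self.collected: List[str] = []
        self.closed = False

    def collect(self, element: str) -> None:
        if self.closed:
            raise RuntimeError("collector already closed")
        self.collected.append(element)

    def close(self) -> None:
        self.closed = True


class ChainedDriver:
    """Toy chained driver that buffers input and flushes it on close."""

    def __init__(self, out: OutputCollector) -> None:
        self.out = out
        self.buffer: List[str] = []

    def collect(self, element: str) -> None:
        self.buffer.append(element)

    def close(self) -> None:
        for element in self.buffer:
            self.out.collect(element)
        self.buffer.clear()


def run_task(close_drivers_first: bool) -> List[str]:
    out = OutputCollector()
    driver = ChainedDriver(out)
    for element in ["a", "b"]:  # step 3: the head driver feeds the chain
        driver.collect(element)
    if close_drivers_first:     # proposed order: drivers, then collector
        driver.close()
        out.close()
    else:                       # current order: collector, then drivers
        out.close()
        driver.close()
    return out.collected


fixed = run_task(close_drivers_first=True)
try:
    run_task(close_drivers_first=False)
    current_order_failed = False
except RuntimeError:
    current_order_failed = True
```

In this toy model the proposed order delivers both buffered elements, while
the current order fails as soon as the chained driver tries to emit from
`close`.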






Re: [VOTE] FLIP-79: Flink Function DDL Support (1.10 Release Feature Only)

2019-11-11 Thread Shuyi Chen
+1 (binding)

On Sat, Nov 9, 2019 at 11:17 PM Kurt Young  wrote:

> +1 (binding)
>
> Best,
> Kurt
>
>
> On Sun, Nov 10, 2019 at 12:25 PM Peter Huang 
> wrote:
>
> > Hi Yu,
> >
> > Thanks for your reminder about the timeline of delivering the basic
> > function DDL in release 1.10.
> > As I replied to Xuefu, "CREATE FUNCTION" and "DROP FUNCTION" can be
> > achieved relatively easily by revising the existing PR.
> > I probably also need to start working on a basic version of the PR for
> > "ALTER FUNCTION" and "SHOW FUNCTIONS".
> > Please let me know if you have any suggestions to better align the timeline
> > of the ongoing catalog-related efforts.
> >
> > Best Regards
> > Peter Huang
> >
> >
> > On Sat, Nov 9, 2019 at 7:26 PM Yu Li  wrote:
> >
> > > Thanks for driving this Peter!
> > >
> > > I agree it would be great if we could include this feature in 1.10.
> > > However, FWIW, since we are following the time-based release policy [1] and
> > > the 1.10 release is approaching its feature freeze (planned for the end of
> > > November) [2], I'm a little bit concerned about the schedule.
> > >
> > > [1] https://cwiki.apache.org/confluence/display/FLINK/Time-based+releases
> > > [2] http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Features-for-Apache-Flink-1-10-td32824.html
> > >
> > > Best Regards,
> > > Yu
> > >
> > >
> > > On Sat, 9 Nov 2019 at 04:12, Xuefu Z  wrote:
> > >
> > > > Hi Peter,
> > > >
> > > > Thanks for driving this. I'm all-in for this. However, as I read the latest
> > > > FLIP, I have a couple of questions/comments:
> > > >
> > > > 1. It seems that "JVM" is proposed as a language type in parallel to
> > > > python. I'm not sure that's very intuitive. JVM stands for "java virtual
> > > > machine", so the language is really "JAVA", correct? I know "scala" is also
> > > > a language which can generate java byte code that the JVM can execute.
> > > >
> > > > 2. In the FLIP, "SHOW FUNCTIONS" and "ALTER FUNCTION" are mentioned but I
> > > > don't see any implementation plan, either in 1.10 or beyond. I
> > > > think we could have more clarification on those.
> > > >
> > > > Thanks,
> > > > Xuefu
> > > >
> > > > On Fri, Nov 8, 2019 at 10:36 AM Bowen Li wrote:
> > > >
> > > > > Peter and I went thru the details and defined scope/plan for 1.10 offline
> > > > > in the last few days. +1 (binding) from my side.
> > > > >
> > > > > On Fri, Nov 8, 2019 at 12:55 AM Terry Wang wrote:
> > > > >
> > > > > > Thanks Peter for driving this. LGTM for 1.10 release feature.
> > > > > >
> > > > > > +1 from my side. (non-binding)
> > > > > >
> > > > > > Best,
> > > > > > Terry Wang
> > > > > >
> > > > > >
> > > > > >
> > > > > > > On Nov 8, 2019, at 13:20, Peter Huang wrote:
> > > > > > >
> > > > > > > Dear All,
> > > > > > >
> > > > > > > I would like to start the vote for the 1.10 release features in FLIP-79 [1],
> > > > > > > which was discussed and reached consensus in the discussion thread [2]. For
> > > > > > > the advanced features, such as loading functions from remote resources and
> > > > > > > supporting scala/python functions, we will have further discussion after
> > > > > > > release 1.10.
> > > > > > >
> > > > > > > The vote will be open for at least 72 hours. If the voting passes, I will
> > > > > > > close it by 2019-11-10 14:00 UTC.
> > > > > > >
> > > > > > > [1] https://cwiki.apache.org/confluence/display/FLINK/FLIP-79+Flink+Function+DDL+Support
> > > > > > > [2] http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/Discussion-FLIP-79-Flink-Function-DDL-Support-td33965.html
> > > > > > >
> > > > > > > Best Regards
> > > > > > > Peter Huang
> > > > > >
> > > > > >
> > > > >
> > > >
> > > >
> > > > --
> > > > Xuefu Zhang
> > > >
> > > > "In Honey We Trust!"
> > > >
> > >
> >
>


Re: [VOTE] FLIP-79: Flink Function DDL Support (1.10 Release Feature Only)

2019-11-11 Thread Xuefu Z
+1 (non-binding)

On Mon, Nov 11, 2019 at 9:54 AM Shuyi Chen  wrote:

> +1 (binding)


-- 
Xuefu Zhang

"In Honey We Trust!"


Re: [VOTE] FLIP-79: Flink Function DDL Support (1.10 Release Feature Only)

2019-11-11 Thread vino yang
+1 (non-binding)

Best,
Vino

Xuefu Z wrote on Tue, Nov 12, 2019 at 3:27 AM:

> +1 (non-binding)


Re: Flink Read thousands of files with batch

2019-11-11 Thread tison
I suspect you are hitting a client submission failure, which also throws
AskTimeoutException.

The related configuration option is `akka.client.timeout`, which you can
increase. However, there were some cases where the problem could be resolved
by upgrading Java to at least version 8u212.

Best,
tison.


Zhu Zhu wrote on Mon, Nov 11, 2019 at 6:03 PM:

> Hi Dominik,
>
> Would you check the JM GC status?
> One possible cause is that the large number of file metas
> in HadoopInputFormat is burdening the JM memory.
>
> `akka.ask.timeout` is the default RPC timeout, while some RPCs may override
> this timeout for their own purpose. e.g. the RPCs from web usually use
> `web.timeout`.
> Providing the detailed call stack of the AskTimeoutException may help to
> identify where this timeout happened.
>
> Thanks,
> Zhu Zhu


Re: [VOTE] FLIP-79: Flink Function DDL Support (1.10 Release Feature Only)

2019-11-11 Thread Jingsong Li
+1 (non-binding)

Best,
Jingsong Lee

On Tue, Nov 12, 2019 at 9:49 AM vino yang  wrote:

> +1 (non-binding)
>
> Best,
> Vino
>
> Xuefu Z  于2019年11月12日周二 上午3:27写道:
>
> > +1 (non-binding)
> >
> > On Mon, Nov 11, 2019 at 9:54 AM Shuyi Chen  wrote:
> >
> > > +1 (binding)
> > >
> > > On Sat, Nov 9, 2019 at 11:17 PM Kurt Young  wrote:
> > >
> > > > +1 (binding)
> > > >
> > > > Best,
> > > > Kurt
> > > >
> > > >
> > > > On Sun, Nov 10, 2019 at 12:25 PM Peter Huang <
> > huangzhenqiu0...@gmail.com
> > > >
> > > > wrote:
> > > >
> > > > > Hi Yu,
> > > > >
> > > > > Thanks for your reminder about the timeline of delivering the basic
> > > > > function DDL in release 1.10.
> > > > > As I replied to Xuefu, the "CREATE FUNCTION" and "DROP FUNCTION"
> can
> > > > > relatively easy achieve by revising the existing PR.
> > > > > Definitely, I probably need to start to work on a basic version of
> PR
> > > for
> > > > > "ALTER FUNCTION" and "SHOW FUNCTIONS".
> > > > > Please let me know if you have any suggestion to better align the
> > > > timeline
> > > > > of the ongoing catalog related efforts.
> > > > >
> > > > > Best Regards
> > > > > Peter Huang
> > > > >
> > > > >
> > > > > On Sat, Nov 9, 2019 at 7:26 PM Yu Li  wrote:
> > > > >
> > > > > > Thanks for driving this Peter!
> > > > > >
> > > > > > I agree it would be great if we could include this feature in
> 1.10.
> > > > > > However, FWIW, since we are following the time-based release
> policy
> > > [1]
> > > > > and
> > > > > > 1.10 release is approaching its feature freeze (planned to be at
> > the
> > > > end
> > > > > of
> > > > > > November) [2], I'm a little bit concerned about the schedule.
> > > > > >
> > > > > > [1]
> > > > >
> > https://cwiki.apache.org/confluence/display/FLINK/Time-based+releases
> > > > > > [2]
> > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Features-for-Apache-Flink-1-10-td32824.html
> > > > > >
> > > > > > Best Regards,
> > > > > > Yu
> > > > > >
> > > > > >
> > > > > > On Sat, 9 Nov 2019 at 04:12, Xuefu Z  wrote:
> > > > > >
> > > > > > > Hi Peter,
> > > > > > >
> > > > > > > Thanks for driving this. I'm all-in for this. However, as I
> read
> > > the
> > > > > > latest
> > > > > > > FLIP, I have a couple of questions/comments:
> > > > > > >
> > > > > > > 1. It seems that "JVM" is proposed as a language type in
> parallel
> > > to
> > > > > > > python. I'm not sure that's very intuitive. JVM stands for
> "java
> > > > > virtual
> > > > > > > machine", so the language is really "JAVA", correct? I know
> > "scala"
> > > > is
> > > > > > also
> > > > > > > a language which can be generate java byte code that JVM can
> > > execute.
> > > > > > >
> > > > > > > 2. In the flip, "SHOW FUNCTIONS" and "ALTER FUNCTION" are
> > mentioned
> > > > > but I
> > > > > > > don't see there is any implementation plan, either in 1.10 or
> > > > beyond. I
> > > > > > > think we could have more clarification on those.
> > > > > > >
> > > > > > > Thanks,
> > > > > > > Xuefu
> > > > > > >
> > > > > > > On Fri, Nov 8, 2019 at 10:36 AM Bowen Li 
> > > > wrote:
> > > > > > >
> > > > > > > > Peter and I went thru the details and defined scope/plan for
> > 1.10
> > > > > > offline
> > > > > > > > in the last few days. +1 (binding) from my side.
> > > > > > > >
> > > > > > > > On Fri, Nov 8, 2019 at 12:55 AM Terry Wang <
> zjuwa...@gmail.com
> > >
> > > > > wrote:
> > > > > > > >
> > > > > > > > > Thanks Peter driving on this. LGTM for 1.10 release
> feature.
> > > > > > > > >
> > > > > > > > > +1 from my side. (non-binding)
> > > > > > > > >
> > > > > > > > > Best,
> > > > > > > > > Terry Wang
> > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > > > On Nov 8, 2019, at 13:20, Peter Huang wrote:
> > > > > > > > > >
> > > > > > > > > > Dear All,
> > > > > > > > > >
> > > > > > > > > > I would like to start the vote for 1.10 release features
> in
> > > > > FLIP-79
> > > > > > > [1]
> > > > > > > > > > which is discussed and research consensus in the
> discussion
> > > > > thread
> > > > > > > [2].
> > > > > > > > > For
> > > > > > > > > > the advanced feature, such as loading function from
> remote
> > > > > > resources,
> > > > > > > > > > support scala/python function, we will have the further
> > > > > discussion
> > > > > > > > after
> > > > > > > > > > release 1.10.
> > > > > > > > > >
> > > > > > > > > > The vote will be open for at least 72 hours. If the
> voting
> > > > > passes,
> > > > > > I
> > > > > > > > will
> > > > > > > > > > close it by 2019-11-10 14:00 UTC.
> > > > > > > > > >
> > > > > > > > > > [1]
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-79+Flink+Function+DDL+Support
> > > > > > > > > > [2]
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://apache-flink-mailing-list-archi

Re: [DISCUSS] PyFlink User-Defined Function Resource Management

2019-11-11 Thread Dian Fu
Hi Jincheng,

Thanks for the reply and also looking forward to the feedback from the 
community.

Thanks,
Dian

> On Nov 11, 2019, at 2:34 PM, jincheng sun wrote:
> 
> Hi all,
> 
> +1, thanks for bringing up this discussion, Dian!
> 
> Resource management is very important for PyFlink UDFs, so it would be great
> if anyone could add more comments or input in the design doc or give feedback
> on the ML. :)
> 
> Best,
> Jincheng
> 
> On Tue, Nov 5, 2019 at 11:32 AM Dian Fu wrote:
> 
>> Hi everyone,
>> 
>> FLIP-58[1] adds support for Python user-defined stateless functions to the
>> Python Table API. It launches a separate Python process for Python
>> user-defined function execution. The resources used by the Python process
>> should be managed properly by Flink's resource management framework.
>> FLIP-49[2] has proposed a unified memory management framework, and PyFlink
>> user-defined function resource management should be based on it. Jincheng,
>> Hequn, Xintong, GuoWei and I discussed this offline. I have drafted a design
>> doc[3] and would like to start a discussion about PyFlink user-defined
>> function resource management.
>> 
>> Any comments on the design doc, or feedback on the ML directly, are welcome.
>> 
>> Regards,
>> Dian
>> 
>> [1]
>> https://cwiki.apache.org/confluence/display/FLINK/FLIP-58%3A+Flink+Python+User-Defined+Stateless+Function+for+Table
>> [2]
>> https://cwiki.apache.org/confluence/display/FLINK/FLIP-49%3A+Unified+Memory+Configuration+for+TaskExecutors
>> [3]
>> https://docs.google.com/document/d/1LQP8L66Thu2yVv6YRSfmF9EkkMnwhBHGjcTQ11GUmFc/edit#heading=h.4q4ggaftf78m



[jira] [Created] (FLINK-14710) Decide column nullability according to Hive constraints

2019-11-11 Thread Rui Li (Jira)
Rui Li created FLINK-14710:
--

 Summary: Decide column nullability according to Hive constraints
 Key: FLINK-14710
 URL: https://issues.apache.org/jira/browse/FLINK-14710
 Project: Flink
  Issue Type: Improvement
  Components: Connectors / Hive
Reporter: Rui Li






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


Re: [DISCUSS] Releasing Flink 1.8.3

2019-11-11 Thread Hequn Cheng
Hi all,

Thanks a lot for your kind replies and feedback.

@Jincheng Thank you very much for providing the release help.
@Yang Wang Thanks a lot for your PR. I think it is an important fix! I left
some comments in FLINK-13184 and we can discuss whether this is a blocker
there.

I have gone over all the issues of 1.8.3. Currently, there is only one critical
issue:
[Critical]:
[FLINK-13184] Starting a TaskExecutor blocks the YarnResourceManager's main
thread (@Yang Wang is working on it, reviewed by @Rongrong and @Tison)

All issues of this release can also be tracked here:
https://issues.apache.org/jira/projects/FLINK/versions/12346112

Ideally, we can kick off the release vote for the first RC early next week.
If there are any other critical issues for 1.8.3, please let me know here
so that we can take them into account for the release.

Best, Hequn

On Mon, Nov 11, 2019 at 2:27 PM Danny Chan  wrote:

> Thanks Hequn for being in charge of the release!
>
> Best,
> Danny Chan
> On Nov 9, 2019 at 3:59 PM +0800, Hequn Cheng wrote:
> > Hi Jincheng,
> >
> > Thanks for kicking this discussion off!
> >
> > +1 to the 1.8.3 release as it would be nice to have these important fixes
> > and also two
> > months have passed since the last release.
> >
> > Besides, I wonder if I can be the release manager of 1.8.3 or work with
> you
> > together @Jincheng?
> > It's always exciting to help the community as much as possible.
> >
> > Best, Hequn
> >
> > On Sat, Nov 9, 2019 at 12:34 PM Dian Fu  wrote:
> >
> > > Hi Jincheng,
> > >
> > > Thanks a lot for bringing up this discussion. +1 for releasing 1.8.3.
> > >
> > > Regards,
> > > Dian
> > >
> > > On Sat, Nov 9, 2019 at 12:11 PM jincheng sun  >
> > > wrote:
> > >
> > > > Hi Flink devs,
> > > >
> > > > It has been more than 2 months since 1.8.2 was released. So, what do you
> > > > think about releasing Flink 1.8.3 soon?
> > > >
> > > > We already have many important bug fixes in the release-1.8 branch
> (29
> > > > resolved issues).
> > > >
> > > > Most notable fixes are:
> > > >
> > > > - FLINK-14010 Dispatcher & JobManagers don't give up leadership when
> AM
> > > is
> > > > shut down
> > > > - FLINK-14315 NPE with JobMaster.disconnectTaskManager
> > > > - FLINK-12848 Method equals() in RowTypeInfo should consider
> fieldsNames
> > > > - FLINK-12342 Yarn Resource Manager Acquires Too Many Containers
> > > > - FLINK-14589 Redundant slot requests with the same AllocationID
> leads to
> > > > inconsistent slot table
> > > >
> > > > Furthermore, the following critical issue is in progress; maybe we can
> > > > wait for it if it is not too much effort.
> > > >
> > > > - FLINK-13184 Starting a TaskExecutor blocks the
> YarnResourceManager's
> > > main
> > > > thread
> > > >
> > > > Please let me know what you think?
> > > >
> > > > Best,
> > > > Jincheng
> > > >
> > >
>


[jira] [Created] (FLINK-14711) Create Alter Function and Show Functions DDL

2019-11-11 Thread Zhenqiu Huang (Jira)
Zhenqiu Huang created FLINK-14711:
-

 Summary: Create Alter Function and Show Functions DDL
 Key: FLINK-14711
 URL: https://issues.apache.org/jira/browse/FLINK-14711
 Project: Flink
  Issue Type: Sub-task
  Components: Table SQL / API
Affects Versions: 1.10.0
Reporter: Zhenqiu Huang


This is a sibling ticket of https://issues.apache.org/jira/browse/FLINK-7151.

Alter Function Statement

ALTER [TEMPORARY|TEMPORARY SYSTEM] FUNCTION [IF EXISTS] [catalog_name.][db_name.]function_name AS identifier [LANGUAGE JAVA|SCALA|PYTHON];

Show Function Statement

SHOW FUNCTIONS [catalog_name.][db_name]
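
To make the two grammars above concrete, here is a minimal sketch that only renders statement strings of that shape. It is not part of Flink: the class `FunctionDdlSketch`, its enums and both methods are hypothetical names made up for illustration, and the ticket does not say how `identifier` should be quoted, so the sketch passes it through unchanged.

```java
/** Hypothetical helper (not a Flink API) that renders statements following the grammar in this ticket. */
final class FunctionDdlSketch {

    /** Languages listed in the LANGUAGE clause of the ticket. */
    enum Language { JAVA, SCALA, PYTHON }

    /** Assumed mapping of the optional [TEMPORARY|TEMPORARY SYSTEM] modifier. */
    enum Scope { CATALOG, TEMPORARY, TEMPORARY_SYSTEM }

    /** Renders an ALTER FUNCTION statement; pass language == null to omit the LANGUAGE clause. */
    static String alterFunction(
            Scope scope, boolean ifExists, String qualifiedName, String identifier, Language language) {
        StringBuilder sql = new StringBuilder("ALTER ");
        if (scope == Scope.TEMPORARY) {
            sql.append("TEMPORARY ");
        } else if (scope == Scope.TEMPORARY_SYSTEM) {
            sql.append("TEMPORARY SYSTEM ");
        }
        sql.append("FUNCTION ");
        if (ifExists) {
            sql.append("IF EXISTS ");
        }
        // The identifier is appended as given; quoting rules are not defined in the ticket.
        sql.append(qualifiedName).append(" AS ").append(identifier);
        if (language != null) {
            sql.append(" LANGUAGE ").append(language);
        }
        return sql.append(';').toString();
    }

    /** Renders SHOW FUNCTIONS; pass null to omit the optional [catalog_name.][db_name] part. */
    static String showFunctions(String catalogAndDb) {
        return catalogAndDb == null ? "SHOW FUNCTIONS" : "SHOW FUNCTIONS " + catalogAndDb;
    }

    public static void main(String[] args) {
        System.out.println(alterFunction(
                Scope.TEMPORARY, true, "my_catalog.my_db.my_udf", "com.example.MyUdf", Language.JAVA));
        System.out.println(showFunctions("my_catalog.my_db"));
    }
}
```

The catalog, database and class names in `main` are placeholders, not names taken from the ticket.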





--
This message was sent by Atlassian Jira
(v8.3.4#803005)


Re: [VOTE] FLIP-79: Flink Function DDL Support (1.10 Release Feature Only)

2019-11-11 Thread Peter Huang
Thanks, everyone, for joining the discussion and giving feedback! The
voting time for FLIP-79 has passed. I'm closing the vote now.

There were seven +1 votes, 3 of which are binding:
- Bowen Li (binding)
- Kurt Young (binding)
- Shuyi Chen (binding)

- Terry Wang (non-binding)
- Xuefu Zhang (non-binding)
- Vino Yang (non-binding)
- Jingsong Lee (non-binding)

There were no disapproving votes. Thus, FLIP-79 has been accepted. We have now
agreed on the function DDL syntax as listed in the FLIP and on the initial
execution plan for release 1.10. For now, I will mainly work on these two
Jira tickets.

1) https://issues.apache.org/jira/browse/FLINK-7151
2) https://issues.apache.org/jira/browse/FLINK-14711



Best Regards
Peter Huang



On Mon, Nov 11, 2019 at 7:42 PM Jingsong Li  wrote:

> +1 (non-binding)
>
> Best,
> Jingsong Lee
>
> On Tue, Nov 12, 2019 at 9:49 AM vino yang  wrote:
>
> > +1 (non-binding)
> >
> > Best,
> > Vino
> >
> > On Tue, Nov 12, 2019 at 3:27 AM Xuefu Z wrote:
> >
> > > +1 (non-binding)
> > >
> > > On Mon, Nov 11, 2019 at 9:54 AM Shuyi Chen  wrote:
> > >
> > > > +1 (binding)
> > > >
> > > > On Sat, Nov 9, 2019 at 11:17 PM Kurt Young  wrote:
> > > >
> > > > > +1 (binding)
> > > > >
> > > > > Best,
> > > > > Kurt
> > > > >
> > > > >
> > > > > On Sun, Nov 10, 2019 at 12:25 PM Peter Huang <
> > > huangzhenqiu0...@gmail.com
> > > > >
> > > > > wrote:
> > > > >
> > > > > > Hi Yu,
> > > > > >
> > > > > > Thanks for your reminder about the timeline of delivering the
> basic
> > > > > > function DDL in release 1.10.
> > > > > > As I replied to Xuefu, the "CREATE FUNCTION" and "DROP FUNCTION"
> > can
> > > > > > relatively easy achieve by revising the existing PR.
> > > > > > Definitely, I probably need to start to work on a basic version
> of
> > PR
> > > > for
> > > > > > "ALTER FUNCTION" and "SHOW FUNCTIONS".
> > > > > > Please let me know if you have any suggestion to better align the
> > > > > timeline
> > > > > > of the ongoing catalog related efforts.
> > > > > >
> > > > > > Best Regards
> > > > > > Peter Huang
> > > > > >
> > > > > >
> > > > > > On Sat, Nov 9, 2019 at 7:26 PM Yu Li  wrote:
> > > > > >
> > > > > > > Thanks for driving this Peter!
> > > > > > >
> > > > > > > I agree it would be great if we could include this feature in
> > 1.10.
> > > > > > > However, FWIW, since we are following the time-based release
> > policy
> > > > [1]
> > > > > > and
> > > > > > > 1.10 release is approaching its feature freeze (planned to be
> at
> > > the
> > > > > end
> > > > > > of
> > > > > > > November) [2], I'm a little bit concerned about the schedule.
> > > > > > >
> > > > > > > [1]
> > > > > >
> > > https://cwiki.apache.org/confluence/display/FLINK/Time-based+releases
> > > > > > > [2]
> > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Features-for-Apache-Flink-1-10-td32824.html
> > > > > > >
> > > > > > > Best Regards,
> > > > > > > Yu
> > > > > > >
> > > > > > >
> > > > > > > On Sat, 9 Nov 2019 at 04:12, Xuefu Z 
> wrote:
> > > > > > >
> > > > > > > > Hi Peter,
> > > > > > > >
> > > > > > > > Thanks for driving this. I'm all-in for this. However, as I
> > read
> > > > the
> > > > > > > latest
> > > > > > > > FLIP, I have a couple of questions/comments:
> > > > > > > >
> > > > > > > > 1. It seems that "JVM" is proposed as a language type in
> > parallel
> > > > to
> > > > > > > > python. I'm not sure that's very intuitive. JVM stands for
> > "java
> > > > > > virtual
> > > > > > > > machine", so the language is really "JAVA", correct? I know
> > > "scala"
> > > > > is
> > > > > > > also
> > > > > > > > a language which can be generate java byte code that JVM can
> > > > execute.
> > > > > > > >
> > > > > > > > 2. In the flip, "SHOW FUNCTIONS" and "ALTER FUNCTION" are
> > > mentioned
> > > > > > but I
> > > > > > > > don't see there is any implementation plan, either in 1.10 or
> > > > > beyond. I
> > > > > > > > think we could have more clarification on those.
> > > > > > > >
> > > > > > > > Thanks,
> > > > > > > > Xuefu
> > > > > > > >
> > > > > > > > On Fri, Nov 8, 2019 at 10:36 AM Bowen Li <
> bowenl...@gmail.com>
> > > > > wrote:
> > > > > > > >
> > > > > > > > > Peter and I went thru the details and defined scope/plan
> for
> > > 1.10
> > > > > > > offline
> > > > > > > > > in the last few days. +1 (binding) from my side.
> > > > > > > > >
> > > > > > > > > On Fri, Nov 8, 2019 at 12:55 AM Terry Wang <
> > zjuwa...@gmail.com
> > > >
> > > > > > wrote:
> > > > > > > > >
> > > > > > > > > > Thanks Peter driving on this. LGTM for 1.10 release
> > feature.
> > > > > > > > > >
> > > > > > > > > > +1 from my side. (non-binding)
> > > > > > > > > >
> > > > > > > > > > Best,
> > > > > > > > > > Terry Wang
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > > > On Nov 8, 2019, at 13:20, Peter Huang <huangzhenqiu0...@gmail.com> wrote:
> > > > > > > > > > >
> > > > > >

Re: [VOTE] FLIP-79: Flink Function DDL Support (1.10 Release Feature Only)

2019-11-11 Thread Bowen Li
Thanks Peter! I assigned the ticket to you, and I can help with reviewing
and merging PRs of this FLIP.

Cc'ing the 1.10 release manager Yu: we are on track with FLIP-79.

Cheers,
Bowen

On Mon, Nov 11, 2019 at 9:03 PM Peter Huang 
wrote:

> Thanks, everyone for joining the discussion and giving feedback!. The
> voting time for FLIP-79 has passed. I'm closing the vote now.
>
> There were seven +1 votes, 3 of which are binding:
> - Bowen Li (binding)
> - Kurt Young (binding)
> - Shuyi Chen (binding)
>
> - Terry Wang (non-binding)
> - Xuefu Zhang (non-binding)
> - Vino Yang (non-binding)
> - Jingsong Lee (non-binding)
>
> There were no disapproving votes. Thus, FLIP-79 has been accepted. Now, we
> agreed on the function DDL syntax as listed on the FLIP
>  and also the initial execution plan for release 1.10. Currently, I will
> mainly work on these two Jira tickets.
>
> 1) https://issues.apache.org/jira/browse/FLINK-7151
> 2) https://issues.apache.org/jira/browse/FLINK-14711
>
>
>
> Best Regards
> Peter Huang
>
>
> Best Regards
> Peter Huang
>
> On Mon, Nov 11, 2019 at 7:42 PM Jingsong Li 
> wrote:
>
> > +1 (non-binding)
> >
> > Best,
> > Jingsong Lee
> >
> > On Tue, Nov 12, 2019 at 9:49 AM vino yang  wrote:
> >
> > > +1 (non-binding)
> > >
> > > Best,
> > > Vino
> > >
> > > On Tue, Nov 12, 2019 at 3:27 AM Xuefu Z wrote:
> > >
> > > > +1 (non-binding)
> > > >
> > > > On Mon, Nov 11, 2019 at 9:54 AM Shuyi Chen 
> wrote:
> > > >
> > > > > +1 (binding)
> > > > >
> > > > > On Sat, Nov 9, 2019 at 11:17 PM Kurt Young 
> wrote:
> > > > >
> > > > > > +1 (binding)
> > > > > >
> > > > > > Best,
> > > > > > Kurt
> > > > > >
> > > > > >
> > > > > > On Sun, Nov 10, 2019 at 12:25 PM Peter Huang <
> > > > huangzhenqiu0...@gmail.com
> > > > > >
> > > > > > wrote:
> > > > > >
> > > > > > > Hi Yu,
> > > > > > >
> > > > > > > Thanks for your reminder about the timeline of delivering the
> > basic
> > > > > > > function DDL in release 1.10.
> > > > > > > As I replied to Xuefu, the "CREATE FUNCTION" and "DROP
> FUNCTION"
> > > can
> > > > > > > relatively easy achieve by revising the existing PR.
> > > > > > > Definitely, I probably need to start to work on a basic version
> > of
> > > PR
> > > > > for
> > > > > > > "ALTER FUNCTION" and "SHOW FUNCTIONS".
> > > > > > > Please let me know if you have any suggestion to better align
> the
> > > > > > timeline
> > > > > > > of the ongoing catalog related efforts.
> > > > > > >
> > > > > > > Best Regards
> > > > > > > Peter Huang
> > > > > > >
> > > > > > >
> > > > > > > On Sat, Nov 9, 2019 at 7:26 PM Yu Li  wrote:
> > > > > > >
> > > > > > > > Thanks for driving this Peter!
> > > > > > > >
> > > > > > > > I agree it would be great if we could include this feature in
> > > 1.10.
> > > > > > > > However, FWIW, since we are following the time-based release
> > > policy
> > > > > [1]
> > > > > > > and
> > > > > > > > 1.10 release is approaching its feature freeze (planned to be
> > at
> > > > the
> > > > > > end
> > > > > > > of
> > > > > > > > November) [2], I'm a little bit concerned about the schedule.
> > > > > > > >
> > > > > > > > [1]
> > > > > > >
> > > >
> https://cwiki.apache.org/confluence/display/FLINK/Time-based+releases
> > > > > > > > [2]
> > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Features-for-Apache-Flink-1-10-td32824.html
> > > > > > > >
> > > > > > > > Best Regards,
> > > > > > > > Yu
> > > > > > > >
> > > > > > > >
> > > > > > > > On Sat, 9 Nov 2019 at 04:12, Xuefu Z 
> > wrote:
> > > > > > > >
> > > > > > > > > Hi Peter,
> > > > > > > > >
> > > > > > > > > Thanks for driving this. I'm all-in for this. However, as I
> > > read
> > > > > the
> > > > > > > > latest
> > > > > > > > > FLIP, I have a couple of questions/comments:
> > > > > > > > >
> > > > > > > > > 1. It seems that "JVM" is proposed as a language type in
> > > parallel
> > > > > to
> > > > > > > > > python. I'm not sure that's very intuitive. JVM stands for
> > > "java
> > > > > > > virtual
> > > > > > > > > machine", so the language is really "JAVA", correct? I know
> > > > "scala"
> > > > > > is
> > > > > > > > also
> > > > > > > > > a language which can be generate java byte code that JVM
> can
> > > > > execute.
> > > > > > > > >
> > > > > > > > > 2. In the flip, "SHOW FUNCTIONS" and "ALTER FUNCTION" are
> > > > mentioned
> > > > > > > but I
> > > > > > > > > don't see there is any implementation plan, either in 1.10
> or
> > > > > > beyond. I
> > > > > > > > > think we could have more clarification on those.
> > > > > > > > >
> > > > > > > > > Thanks,
> > > > > > > > > Xuefu
> > > > > > > > >
> > > > > > > > > On Fri, Nov 8, 2019 at 10:36 AM Bowen Li <
> > bowenl...@gmail.com>
> > > > > > wrote:
> > > > > > > > >
> > > > > > > > > > Peter and I went thru the details and defined scope/plan
> > for
> > > > 1.10
> > > > > > > > offline
> > > > > > > > > > in the last few days. +1 (binding) from my side.
> > >

Re: [VOTE] FLIP-79: Flink Function DDL Support (1.10 Release Feature Only)

2019-11-11 Thread Peter Huang
Thank you in advance. Once the PR is ready on my side, I will @ you and Yu
for review.

On Mon, Nov 11, 2019 at 9:15 PM Bowen Li  wrote:

> Thanks Peter! I assigned the ticket to you, and I can help with reviewing
> and merging PRs of this FLIP.
>
> Ccing 1.10 release manager Yu that we are on track of FLIP-79.
>
> Cheers,
> Bowen
>
> On Mon, Nov 11, 2019 at 9:03 PM Peter Huang 
> wrote:
>
> > Thanks, everyone for joining the discussion and giving feedback!. The
> > voting time for FLIP-79 has passed. I'm closing the vote now.
> >
> > There were seven +1 votes, 3 of which are binding:
> > - Bowen Li (binding)
> > - Kurt Young (binding)
> > - Shuyi Chen (binding)
> >
> > - Terry Wang (non-binding)
> > - Xuefu Zhang (non-binding)
> > - Vino Yang (non-binding)
> > - Jingsong Lee (non-binding)
> >
> > There were no disapproving votes. Thus, FLIP-79 has been accepted. Now,
> we
> > agreed on the function DDL syntax as listed on the FLIP
> >  and also the initial execution plan for release 1.10. Currently, I will
> > mainly work on these two Jira tickets.
> >
> > 1) https://issues.apache.org/jira/browse/FLINK-7151
> > 2) https://issues.apache.org/jira/browse/FLINK-14711
> >
> >
> >
> > Best Regards
> > Peter Huang
> >
> >
> > Best Regards
> > Peter Huang
> >
> > On Mon, Nov 11, 2019 at 7:42 PM Jingsong Li 
> > wrote:
> >
> > > +1 (non-binding)
> > >
> > > Best,
> > > Jingsong Lee
> > >
> > > On Tue, Nov 12, 2019 at 9:49 AM vino yang 
> wrote:
> > >
> > > > +1 (non-binding)
> > > >
> > > > Best,
> > > > Vino
> > > >
> > > > On Tue, Nov 12, 2019 at 3:27 AM Xuefu Z wrote:
> > > >
> > > > > +1 (non-binding)
> > > > >
> > > > > On Mon, Nov 11, 2019 at 9:54 AM Shuyi Chen 
> > wrote:
> > > > >
> > > > > > +1 (binding)
> > > > > >
> > > > > > On Sat, Nov 9, 2019 at 11:17 PM Kurt Young 
> > wrote:
> > > > > >
> > > > > > > +1 (binding)
> > > > > > >
> > > > > > > Best,
> > > > > > > Kurt
> > > > > > >
> > > > > > >
> > > > > > > On Sun, Nov 10, 2019 at 12:25 PM Peter Huang <
> > > > > huangzhenqiu0...@gmail.com
> > > > > > >
> > > > > > > wrote:
> > > > > > >
> > > > > > > > Hi Yu,
> > > > > > > >
> > > > > > > > Thanks for your reminder about the timeline of delivering the
> > > basic
> > > > > > > > function DDL in release 1.10.
> > > > > > > > As I replied to Xuefu, the "CREATE FUNCTION" and "DROP
> > FUNCTION"
> > > > can
> > > > > > > > relatively easy achieve by revising the existing PR.
> > > > > > > > Definitely, I probably need to start to work on a basic
> version
> > > of
> > > > PR
> > > > > > for
> > > > > > > > "ALTER FUNCTION" and "SHOW FUNCTIONS".
> > > > > > > > Please let me know if you have any suggestion to better align
> > the
> > > > > > > timeline
> > > > > > > > of the ongoing catalog related efforts.
> > > > > > > >
> > > > > > > > Best Regards
> > > > > > > > Peter Huang
> > > > > > > >
> > > > > > > >
> > > > > > > > On Sat, Nov 9, 2019 at 7:26 PM Yu Li 
> wrote:
> > > > > > > >
> > > > > > > > > Thanks for driving this Peter!
> > > > > > > > >
> > > > > > > > > I agree it would be great if we could include this feature
> in
> > > > 1.10.
> > > > > > > > > However, FWIW, since we are following the time-based
> release
> > > > policy
> > > > > > [1]
> > > > > > > > and
> > > > > > > > > 1.10 release is approaching its feature freeze (planned to
> be
> > > at
> > > > > the
> > > > > > > end
> > > > > > > > of
> > > > > > > > > November) [2], I'm a little bit concerned about the
> schedule.
> > > > > > > > >
> > > > > > > > > [1]
> > > > > > > >
> > > > >
> > https://cwiki.apache.org/confluence/display/FLINK/Time-based+releases
> > > > > > > > > [2]
> > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Features-for-Apache-Flink-1-10-td32824.html
> > > > > > > > >
> > > > > > > > > Best Regards,
> > > > > > > > > Yu
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > On Sat, 9 Nov 2019 at 04:12, Xuefu Z 
> > > wrote:
> > > > > > > > >
> > > > > > > > > > Hi Peter,
> > > > > > > > > >
> > > > > > > > > > Thanks for driving this. I'm all-in for this. However,
> as I
> > > > read
> > > > > > the
> > > > > > > > > latest
> > > > > > > > > > FLIP, I have a couple of questions/comments:
> > > > > > > > > >
> > > > > > > > > > 1. It seems that "JVM" is proposed as a language type in
> > > > parallel
> > > > > > to
> > > > > > > > > > python. I'm not sure that's very intuitive. JVM stands
> for
> > > > "java
> > > > > > > > virtual
> > > > > > > > > > machine", so the language is really "JAVA", correct? I
> know
> > > > > "scala"
> > > > > > > is
> > > > > > > > > also
> > > > > > > > > > a language which can be generate java byte code that JVM
> > can
> > > > > > execute.
> > > > > > > > > >
> > > > > > > > > > 2. In the flip, "SHOW FUNCTIONS" and "ALTER FUNCTION" are
> > > > > mentioned
> > > > > > > > but I
> > > > > > > > > > don't see there is any implementation plan, either in
> 1.10
> > 

[jira] [Created] (FLINK-14712) Add NetWork metric for IOMetricsInfo

2019-11-11 Thread lining (Jira)
lining created FLINK-14712:
--

 Summary: Add NetWork metric for IOMetricsInfo
 Key: FLINK-14712
 URL: https://issues.apache.org/jira/browse/FLINK-14712
 Project: Flink
  Issue Type: Bug
  Components: Runtime / Metrics, Runtime / Network, Runtime / REST
Reporter: lining
 Attachments: image-2019-11-12-14-30-16-130.png

h4. (1) The current monitor is heavy-weight.
 * Backpressure monitoring works by repeatedly taking stack trace samples of 
your running tasks.

h4. (2) It is difficult to find out which vertex is the source of backpressure.
 * Users need to know the network metrics of the current vertex and of its 
upstream vertices to judge whether the current vertex is the source of 
backpressure. Today users have to record this information themselves.

h3. Proposed Changes

Update IOMetricsInfo to add outPoolUsage, inputExclusiveBuffersUsage and 
inputFloatingBuffersUsage:

public final class IOMetricsInfo {
    private final float outPoolUsage;
    private final float inputExclusiveBuffersUsage;
    private final float inputFloatingBuffersUsage;
}

JobDetailsInfo.JobVertexDetailsInfo merges these values using Math.max (note: 
outPoolUsage is taken from the upstream vertex).

According to  !image-2019-11-12-14-30-16-130.png!
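
The Math.max merge mentioned above could look roughly like the sketch below. It is only an illustration under assumptions: `NetworkUsage` is a hypothetical stand-in for the three proposed fields, `mergeByMax` is an invented helper name, and taking the maximum across subtasks is my reading of "merge use Math.max", not something the ticket spells out.

```java
import java.util.Arrays;
import java.util.List;

/** Illustration only; none of these names exist in Flink. */
final class IoMetricsMergeSketch {

    /** Hypothetical stand-in holding the three usage values proposed for IOMetricsInfo. */
    static final class NetworkUsage {
        final float outPoolUsage;
        final float inputExclusiveBuffersUsage;
        final float inputFloatingBuffersUsage;

        NetworkUsage(float outPoolUsage, float inputExclusiveBuffersUsage, float inputFloatingBuffersUsage) {
            this.outPoolUsage = outPoolUsage;
            this.inputExclusiveBuffersUsage = inputExclusiveBuffersUsage;
            this.inputFloatingBuffersUsage = inputFloatingBuffersUsage;
        }
    }

    /** Merges per-subtask values into one vertex-level value by taking the maximum of each field. */
    static NetworkUsage mergeByMax(List<NetworkUsage> subtasks) {
        float out = 0f;
        float exclusive = 0f;
        float floating = 0f;
        for (NetworkUsage usage : subtasks) {
            out = Math.max(out, usage.outPoolUsage);
            exclusive = Math.max(exclusive, usage.inputExclusiveBuffersUsage);
            floating = Math.max(floating, usage.inputFloatingBuffersUsage);
        }
        return new NetworkUsage(out, exclusive, floating);
    }

    public static void main(String[] args) {
        NetworkUsage merged = mergeByMax(Arrays.asList(
                new NetworkUsage(0.2f, 0.9f, 0.1f),
                new NetworkUsage(0.7f, 0.3f, 0.4f)));
        System.out.println(merged.outPoolUsage + " "
                + merged.inputExclusiveBuffersUsage + " "
                + merged.inputFloatingBuffersUsage);
    }
}
```

Using the maximum means a single saturated subtask is enough to flag the whole vertex, which seems to match the goal of locating the source of backpressure.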



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (FLINK-14713) Show Attempt History in Vertex SubTask

2019-11-11 Thread lining (Jira)
lining created FLINK-14713:
--

 Summary: Show Attempt History in Vertex SubTask
 Key: FLINK-14713
 URL: https://issues.apache.org/jira/browse/FLINK-14713
 Project: Flink
  Issue Type: Improvement
  Components: Runtime / REST, Runtime / Web Frontend
Reporter: lining


Flink jobs can recover through failover, but the user cannot see any 
information about the failures, because nothing is exposed about the failed 
attempts.
h3. Proposed Changes
h4. Add SubtaskAllExecutionAttemptsDetailsHandler for failed attempts
 * return all attempts of a subtask and their states
 * get the prior attempts as follows:

{code:java}
final AccessExecution execution = executionVertex.getCurrentExecutionAttempt();
final int currentAttemptNum = execution.getAttemptNumber();

if (currentAttemptNum > 0) {
    // Walk back from the most recent prior attempt down to attempt 0.
    for (int i = currentAttemptNum - 1; i >= 0; i--) {
        final AccessExecution priorExecution = executionVertex.getPriorExecutionAttempt(i);
        // Skip attempts for which no execution is returned.
        if (priorExecution != null) {
            allAttempts.add(
                SubtaskExecutionAttemptDetailsInfo.create(priorExecution, metricFetcher, jobID, jobVertexID));
        }
    }
}
{code}
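
For readers without the Flink sources at hand, the same backward walk can be tried out in isolation. The sketch below is not Flink code: `VertexStub` is a hypothetical stand-in for the execution vertex, and it simply assumes that a lookup for a prior attempt may return null.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustration only; the stub types are hypothetical and not part of Flink. */
final class AttemptHistorySketch {

    /** Stand-in for an execution vertex that may no longer hold every prior attempt. */
    static final class VertexStub {
        private final int currentAttemptNumber;
        private final Map<Integer, String> priorAttemptStates = new HashMap<>();

        VertexStub(int currentAttemptNumber) {
            this.currentAttemptNumber = currentAttemptNumber;
        }

        void retainPriorAttempt(int attemptNumber, String state) {
            priorAttemptStates.put(attemptNumber, state);
        }

        /** Returns the state of a prior attempt, or null if it is not available. */
        String getPriorAttemptState(int attemptNumber) {
            return priorAttemptStates.get(attemptNumber);
        }
    }

    /** Collects the available prior attempts, newest first, skipping missing ones. */
    static List<String> collectPriorAttempts(VertexStub vertex) {
        List<String> result = new ArrayList<>();
        for (int i = vertex.currentAttemptNumber - 1; i >= 0; i--) {
            String state = vertex.getPriorAttemptState(i);
            if (state != null) {
                result.add("attempt " + i + ": " + state);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        VertexStub vertex = new VertexStub(3);
        vertex.retainPriorAttempt(2, "FAILED");
        vertex.retainPriorAttempt(0, "CANCELED");
        System.out.println(collectPriorAttempts(vertex));
    }
}
```

The null check is what keeps the loop safe when an attempt number has no stored execution, which is why the snippet above guards each lookup as well.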
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


Re: [VOTE] FLIP-79: Flink Function DDL Support (1.10 Release Feature Only)

2019-11-11 Thread Yu Li
Thanks Bowen and Peter! Great to know.

Best Regards,
Yu


On Tue, 12 Nov 2019 at 13:38, Peter Huang 
wrote:

> Thank you in advance. Once the PR is done from my side, I will @ you and
> yu for review.
>
> On Mon, Nov 11, 2019 at 9:15 PM Bowen Li  wrote:
>
>> Thanks Peter! I assigned the ticket to you, and I can help with reviewing
>> and merging PRs of this FLIP.
>>
>> Ccing 1.10 release manager Yu that we are on track of FLIP-79.
>>
>> Cheers,
>> Bowen
>>
>> On Mon, Nov 11, 2019 at 9:03 PM Peter Huang 
>> wrote:
>>
>> > Thanks, everyone for joining the discussion and giving feedback!. The
>> > voting time for FLIP-79 has passed. I'm closing the vote now.
>> >
>> > There were seven +1 votes, 3 of which are binding:
>> > - Bowen Li (binding)
>> > - Kurt Young (binding)
>> > - Shuyi Chen (binding)
>> >
>> > - Terry Wang (non-binding)
>> > - Xuefu Zhang (non-binding)
>> > - Vino Yang (non-binding)
>> > - Jingsong Lee (non-binding)
>> >
>> > There were no disapproving votes. Thus, FLIP-79 has been accepted. Now,
>> we
>> > agreed on the function DDL syntax as listed on the FLIP
>> >  and also the initial execution plan for release 1.10. Currently, I will
>> > mainly work on these two Jira tickets.
>> >
>> > 1) https://issues.apache.org/jira/browse/FLINK-7151
>> > 2) https://issues.apache.org/jira/browse/FLINK-14711
>> >
>> >
>> >
>> > Best Regards
>> > Peter Huang
>> >
>> >
>> > Best Regards
>> > Peter Huang
>> >
>> > On Mon, Nov 11, 2019 at 7:42 PM Jingsong Li 
>> > wrote:
>> >
>> > > +1 (non-binding)
>> > >
>> > > Best,
>> > > Jingsong Lee
>> > >
>> > > On Tue, Nov 12, 2019 at 9:49 AM vino yang 
>> wrote:
>> > >
>> > > > +1 (non-binding)
>> > > >
>> > > > Best,
>> > > > Vino
>> > > >
>> > > > On Tue, Nov 12, 2019 at 3:27 AM Xuefu Z wrote:
>> > > >
>> > > > > +1 (non-binding)
>> > > > >
>> > > > > On Mon, Nov 11, 2019 at 9:54 AM Shuyi Chen 
>> > wrote:
>> > > > >
>> > > > > > +1 (binding)
>> > > > > >
>> > > > > > On Sat, Nov 9, 2019 at 11:17 PM Kurt Young 
>> > wrote:
>> > > > > >
>> > > > > > > +1 (binding)
>> > > > > > >
>> > > > > > > Best,
>> > > > > > > Kurt
>> > > > > > >
>> > > > > > >
>> > > > > > > On Sun, Nov 10, 2019 at 12:25 PM Peter Huang <
>> > > > > huangzhenqiu0...@gmail.com
>> > > > > > >
>> > > > > > > wrote:
>> > > > > > >
>> > > > > > > > Hi Yu,
>> > > > > > > >
>> > > > > > > > Thanks for your reminder about the timeline of delivering
>> the
>> > > basic
>> > > > > > > > function DDL in release 1.10.
>> > > > > > > > As I replied to Xuefu, the "CREATE FUNCTION" and "DROP
>> > FUNCTION"
>> > > > can
>> > > > > > > > relatively easy achieve by revising the existing PR.
>> > > > > > > > Definitely, I probably need to start to work on a basic
>> version
>> > > of
>> > > > PR
>> > > > > > for
>> > > > > > > > "ALTER FUNCTION" and "SHOW FUNCTIONS".
>> > > > > > > > Please let me know if you have any suggestion to better
>> align
>> > the
>> > > > > > > timeline
>> > > > > > > > of the ongoing catalog related efforts.
>> > > > > > > >
>> > > > > > > > Best Regards
>> > > > > > > > Peter Huang
>> > > > > > > >
>> > > > > > > >
>> > > > > > > > On Sat, Nov 9, 2019 at 7:26 PM Yu Li 
>> wrote:
>> > > > > > > >
>> > > > > > > > > Thanks for driving this Peter!
>> > > > > > > > >
>> > > > > > > > > I agree it would be great if we could include this
>> feature in
>> > > > 1.10.
>> > > > > > > > > However, FWIW, since we are following the time-based
>> release
>> > > > policy
>> > > > > > [1]
>> > > > > > > > and
>> > > > > > > > > 1.10 release is approaching its feature freeze (planned
>> to be
>> > > at
>> > > > > the
>> > > > > > > end
>> > > > > > > > of
>> > > > > > > > > November) [2], I'm a little bit concerned about the
>> schedule.
>> > > > > > > > >
>> > > > > > > > > [1]
>> > > > > > > >
>> > > > >
>> > https://cwiki.apache.org/confluence/display/FLINK/Time-based+releases
>> > > > > > > > > [2]
>> > > > > > > > >
>> > > > > > > > >
>> > > > > > > >
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>> >
>> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Features-for-Apache-Flink-1-10-td32824.html
>> > > > > > > > >
>> > > > > > > > > Best Regards,
>> > > > > > > > > Yu
>> > > > > > > > >
>> > > > > > > > >
>> > > > > > > > > On Sat, 9 Nov 2019 at 04:12, Xuefu Z 
>> > > wrote:
>> > > > > > > > >
>> > > > > > > > > > Hi Peter,
>> > > > > > > > > >
>> > > > > > > > > > Thanks for driving this. I'm all-in for this. However,
>> as I
>> > > > read
>> > > > > > the
>> > > > > > > > > latest
>> > > > > > > > > > FLIP, I have a couple of questions/comments:
>> > > > > > > > > >
>> > > > > > > > > > 1. It seems that "JVM" is proposed as a language type in
>> > > > parallel
>> > > > > > to
>> > > > > > > > > > python. I'm not sure that's very intuitive. JVM stands
>> for
>> > > > "java
>> > > > > > > > virtual
>> > > > > > > > > > machine", so the language is really "JAVA", correct? I
>> know
>> > > > > "scala"
>> > > > > > > is
>> > > > > > > > > also
>> > > > > > > > > > a language whic

[jira] [Created] (FLINK-14714) BytesHashMap should not warn invoking stack when it want to spill

2019-11-11 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-14714:


 Summary: BytesHashMap should not warn invoking stack when it want 
to spill
 Key: FLINK-14714
 URL: https://issues.apache.org/jira/browse/FLINK-14714
 Project: Flink
  Issue Type: Improvement
  Components: Table SQL / Runtime
Reporter: Jingsong Lee
 Fix For: 1.10.0


Currently BytesHashMap logs a warning together with the invoking stack trace 
whenever it wants to spill, which leads to a lot of messages in the logs.

We should remove the stack trace and just log the warning.
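
The difference between the two logging styles can be sketched as follows. This is a stand-alone illustration that uses java.util.logging so it runs without extra dependencies; it is not the BytesHashMap code, and the class name, method names and message text are all hypothetical.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

/** Illustration only; not the actual BytesHashMap logging code. */
final class SpillWarningSketch {

    /** Noisy variant: attaching a Throwable makes the logger print a full stack trace. */
    static void warnWithStack(Logger log, String reason) {
        log.log(Level.WARNING, "Hash map wants to spill: " + reason, new RuntimeException(reason));
    }

    /** Quiet variant: the same warning, with the reason passed as a message parameter only. */
    static void warnWithoutStack(Logger log, String reason) {
        log.log(Level.WARNING, "Hash map wants to spill: {0}", reason);
    }

    public static void main(String[] args) {
        Logger log = Logger.getLogger(SpillWarningSketch.class.getName());
        warnWithStack(log, "not enough memory");
        warnWithoutStack(log, "not enough memory");
    }
}
```

With the quiet variant each spill produces one log line, so frequent spilling no longer floods the logs with stack traces.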



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (FLINK-14715) HiveModuleTest fails with Hive-3.1.1

2019-11-11 Thread Rui Li (Jira)
Rui Li created FLINK-14715:
--

 Summary: HiveModuleTest fails with Hive-3.1.1
 Key: FLINK-14715
 URL: https://issues.apache.org/jira/browse/FLINK-14715
 Project: Flink
  Issue Type: Test
  Components: Connectors / Hive
Reporter: Rui Li


[ERROR] Failures:
[ERROR] HiveModuleTest.testNumberOfBuiltinFunctions:62 expected:<287> but 
was:<306>



--
This message was sent by Atlassian Jira
(v8.3.4#803005)