[jira] [Created] (FLINK-11115) Port some flink.ml algorithms to table based

2018-12-09 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-5: Summary: Port some flink.ml algorithms to table based Key: FLINK-5 URL: https://issues.apache.org/jira/browse/FLINK-5 Project: Flink Issue Type: Sub

[jira] [Created] (FLINK-11114) Support wrapping inference pipeline as a UDF function in SQL

2018-12-09 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-4: Summary: Support wrapping inference pipeline as a UDF function in SQL Key: FLINK-4 URL: https://issues.apache.org/jira/browse/FLINK-4 Project: Flink

[jira] [Created] (FLINK-11113) Support periodically update models when inferencing

2018-12-09 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-3: Summary: Support periodically update models when inferencing Key: FLINK-3 URL: https://issues.apache.org/jira/browse/FLINK-3 Project: Flink Issue

[jira] [Created] (FLINK-11112) Support pipeline import/export

2018-12-09 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-2: Summary: Support pipeline import/export Key: FLINK-2 URL: https://issues.apache.org/jira/browse/FLINK-2 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-11111) Create a new set of parameters

2018-12-09 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-1: Summary: Create a new set of parameters Key: FLINK-1 URL: https://issues.apache.org/jira/browse/FLINK-1 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-11108) Create a new set of table based ML Pipeline classes

2018-12-09 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-11108: Summary: Create a new set of table based ML Pipeline classes Key: FLINK-11108 URL: https://issues.apache.org/jira/browse/FLINK-11108 Project: Flink Issue

[jira] [Created] (FLINK-11110) Support pipeline stage type inference

2018-12-09 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-0: Summary: Support pipeline stage type inference Key: FLINK-0 URL: https://issues.apache.org/jira/browse/FLINK-0 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-11109) Create Table based optimizers

2018-12-09 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-11109: Summary: Create Table based optimizers Key: FLINK-11109 URL: https://issues.apache.org/jira/browse/FLINK-11109 Project: Flink Issue Type: Sub-task

[jira] [Created] (FLINK-11096) Create a new table based flink ML package

2018-12-07 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-11096: Summary: Create a new table based flink ML package Key: FLINK-11096 URL: https://issues.apache.org/jira/browse/FLINK-11096 Project: Flink Issue Type: Sub

[jira] [Created] (FLINK-11095) Table based ML Pipeline

2018-12-07 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-11095: Summary: Table based ML Pipeline Key: FLINK-11095 URL: https://issues.apache.org/jira/browse/FLINK-11095 Project: Flink Issue Type: New Feature

Re: [DISCUSS] Embracing Table API in Flink ML

2018-12-04 Thread Weihua Jiang
und map-reduce) > > Hope it makes sense. BTW, xgboost (most popular ML competition framework) > has very primitive flink support, might worth check out. > https://github.com/dmlc/xgboost > > Chen > > On Tue, Nov 20, 2018 at 6:13 PM Weihua Jiang > wrote: > > > Hi Y

Re: [DISCUSS]Enhancing flink scheduler by implementing blacklist mechanism

2018-11-27 Thread Weihua Jiang
This is a quite useful feature for production use. I once encountered such a case in production cluster and the Storm jobs used 2 hours to stabilize. After that, we implemented similar blacklist solution for storm. The design doc looks good to me. Some minor suggestions about blacklist removal: in

Re: [DISCUSS] Support Interactive Programming in Flink Table API

2018-11-20 Thread Weihua Jiang
Hi Becket, The design is quite interesting and useful. I have several questions about your design: 1. Shall we add some persistence level hint to cache() function for different temperature data? E.g. IN_MEM, IN_DISK, etc, or HOTTEST, HOT, WARM, COLD? 2. When will the corresponding cached data be

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Weihua Jiang
eline by eliminating the > requirement to load the models from external file systems. > > I will read the design doc more carefully. Thanks again for sharing > the design doc! > > Yours sincerely > Yun Gao > > > -

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Weihua Jiang
s. Let us leave it for > a while before the new interface is completed and well adopted. Then > we can deprecate the old ones. > > I will take a more thorough look at your proposal and leave comments > directly on the doc. > > Regards, > Shaoxuan > > > On 11/20/18, Weih

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Weihua Jiang
ter. > > > > I think It's grateful if you can add a phased implementation plan in > google > > doc. What to do you think? > > > > Thanks, > > Jincheng > > > > > > Weihua Jiang 于2018年11月20日周二 下午8:53写道: > > > > >

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Weihua Jiang
ink It's grateful if you can add a phased implementation plan in google > doc. What to do you think? > > Thanks, > Jincheng > > > Weihua Jiang 于2018年11月20日周二 下午8:53写道: > > > ML Pipeline is the idea brought by Scikit-learn > > <https://arxiv.org/abs/1309.0238>. Bot

[DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Weihua Jiang
ML Pipeline is the idea brought by Scikit-learn . Both Spark and Flink has borrowed this idea and made their own implementations [Spark ML Pipeline , Flink ML Pipeline

[jira] [Created] (FLINK-8020) Deadlock found in Flink Streaming job

2017-11-08 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-8020: --- Summary: Deadlock found in Flink Streaming job Key: FLINK-8020 URL: https://issues.apache.org/jira/browse/FLINK-8020 Project: Flink Issue Type: Bug

[jira] [Created] (FLINK-8019) Flink streaming job stopped at consuming Kafka data

2017-11-07 Thread Weihua Jiang (JIRA)
Weihua Jiang created FLINK-8019: --- Summary: Flink streaming job stopped at consuming Kafka data Key: FLINK-8019 URL: https://issues.apache.org/jira/browse/FLINK-8019 Project: Flink Issue Type