[jira] [Created] (FLINK-32598) Spill data from feedback edge to disk to avoid possible OOM

2023-07-17 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-32598: - Summary: Spill data from feedback edge to disk to avoid possible OOM Key: FLINK-32598 URL: https://issues.apache.org/jira/browse/FLINK-32598 Project: Flink

Re: [VOTE] Apache Flink ML Release 2.3.0, release candidate #1

2023-06-29 Thread Zhipeng Zhang
Thanks Dong and Xin for driving this release. +1 (non-binding) - Verified that the checksums and GPG files. - Verified that the source distributions do not contain any binaries. - Browsed through JIRA release notes files. - Browsed through README.md files. Xin Jiang 于2023年6月29日周四 12:08写道: > > H

[jira] [Created] (FLINK-32293) Support vector with long index

2023-06-08 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-32293: - Summary: Support vector with long index Key: FLINK-32293 URL: https://issues.apache.org/jira/browse/FLINK-32293 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-32292) TableUtils.getRowTypeInfo fails to get type information of Tuple

2023-06-08 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-32292: - Summary: TableUtils.getRowTypeInfo fails to get type information of Tuple Key: FLINK-32292 URL: https://issues.apache.org/jira/browse/FLINK-32292 Project: Flink

[jira] [Created] (FLINK-31910) Using BroadcastUtils#withBroadcast in iteration perround mode got stuck

2023-04-24 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31910: - Summary: Using BroadcastUtils#withBroadcast in iteration perround mode got stuck Key: FLINK-31910 URL: https://issues.apache.org/jira/browse/FLINK-31910 Project

[jira] [Created] (FLINK-31909) Using BroadcastUtils#withBroadcast in iteration perround mode got stuck

2023-04-24 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31909: - Summary: Using BroadcastUtils#withBroadcast in iteration perround mode got stuck Key: FLINK-31909 URL: https://issues.apache.org/jira/browse/FLINK-31909 Project

[jira] [Created] (FLINK-31903) Caching records fails in BroadcastUtils#withBroadcastStream

2023-04-24 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31903: - Summary: Caching records fails in BroadcastUtils#withBroadcastStream Key: FLINK-31903 URL: https://issues.apache.org/jira/browse/FLINK-31903 Project: Flink

[jira] [Created] (FLINK-31901) AbstractBroadcastWrapperOperator should not block checkpoint barriers when processing cached records

2023-04-23 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31901: - Summary: AbstractBroadcastWrapperOperator should not block checkpoint barriers when processing cached records Key: FLINK-31901 URL: https://issues.apache.org/jira/browse/FLINK

Re: [ANNOUNCE] New Apache Flink PMC Member - Leonard Xu

2023-04-23 Thread Zhipeng Zhang
Congratulations, Leonard. Hang Ruan 于2023年4月23日周日 19:03写道: > > Congratulations, Leonard. > > Best, > Hang > > Yanfei Lei 于2023年4月23日周日 18:34写道: > > > Congratulations, Leonard! > > > > Best, > > Yanfei > > > > liu ron 于2023年4月23日周日 17:45写道: > > > > > > Congratulations, Leonard. > > > > > > Best,

Re: [ANNOUNCE] New Apache Flink PMC Member - Qingsheng Ren

2023-04-23 Thread Zhipeng Zhang
Congratulations, Qingsheng! Hang Ruan 于2023年4月23日周日 19:03写道: > > Congratulations, Qingsheng! > > Best, > Hang > > Yanfei Lei 于2023年4月23日周日 18:33写道: > > > Congratulations, Qingsheng! > > > > Best, > > Yanfei > > > > liu ron 于2023年4月23日周日 17:47写道: > > > > > > Congratulations, Qingsheng. > > > > >

Re: [VOTE] Apache Flink ML Release 2.2.0, release candidate #2

2023-04-13 Thread Zhipeng Zhang
Hi Dong, Thanks for driving this release! +1 (non-binding) Here is what I have checked. - Verified that the checksums and GPG files. - Verified that the source distributions do not contain any binaries. - Built the source distribution and run all unit tests. - Verified that all POM files point to

[jira] [Created] (FLINK-31732) flink-ml-uber module should include statefun as a dependency

2023-04-04 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31732: - Summary: flink-ml-uber module should include statefun as a dependency Key: FLINK-31732 URL: https://issues.apache.org/jira/browse/FLINK-31732 Project: Flink

Re: [DISCUSS] Releasing Flink ML 2.2.0

2023-03-30 Thread Zhipeng Zhang
Hi Dong, Thanks for starting the discussion. +1 for the Flink ML 2.1.0 release.

[jira] [Created] (FLINK-31410) ListStateWithCache Should support incremental snapshot

2023-03-12 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31410: - Summary: ListStateWithCache Should support incremental snapshot Key: FLINK-31410 URL: https://issues.apache.org/jira/browse/FLINK-31410 Project: Flink

[jira] [Created] (FLINK-31374) ProxyStreamPartitioner should implement ConfigurableStreamPartitioner

2023-03-08 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31374: - Summary: ProxyStreamPartitioner should implement ConfigurableStreamPartitioner Key: FLINK-31374 URL: https://issues.apache.org/jira/browse/FLINK-31374 Project

[jira] [Created] (FLINK-31373) PerRoundWrapperOperator should carry epoch information in watermark

2023-03-08 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31373: - Summary: PerRoundWrapperOperator should carry epoch information in watermark Key: FLINK-31373 URL: https://issues.apache.org/jira/browse/FLINK-31373 Project: Flink

[jira] [Created] (FLINK-31276) VectorIndexerTest#testFitAndPredictWithHandleInvalid fails

2023-03-01 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31276: - Summary: VectorIndexerTest#testFitAndPredictWithHandleInvalid fails Key: FLINK-31276 URL: https://issues.apache.org/jira/browse/FLINK-31276 Project: Flink

[jira] [Created] (FLINK-31255) OperatorUtils#createWrappedOperatorConfig fails to wrap operator config

2023-02-28 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31255: - Summary: OperatorUtils#createWrappedOperatorConfig fails to wrap operator config Key: FLINK-31255 URL: https://issues.apache.org/jira/browse/FLINK-31255 Project

Re: [VOTE] FLIP-289: Support online inference (Flink ML)

2023-02-23 Thread Zhipeng Zhang
ussion open for at least 72 hours before merging the > PR. > > Thanks, > Dong > > > On Thu, Feb 16, 2023 at 9:42 PM Dong Lin wrote: > > > Thank you all for the votes! > > > > The vote is now closed. I will announce the results in a separate email.

[jira] [Created] (FLINK-31191) VectorIndexer should check whether doublesByColumn is null before snapshot

2023-02-22 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31191: - Summary: VectorIndexer should check whether doublesByColumn is null before snapshot Key: FLINK-31191 URL: https://issues.apache.org/jira/browse/FLINK-31191 Project

[jira] [Created] (FLINK-31173) TailOperator should only have one input

2023-02-21 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31173: - Summary: TailOperator should only have one input Key: FLINK-31173 URL: https://issues.apache.org/jira/browse/FLINK-31173 Project: Flink Issue Type: Bug

[jira] [Created] (FLINK-31160) Support join/cogroup in BroadcastUtils.withBroadcastStream

2023-02-20 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-31160: - Summary: Support join/cogroup in BroadcastUtils.withBroadcastStream Key: FLINK-31160 URL: https://issues.apache.org/jira/browse/FLINK-31160 Project: Flink

Re: [ANNOUNCE] New Apache Flink PMC Member - Dong Lin

2023-02-16 Thread Zhipeng Zhang
Congratulations, Dong! Yun Tang 于2023年2月17日周五 10:37写道: > > Congratulations, Dong! > > Best > Yun Tang > > From: Yuxin Tan > Sent: Friday, February 17, 2023 10:19 > To: dev@flink.apache.org > Subject: Re: [ANNOUNCE] New Apache Flink PMC Member - Dong Lin > > Cong

Re: [VOTE] FLIP-289: Support online inference (Flink ML)

2023-02-16 Thread Zhipeng Zhang
+1 (binding) Regards, Zhipeng Dian Fu 于2023年2月13日周一 20:21写道: > > +1 (binding) > > Regards, > Dian > > On Mon, Feb 13, 2023 at 11:04 AM Dong Lin wrote: > > > Hi all, > > > > We would like to start the vote for FLIP-289: Support online inference > > (Flink ML) [1]. This FLIP was discussed in this

[jira] [Created] (FLINK-30933) Result of join inside iterationBody loses max watermark

2023-02-06 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-30933: - Summary: Result of join inside iterationBody loses max watermark Key: FLINK-30933 URL: https://issues.apache.org/jira/browse/FLINK-30933 Project: Flink

[jira] [Created] (FLINK-30671) Add AlgoOperator for ClusteringEvaluator

2023-01-12 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-30671: - Summary: Add AlgoOperator for ClusteringEvaluator Key: FLINK-30671 URL: https://issues.apache.org/jira/browse/FLINK-30671 Project: Flink Issue Type

[jira] [Created] (FLINK-30566) Add benchmark configurations for agglomerativeclustering, hashingtf, idf, kbinsdiscretizer, linearregression, linearsvc, logisticregression, ngram, regextokenizer, toke

2023-01-04 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-30566: - Summary: Add benchmark configurations for agglomerativeclustering, hashingtf, idf, kbinsdiscretizer, linearregression, linearsvc, logisticregression, ngram, regextokenizer, tokenizer and vectorindexer

[jira] [Created] (FLINK-30541) Add Transformer and Estimator for OnlineStandardScaler

2023-01-02 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-30541: - Summary: Add Transformer and Estimator for OnlineStandardScaler Key: FLINK-30541 URL: https://issues.apache.org/jira/browse/FLINK-30541 Project: Flink

[jira] [Created] (FLINK-30451) Add Estimator and Transformer for Swing

2022-12-18 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-30451: - Summary: Add Estimator and Transformer for Swing Key: FLINK-30451 URL: https://issues.apache.org/jira/browse/FLINK-30451 Project: Flink Issue Type: New

[jira] [Created] (FLINK-30249) TableUtils.getRowTypeInfo() creating wrong TypeInformation

2022-11-30 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-30249: - Summary: TableUtils.getRowTypeInfo() creating wrong TypeInformation Key: FLINK-30249 URL: https://issues.apache.org/jira/browse/FLINK-30249 Project: Flink

[jira] [Created] (FLINK-29911) Improve performance of AgglomerativeClustering

2022-11-06 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-29911: - Summary: Improve performance of AgglomerativeClustering Key: FLINK-29911 URL: https://issues.apache.org/jira/browse/FLINK-29911 Project: Flink Issue Type

[jira] [Created] (FLINK-29824) AgglomerativeClustering fails when the distanceThreshold is very large

2022-10-31 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-29824: - Summary: AgglomerativeClustering fails when the distanceThreshold is very large Key: FLINK-29824 URL: https://issues.apache.org/jira/browse/FLINK-29824 Project

[jira] [Created] (FLINK-29176) Add python source/test for ChiSqTest

2022-09-01 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-29176: - Summary: Add python source/test for ChiSqTest Key: FLINK-29176 URL: https://issues.apache.org/jira/browse/FLINK-29176 Project: Flink Issue Type

[jira] [Created] (FLINK-29175) Add documents for AgglomerativeClustering, KbinsCretizer, ChisquaredTest, VectorIndexer, Tokenizer, RegexTokenizer, IDF

2022-09-01 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-29175: - Summary: Add documents for AgglomerativeClustering, KbinsCretizer, ChisquaredTest, VectorIndexer, Tokenizer, RegexTokenizer, IDF Key: FLINK-29175 URL: https://issues.apache.org

[jira] [Created] (FLINK-29174) Add document for ML algorithms

2022-09-01 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-29174: - Summary: Add document for ML algorithms Key: FLINK-29174 URL: https://issues.apache.org/jira/browse/FLINK-29174 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-29170) Add Transformer and Estimator for VarianceThresholdSelector

2022-09-01 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-29170: - Summary: Add Transformer and Estimator for VarianceThresholdSelector Key: FLINK-29170 URL: https://issues.apache.org/jira/browse/FLINK-29170 Project: Flink

[jira] [Created] (FLINK-29169) Add Transformer and Estimator for UnivariateFeatureSelector

2022-09-01 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-29169: - Summary: Add Transformer and Estimator for UnivariateFeatureSelector Key: FLINK-29169 URL: https://issues.apache.org/jira/browse/FLINK-29169 Project: Flink

[jira] [Created] (FLINK-29168) Add Transformer for NGram

2022-09-01 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-29168: - Summary: Add Transformer for NGram Key: FLINK-29168 URL: https://issues.apache.org/jira/browse/FLINK-29168 Project: Flink Issue Type: New Feature

Re: [ANNOUNCE] New Apache Flink Committer - Junhan Yang

2022-08-17 Thread Zhipeng Zhang
Congratulations, Junhan! Xintong Song 于2022年8月18日周四 11:21写道: > > Hi everyone, > > On behalf of the PMC, I'm very happy to announce Junhan Yang as a new Flink > committer. > > Junhan has been contributing to the Flink project for more than 1 year. His > contributions are mostly identified in the w

Re: [ANNOUNCE] New Apache Flink Committer - Lijie Wang

2022-08-17 Thread Zhipeng Zhang
Congratulations, Lijie! Xintong Song 于2022年8月18日周四 11:23写道: > > Congratulations Lijie, and welcome~! > > Best, > > Xintong > > > > On Thu, Aug 18, 2022 at 11:12 AM Xingbo Huang wrote: > > > Congrats, Lijie > > > > Best, > > Xingbo > > > > Lincoln Lee 于2022年8月18日周四 11:01写道: > > > > > Congratulat

[jira] [Created] (FLINK-28906) Add AlgoOperator for AgglomerativeClustering

2022-08-10 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-28906: - Summary: Add AlgoOperator for AgglomerativeClustering Key: FLINK-28906 URL: https://issues.apache.org/jira/browse/FLINK-28906 Project: Flink Issue Type

[jira] [Created] (FLINK-28806) Add Estimator and Transformer for IDF

2022-08-04 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-28806: - Summary: Add Estimator and Transformer for IDF Key: FLINK-28806 URL: https://issues.apache.org/jira/browse/FLINK-28806 Project: Flink Issue Type: New

[jira] [Created] (FLINK-28805) Add Transformer for HashingTF

2022-08-04 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-28805: - Summary: Add Transformer for HashingTF Key: FLINK-28805 URL: https://issues.apache.org/jira/browse/FLINK-28805 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-28803) Add Transformer and Estimator for KBinsDiscretizer

2022-08-04 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-28803: - Summary: Add Transformer and Estimator for KBinsDiscretizer Key: FLINK-28803 URL: https://issues.apache.org/jira/browse/FLINK-28803 Project: Flink Issue

[jira] [Created] (FLINK-28739) Illegal State for checkpoint in LogisticRegressionTest.test_get_model_data

2022-07-29 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-28739: - Summary: Illegal State for checkpoint in LogisticRegressionTest.test_get_model_data Key: FLINK-28739 URL: https://issues.apache.org/jira/browse/FLINK-28739

[jira] [Created] (FLINK-28684) NullPointerException at OneHotEncoder.GenerateModelDataOperator.snapshot

2022-07-25 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-28684: - Summary: NullPointerException at OneHotEncoder.GenerateModelDataOperator.snapshot Key: FLINK-28684 URL: https://issues.apache.org/jira/browse/FLINK-28684 Project

[ANNOUNCE] Apache Flink ML 2.1.0 released

2022-07-12 Thread Zhipeng Zhang
The Apache Flink community is excited to announce the release of Flink ML 2.1.0! This release focuses on improving Flink ML's infrastructure, such as Python SDK, memory management, and benchmark framework, to facilitate the development of performant, memory-safe, and easy-to-use algorithm librari

[jira] [Created] (FLINK-28502) Add Transformer for RegexTokenizer

2022-07-11 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-28502: - Summary: Add Transformer for RegexTokenizer Key: FLINK-28502 URL: https://issues.apache.org/jira/browse/FLINK-28502 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-28501) Add Transformer and Estimator for VectorIndexer

2022-07-11 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-28501: - Summary: Add Transformer and Estimator for VectorIndexer Key: FLINK-28501 URL: https://issues.apache.org/jira/browse/FLINK-28501 Project: Flink Issue Type

[jira] [Created] (FLINK-28500) Add Transformer for Tokenizer

2022-07-11 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-28500: - Summary: Add Transformer for Tokenizer Key: FLINK-28500 URL: https://issues.apache.org/jira/browse/FLINK-28500 Project: Flink Issue Type: New Feature

[RESULT] [VOTE] Apache Flink ML Release 2.1.0, release candidate #2

2022-07-07 Thread Zhipeng Zhang
I'm happy to announce that we have unanimously approved this release [1]. * Dong Lin (non-binding) * Yunfeng Zhou (non-binding) * Dian Fu (binding) * Yun Gao (binding) * Becket Qin (binding) There are no disapproving votes. Thanks everyone! [1] https://lists.apache.org/list.html?dev@flink.apach

[VOTE] Apache Flink ML Release 2.1.0, release candidate #2

2022-06-30 Thread Zhipeng Zhang
Hi everyone, Please review and vote on the release candidate #2 for the version 2.1.0 of Apache Flink ML as follows: [ ] +1, Approve the release [ ] -1, Do not approve the release (please provide specific comments) **Testing Guideline** You can find here [1] a page in the project wiki on in

Re: [VOTE] Apache Flink ML Release 2.1.0, release candidate #1

2022-06-29 Thread Zhipeng Zhang
idate > to fix this issue? > > Thanks, > Dong > > > On Tue, Jun 28, 2022 at 5:28 PM Zhipeng Zhang > wrote: > > > Hi everyone, > > > > Please review and vote on the release candidate #1 for the version 2.1.0 > of > > Apache Flink ML as follows: >

[VOTE] Apache Flink ML Release 2.1.0, release candidate #1

2022-06-28 Thread Zhipeng Zhang
Hi everyone, Please review and vote on the release candidate #1 for the version 2.1.0 of Apache Flink ML as follows: [ ] +1, Approve the release [ ] -1, Do not approve the release (please provide specific comments) **Testing Guideline** You can find here [1] a page in the project wiki on instruc

Re: [DISCUSS] Releasing Flink ML 2.1.0

2022-06-26 Thread Zhipeng Zhang
> > > > > Hi Zhipeng and Yun, > > > > > > Thanks for starting the discussion. +1 for the Flink ML 2.1.0 release. > > > > > > Cheers, > > > Dong > > > > > > On Thu, Jun 23, 2022 at 11:15 AM Zhipeng Zhang < > zhangzhip

[DISCUSS] Releasing Flink ML 2.1.0

2022-06-22 Thread Zhipeng Zhang
Hi devs, Yun and I would like to start a discussion for releasing Flink ML 2.1.0. In the past few months, we focused on improving the infra (e.g. memory management, benchmark infra, online training, python support) of Flink ML by implementing, benchmarking, an

Re: Re: [ANNOUNCE] New Apache Flink Committers: Qingsheng Ren, Shengkai Fang

2022-06-21 Thread Zhipeng Zhang
Congratulations, Qingsheng and ShengKai. Yang Wang 于2022年6月21日周二 19:43写道: > Congratulations, Qingsheng and ShengKai. > > > Best, > Yang > > Benchao Li 于2022年6月21日周二 19:33写道: > > > Congratulations! > > > > weijie guo 于2022年6月21日周二 13:44写道: > > > > > Congratulations, Qingsheng and ShengKai! > >

[jira] [Created] (FLINK-27952) Table UDF fails when using Double.POSITIVE_INFINITY as parameters

2022-06-08 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-27952: - Summary: Table UDF fails when using Double.POSITIVE_INFINITY as parameters Key: FLINK-27952 URL: https://issues.apache.org/jira/browse/FLINK-27952 Project: Flink

[jira] [Created] (FLINK-27877) Improve performance of several feature engineering algorithms

2022-06-01 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-27877: - Summary: Improve performance of several feature engineering algorithms Key: FLINK-27877 URL: https://issues.apache.org/jira/browse/FLINK-27877 Project: Flink

[jira] [Created] (FLINK-27826) Support machine learning training for high dimesional models

2022-05-28 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-27826: - Summary: Support machine learning training for high dimesional models Key: FLINK-27826 URL: https://issues.apache.org/jira/browse/FLINK-27826 Project: Flink

[jira] [Created] (FLINK-27072) Add Bucketizer in FlinkML

2022-04-05 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-27072: - Summary: Add Bucketizer in FlinkML Key: FLINK-27072 URL: https://issues.apache.org/jira/browse/FLINK-27072 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-26801) LogisticRegressionTest.testGetModelData» Runtime Failed to fetch next res...

2022-03-22 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-26801: - Summary: LogisticRegressionTest.testGetModelData» Runtime Failed to fetch next res... Key: FLINK-26801 URL: https://issues.apache.org/jira/browse/FLINK-26801

Re: [ANNOUNCE] New PMC member: Yuan Mei

2022-03-14 Thread Zhipeng Zhang
Congratulations, Yuan! Matt Wang 于2022年3月15日周二 10:13写道: > Congratulations, Yuan! > > > -- > > Best, > Matt Wang > > > On 03/15/2022 09:51,godfrey he wrote: > Congratulations, Yuan! > > Best, > Godfrey > > Lijie Wang 于2022年3月15日周二 09:18写道: > > Congratulations, Yuan! > > Best, > Lijie > > Benchao

[jira] [Created] (FLINK-26626) Add Estimator and Transformer of StandardScaler in FlinkML

2022-03-13 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-26626: - Summary: Add Estimator and Transformer of StandardScaler in FlinkML Key: FLINK-26626 URL: https://issues.apache.org/jira/browse/FLINK-26626 Project: Flink

Re: [ANNOUNCE] New Apache Flink Committer - Martijn Visser

2022-03-03 Thread Zhipeng Zhang
Congratulations Martijn! Qingsheng Ren 于2022年3月4日周五 10:14写道: > Congratulations Martijn! > > Best regards, > > Qingsheng Ren > > > On Mar 4, 2022, at 9:56 AM, Leonard Xu wrote: > > > > Congratulations and well deserved Martjin ! > > > > Best, > > Leonard > > > >> 2022年3月4日 上午7:55,Austin Cawley-E

[jira] [Created] (FLINK-26443) Add a benchmark framework for flinkml

2022-03-01 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-26443: - Summary: Add a benchmark framework for flinkml Key: FLINK-26443 URL: https://issues.apache.org/jira/browse/FLINK-26443 Project: Flink Issue Type: New

Re: [ANNOUNCE] New Apache Flink Committers: Feng Wang, Zhipeng Zhang

2022-02-17 Thread Zhipeng Zhang
o announce two new Flink > > committers: Feng Wang and Zhipeng Zhang! > > > > Feng is one of the most active Flink evangelists in China, with plenty of > > public talks, blog posts and other evangelization activities. The PMC > wants > > to recognize and value t

Re: [DISCUSS] FLIP-205: Support cache in DataStream for Batch Processing

2022-01-05 Thread Zhipeng Zhang
Hi Xuannnan, Thanks for the reply. Regarding whether and how to support cache sideoutput, I agree that the second option might be better if there do exist a use case that users need to cache only some certain side outputs. Xuannan Su 于2022年1月4日周二 15:50写道: > Hi Zhipeng and Gen, > > Thanks for

[jira] [Created] (FLINK-25527) Add StringIndexer in FlinkML

2022-01-05 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-25527: - Summary: Add StringIndexer in FlinkML Key: FLINK-25527 URL: https://issues.apache.org/jira/browse/FLINK-25527 Project: Flink Issue Type: New Feature

Re: [DISCUSS] Drop Gelly

2022-01-04 Thread Zhipeng Zhang
t; Martijn > > On Tue, 4 Jan 2022 at 02:57, Zhipeng Zhang > wrote: > >> Hi everyone, >> >> Thanks for starting the discussion :) >> >> We (Alink team [1]) are actually using part of the Gelly library to >> support graph algorithms (connected component, sin

Re: [DISCUSS] Drop Gelly

2022-01-03 Thread Zhipeng Zhang
Hi everyone, Thanks for starting the discussion :) We (Alink team [1]) are actually using part of the Gelly library to support graph algorithms (connected component, single source shortest path, etc.) for users in Alibaba Inc. As DataSet API is going to be dropped, shall we also provide a new gr

Re: [VOTE] Apache Flink ML Release 2.0.0, release candidate #3

2021-12-30 Thread Zhipeng Zhang
+1 (non-binding) - Verified that the checksums and GPG files match the corresponding release files - Verified that the source distributions do not contain any binaries - Built the source distribution with Maven to ensure all source files have Apache headers - Verified that all POM files point to t

Re: [DISCUSS] FLIP-205: Support cache in DataStream for Batch Processing

2021-12-30 Thread Zhipeng Zhang
Hi Xuannan, Thanks for starting the discussion. This would certainly help a lot on both efficiency and reproducibility in machine learning cases :) I have a few questions as follows: 1. Can we support caching both the output and sideoutputs of a SingleOutputStreamOperator (which I believe is a r

Re: [VOTE] Apache Flink ML Release 2.0.0, release candidate #2

2021-12-29 Thread Zhipeng Zhang
Hi Yun, Thanks for the release! I found that the NOTICE and license of `flink-ml-uber` is wrong since `flink-ml-uber` does not use `com.github.fommil.netlib:core:1.1.2` anymore. Rather, we are using `dev.ludovic.netlib:blas:2.2.0` in flink-ml-core. I have created a PR to remove the NOTICE and li

[jira] [Created] (FLINK-25424) Checkpointing is currently not supported for operators that implement InputSelectable

2021-12-22 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-25424: - Summary: Checkpointing is currently not supported for operators that implement InputSelectable Key: FLINK-25424 URL: https://issues.apache.org/jira/browse/FLINK-25424

Re: [DISCUSS] Releasing Flink ML 2.0.0

2021-12-17 Thread Zhipeng Zhang
Hi Dong, Thanks for starting the discussion. It is really a great pleasure to be part of the FlinkML project. Cheers :) Dong Lin 于2021年12月17日周五 22:22写道: > Hi devs, > > Yun and I would like to start a discussion for releasing Flink ML > 2.0.0 > > In the past

[jira] [Created] (FLINK-24845) Add allreduce utility function in FlinkML

2021-11-09 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-24845: - Summary: Add allreduce utility function in FlinkML Key: FLINK-24845 URL: https://issues.apache.org/jira/browse/FLINK-24845 Project: Flink Issue Type: New

[jira] [Created] (FLINK-24838) Add BaseAlgoImpl class to support link() and linkFrom()

2021-11-08 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-24838: - Summary: Add BaseAlgoImpl class to support link() and linkFrom() Key: FLINK-24838 URL: https://issues.apache.org/jira/browse/FLINK-24838 Project: Flink

[jira] [Created] (FLINK-24556) Support Logistic Regression in FlinkML

2021-10-14 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-24556: - Summary: Support Logistic Regression in FlinkML Key: FLINK-24556 URL: https://issues.apache.org/jira/browse/FLINK-24556 Project: Flink Issue Type: New

Re: [DISCUSS] FLIP-173: Support DAG of algorithms (Flink ML)

2021-08-26 Thread Zhipeng Zhang
Thanks for the post, Dong :) We welcome everyone to drop us an email on Flink ML. Let's work together to build machine learning on Flink :) Dong Lin 于2021年8月25日周三 下午8:58写道: > Hi everyone, > > Based on the feedback received in the online/offline discussion in the > past few weeks, we (Zhepeng,

Re: [DISCUSS] FLIP-173: Support DAG of algorithms (Flink ML)

2021-08-10 Thread Zhipeng Zhang
Hi Timo, Becket, Thanks for the feedback. I agree that having named table can help the code more readable. No matter there is one output table or multiple output tables, users have to access an output table by a magic index (For the case that there is only one output table, we need to use index z

Re: [DISCUSS] FLIP-173: Support DAG of algorithms (Flink ML)

2021-07-20 Thread Zhipeng Zhang
://docs.google.com/document/d/1L3aI9LjkcUPoM52liEY6uFktMnFMNFQ6kXAjnz_11do/edit#heading=h.c2qr9r64btd9 Thanks, Zhipeng Zhang Becket Qin 于2021年7月20日周二 上午11:42写道: > Hi Dong, Zhipeng and Fan, > > Thanks for the detailed proposals. It is quite a lot of reading! Given > that we are introduc