Weihua Jiang created FLINK-11096: ------------------------------------ Summary: Create a new table based flink ML package Key: FLINK-11096 URL: https://issues.apache.org/jira/browse/FLINK-11096 Project: Flink Issue Type: Sub-task Components: Machine Learning Library, Table API & SQL Reporter: Weihua Jiang
Currently, the DataSet based ML library is under org.apache._flink.ml_ scala package and under _flink-libraries/flink-ml directory._ There are two questions related to packaging: # Shall we create a new scala/java package, e.g. org.apache.flink.table.ml? Or still stay in org.apache.flink.ml? # Shall we still put new code in flink-libraries/flink-ml directory or create a new one, e.g. flink-libraries/flink-table-ml and corresponding maven package? I implemented a prototype for the design and found that the new design is very hard to fit into existing flink.ml codebase. The existing flink.ml code is tightly coupled with DataSet API. Thus, I have to rewrite almost all parts of flink.ml to get some sample case to work. The only reusable code from flink.ml are the base math classes under _org.apache.flink.ml.math_ and _org.apache.flink.ml.metrics.distance_ packages. Considering this fact, I will prefer to create a new package org.apache.flink.table.ml and a new maven package flink-table-ml. Please feel free to give your feedbacks. -- This message was sent by Atlassian JIRA (v7.6.3#76005)