[ https://issues.apache.org/jira/browse/FLINK-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553973#comment-14553973 ]
ASF GitHub Bot commented on FLINK-2034: --------------------------------------- Github user thvasilo commented on a diff in the pull request: https://github.com/apache/flink/pull/688#discussion_r30786490 --- Diff: docs/libs/ml/index.md --- @@ -20,8 +20,100 @@ specific language governing permissions and limitations under the License. --> +The Machine Learning (ML) library for Flink is a new effort to bring scalable ML tools to the Flink +community. Our goal is is to design and implement a system that is scalable and can deal with +problems of various sizes, whether your data size is measured in megabytes or terabytes and beyond. +We call this library FlinkML. + +An important concern for developers of ML systems is the amount of glue code that developers are +forced to write [1] in the process of implementing an end-to-end ML system. Our goal with FlinkML +is to help developers keep glue code to a minimum. The Flink ecosystem provides a great setting to +tackle this problem, with its scalable ETL capabilities that can be easily combined inside the same +program with FlinkML, allowing the development of robust pipelines without the need to use yet +another technology for data ingestion and data munging. + +Another goal for FlinkML is to make the library easy to use. To that end we will be providing +detailed documentation along with examples for every part of the system. Our aim is that developers +will be able to get started with writing their ML pipelines quickly, using familiar programming +concepts and terminology. --- End diff -- I think the reason this is here is to communicate to people that would also like to contribute to the library that we consider documentation an integral part of the library and not an after-thought. > Add vision and roadmap for ML library to docs > --------------------------------------------- > > Key: FLINK-2034 > URL: https://issues.apache.org/jira/browse/FLINK-2034 > Project: Flink > Issue Type: Improvement > Components: Machine Learning Library > Reporter: Theodore Vasiloudis > Assignee: Theodore Vasiloudis > Labels: ML > Fix For: 0.9 > > > We should have a document describing the vision of the Machine Learning > library in Flink and an up to date roadmap. -- This message was sent by Atlassian JIRA (v6.3.4#6332)