[ https://issues.apache.org/jira/browse/FLINK-14152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Konstantin Knauf updated FLINK-14152:
-------------------------------------
    Labels: pull-request-available  (was: pull-request-available stale-major)

Removed "stale-critical|major|minor" label in line with https://issues.apache.org/jira/browse/FLINK-22429.

> Add class for DocCountVectorizerMapper.
> ------------------------------------------
>
>                 Key: FLINK-14152
>                 URL: https://issues.apache.org/jira/browse/FLINK-14152
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Library / Machine Learning
>            Reporter: Xu Yang
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> DocCountVectorizerModelMapper is a transformer that converts a document
> to a sparse vector based on the document frequency, word count or
> inverse document frequency of each word in the document.
> * Add DocCountVectorizerModelMapper for the operation of the
> DocCountVectorizerModelMapper.
> * Add DocCountVectorizerModelDataConverter to serialize and deserialize
> the model.
> * Add DocCountVectorizerPredictParams for the params of
> DocCountVectorizerModelMapper.
> * Add DocCountVectorizerModelMapperTest for the test example.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)