Yep, I definitely agree with you.

Moreover, such improvement should reduce parallel hierarchies in trainers
and preprocessors, from this point of view preprocessor will be equal to a
trainer. In my opinion, this improvement is very important for ml module
because it can give a flexible hierarchy of components.

I created a ticket for serializable object support in Vectors:
https://issues.apache.org/jira/browse/IGNITE-11647
Another related ticket to this thread:
https://issues.apache.org/jira/browse/IGNITE-11642

чт, 28 мар. 2019 г. в 11:27, Alexey Zinoviev <zaleslaw....@gmail.com>:

> Hi, Igniters
>
> The new functionality of building Vectors was merged to Apache Ignite in
> the
> next commit
> <
> https://github.com/apache/ignite/commit/a0a15d62a250defb0db9ec72153ee287830f6a15>
>
>
> This new functionality brings to Ignite ML the new approach of building
> vectors. But in my opinion the shouldn't constrain ourselves with narrow
> understanding of Vector nature as an analogue of double[] array.
>
> I suggest to extend the Vector and Vectorizer API to support Strings and
> another types (like Blobs, Images and etc) as a vector elements.
>
> It brings next advantages:
> * gives a chance to inify the hierarchy of Preprocessing Trainers and Model
> Trainers
> * give us a chance to implement ML algorithms working not only with doubles
> * unifies our Vectorizers as a first step in our Pipelines
> * drops a lot of unused generics
> * makes one simple requirement to final users: convert their data to
> Vectors
>
> Join to discussion, ML-interested persons and share your opinon here!
>
>
>
> --
> Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/
>

Reply via email to