Hi,
the ML pipeline of Flink has the "LabeledVector" class. It consists of
a vector and a label stored as a single Double value. Unfortunately,
this does not work for sequence learning, where the label is also a
vector. For example, in NLP the input is a vector of words and the
label is a vector of the corresponding per-word labels.
The optimize function of the "Solver" class takes a
DataSet[LabeledVector] as input and is therefore not applicable to
sequence learning either. I think LabeledVector should be adapted so
that the label is a vector instead of a single Double value. What do
you think?
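
To make the proposal a bit more concrete, here is a rough sketch of the
shape I have in mind. It is plain Scala and only an illustration: the
names SequenceLabeledVector and fromScalar are hypothetical and do not
exist in Flink, and Scala's own Vector[Double] stands in for Flink's
vector type so that the snippet runs without Flink on the classpath.

```scala
// Hypothetical sketch, NOT part of Flink's ML library.
// Scala's immutable Vector[Double] stands in for Flink's vector type.

// Proposed shape: the label is a vector as well, e.g. one entry per word
// of a sentence in a sequence-labeling task.
final case class SequenceLabeledVector(label: Vector[Double], vector: Vector[Double])

object SequenceLabeledVector {
  // The current single-Double label becomes the special case of a
  // label vector of length 1, so existing use cases stay expressible.
  def fromScalar(label: Double, vector: Vector[Double]): SequenceLabeledVector =
    SequenceLabeledVector(Vector(label), vector)
}
```

With this shape, a three-word sentence could be represented as
SequenceLabeledVector(Vector(0.0, 1.0, 0.0), Vector(12.0, 7.0, 3.0)),
where the numbers are made-up word and label indices, while
fromScalar(1.0, features) would cover the existing single-label case.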
Best Regards,
--
==================================================================
Hilmi Yildirim, M.Sc.
Researcher
DFKI GmbH
Intelligente Analytik für Massendaten
DFKI Projektbüro Berlin
Alt-Moabit 91c
D-10559 Berlin
Phone: +49 30 23895 1814
E-Mail: hilmi.yildi...@dfki.de
-------------------------------------------------------------
Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH
Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern
Geschaeftsfuehrung:
Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)
Dr. Walter Olthoff
Vorsitzender des Aufsichtsrats:
Prof. Dr. h.c. Hans A. Aukes
Amtsgericht Kaiserslautern, HRB 2313
-------------------------------------------------------------