Re: Flink ML - Vector and DenseVector

2016-01-19 Thread Hilmi Yildirim
Hi, yes sry. This was my mistake. I had problems with the E-Mail server... Am 19.01.2016 um 09:36 schrieb Chiwan Park: Hi Hilmi, As I understand, Till said that this email was sent twice [1][2]. [1]: http://mail-archives.apache.org/mod_mbox/flink-dev/201601.mbox/%3C569CD85A.6050309%40dfki.de%

Re: Flink ML - Vector and DenseVector

2016-01-19 Thread Chiwan Park
Hi Hilmi, As I understand, Till said that this email was sent twice [1][2]. [1]: http://mail-archives.apache.org/mod_mbox/flink-dev/201601.mbox/%3C569CD85A.6050309%40dfki.de%3E [2]: http://mail-archives.apache.org/mod_mbox/flink-dev/201601.mbox/%3C569CD969.1080100%40dfki.de%3E > On Jan 19, 201

Re: Flink ML - Vector and DenseVector

2016-01-19 Thread Hilmi Yildirim
We had a discussion about the "LabeledVector" class. But now this is a discussion about the Vector and DenseVector :) Am 18.01.2016 um 22:29 schrieb Till Rohrmann: Didn't we just had the discussion in another email thread? On Jan 18, 2016 8:55 PM, "Hilmi Yildirim" wrote: Hi, the Vector and D

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Chiwan Park
How about mapping a number for each string? Maybe you can do it with custom Transformer. > On Jan 19, 2016, at 12:02 AM, Hilmi Yildirim wrote: > > Ok. In this case I will use an Array instead. > > Am 18.01.2016 um 14:56 schrieb Theodore Vasiloudis: >> I agree with Till, the data types are diff

Re: Flink ML - Vector and DenseVector

2016-01-18 Thread Till Rohrmann
Didn't we just had the discussion in another email thread? On Jan 18, 2016 8:55 PM, "Hilmi Yildirim" wrote: > Hi, > the Vector and DenseVector implementations of Flink ML only allow Double > values. But there are cases where the values are not Doubles, e.g. in NLP. > Does it make sense to make th

Flink ML - Vector and DenseVector

2016-01-18 Thread Hilmi Yildirim
Hi, the Vector and DenseVector implementations of Flink ML only allow Double values. But there are cases where the values are not Doubles, e.g. in NLP. Does it make sense to make the implementations generic, i.e. Vector[T] and DenseVector[T]? Best Regards, Hilmi -- ==

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Hilmi Yildirim
Ok. In this case I will use an Array instead. Am 18.01.2016 um 14:56 schrieb Theodore Vasiloudis: I agree with Till, the data types are different here so you need a custom string vector. The Vector abstraction in FlinkML is designed with numerical vectors in mind. On Mon, Jan 18, 2016 at 2:33

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Hilmi Yildirim
Ok. In this case I will use an Array instead. Am 18.01.2016 um 14:56 schrieb Theodore Vasiloudis: I agree with Till, the data types are different here so you need a custom string vector. The Vector abstraction in FlinkML is designed with numerical vectors in mind. On Mon, Jan 18, 2016 at 2:33

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Theodore Vasiloudis
I agree with Till, the data types are different here so you need a custom string vector. The Vector abstraction in FlinkML is designed with numerical vectors in mind. On Mon, Jan 18, 2016 at 2:33 PM, Till Rohrmann wrote: > Hi Hilmi, > > I think in your case it makes sense to define a custom vec

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Till Rohrmann
Hi Hilmi, I think in your case it makes sense to define a custom vector of strings. The easiest implementation could be an Array[String] or List[String]. The reason why it does not make so much sense to make Vector and DenseVector generic is that these types are algebraic data types. How would yo

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Hilmi Yildirim
Hi, how I explained it in a previous E-Mail, I need a LabeledVector where the label is also a vector. After we discussed this issue, I created a new class named LabeledSequenceVector with the labels as a Vector. In my use case, I want to train a POS-Tagger system, so the "vector" is a vector o

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Chiwan Park
Hi Hilmi, In NLP, which types are used for vector values? I think we can cover typical case using double values. > On Jan 18, 2016, at 9:19 PM, Hilmi Yildirim wrote: > > Hi, > the Vector and DenseVector implementations of Flink ML only allow Double > values. But there are cases where the valu

Flink ML Vector and DenseVector

2016-01-18 Thread Hilmi Yildirim
Hi, the Vector and DenseVector implementations of Flink ML only allow Double values. But there are cases where the values are not Doubles, e.g. in NLP. Does it make sense to make the implementations generic, i.e. Vector[T] and DenseVector[T]? Best Regards, Hilmi -- ==