Re: Flink ML - Vector and DenseVector

2016-01-19 Thread Hilmi Yildirim
Hi, yes sry. This was my mistake. I had problems with the E-Mail server... Am 19.01.2016 um 09:36 schrieb Chiwan Park: Hi Hilmi, As I understand, Till said that this email was sent twice [1][2]. [1]: http://mail-archives.apache.org/mod_mbox/flink-dev/201601.mbox/%3C569CD85A.6050309%40dfki.de%

Re: Flink ML - Vector and DenseVector

2016-01-19 Thread Chiwan Park
Hi Hilmi, As I understand, Till said that this email was sent twice [1][2]. [1]: http://mail-archives.apache.org/mod_mbox/flink-dev/201601.mbox/%3C569CD85A.6050309%40dfki.de%3E [2]: http://mail-archives.apache.org/mod_mbox/flink-dev/201601.mbox/%3C569CD969.1080100%40dfki.de%3E > On Jan 19, 201

Re: Flink ML - Vector and DenseVector

2016-01-19 Thread Hilmi Yildirim
We had a discussion about the "LabeledVector" class. But now this is a discussion about the Vector and DenseVector :) Am 18.01.2016 um 22:29 schrieb Till Rohrmann: Didn't we just had the discussion in another email thread? On Jan 18, 2016 8:55 PM, "Hilmi Yildirim" wrote: Hi, the Vector and D

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Chiwan Park
How about mapping a number for each string? Maybe you can do it with custom Transformer. > On Jan 19, 2016, at 12:02 AM, Hilmi Yildirim wrote: > > Ok. In this case I will use an Array instead. > > Am 18.01.2016 um 14:56 schrieb Theodore Vasiloudis: >> I agree with Till, the data types are diff

Re: Flink ML - Vector and DenseVector

2016-01-18 Thread Till Rohrmann
Didn't we just had the discussion in another email thread? On Jan 18, 2016 8:55 PM, "Hilmi Yildirim" wrote: > Hi, > the Vector and DenseVector implementations of Flink ML only allow Double > values. But there are cases where the values are not Doubles, e.g. in NLP. > Does it make sense to make th

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Hilmi Yildirim
Ok. In this case I will use an Array instead. Am 18.01.2016 um 14:56 schrieb Theodore Vasiloudis: I agree with Till, the data types are different here so you need a custom string vector. The Vector abstraction in FlinkML is designed with numerical vectors in mind. On Mon, Jan 18, 2016 at 2:33

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Hilmi Yildirim
Ok. In this case I will use an Array instead. Am 18.01.2016 um 14:56 schrieb Theodore Vasiloudis: I agree with Till, the data types are different here so you need a custom string vector. The Vector abstraction in FlinkML is designed with numerical vectors in mind. On Mon, Jan 18, 2016 at 2:33

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Theodore Vasiloudis
I agree with Till, the data types are different here so you need a custom string vector. The Vector abstraction in FlinkML is designed with numerical vectors in mind. On Mon, Jan 18, 2016 at 2:33 PM, Till Rohrmann wrote: > Hi Hilmi, > > I think in your case it makes sense to define a custom vec

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Till Rohrmann
Hi Hilmi, I think in your case it makes sense to define a custom vector of strings. The easiest implementation could be an Array[String] or List[String]. The reason why it does not make so much sense to make Vector and DenseVector generic is that these types are algebraic data types. How would yo

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Hilmi Yildirim
Hi, how I explained it in a previous E-Mail, I need a LabeledVector where the label is also a vector. After we discussed this issue, I created a new class named LabeledSequenceVector with the labels as a Vector. In my use case, I want to train a POS-Tagger system, so the "vector" is a vector o

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Chiwan Park
Hi Hilmi, In NLP, which types are used for vector values? I think we can cover typical case using double values. > On Jan 18, 2016, at 9:19 PM, Hilmi Yildirim wrote: > > Hi, > the Vector and DenseVector implementations of Flink ML only allow Double > values. But there are cases where the valu