Re: Re: questions of partition and task of Samza

2015-10-29 Thread Selina Tech
Hi, Yan:
Thanks a lot for your answer.
Sincerely, Selina

On Mon, Oct 26, 2015 at 8:03 PM, Yan Fang wrote:
> Hi Selina,
>
> Your understanding is correct. Yes, you "need to consume the original input and send it back to Kafka and reset the *Key to departmentName* and then consume it

Re: Re: questions of partition and task of Samza

2015-10-26 Thread Yan Fang
Hi Selina, Your understanding is correct. Yes, you "need to consume the original input and send it back to Kafka and reset the *Key to departmentName* and then consume it again to count in Samza" if you want to count the number of students in the same departmentName. This is a typical aggreg
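
The two-stage approach described in this reply (re-key by departmentName, then count) can be sketched in plain Java without Samza or Kafka. This is only an illustration under stated assumptions: the `Student` type, the `partitionFor` helper, and the use of `String.hashCode` as the partitioner are hypothetical stand-ins, not the Samza or Kafka API, and the real message schema is not shown in the thread.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class RepartitionSketch {
    // Hypothetical record type; the thread does not show the real schema.
    public static final class Student {
        public final String id;
        public final String departmentName;

        public Student(String id, String departmentName) {
            this.id = id;
            this.departmentName = departmentName;
        }
    }

    // Stand-in for a hash partitioner (an assumption, not Kafka's own code):
    // equal key values always map to the same partition number.
    public static int partitionFor(String key, int numPartitions) {
        return Math.floorMod(key.hashCode(), numPartitions);
    }

    // Stage 1: re-key every message by departmentName and route it
    // to the partition chosen for that key value.
    public static List<List<Student>> repartitionByDepartment(
            List<Student> input, int numPartitions) {
        List<List<Student>> partitions = new ArrayList<>();
        for (int i = 0; i < numPartitions; i++) {
            partitions.add(new ArrayList<>());
        }
        for (Student s : input) {
            int p = partitionFor(s.departmentName, numPartitions);
            partitions.get(p).add(s);
        }
        return partitions;
    }

    // Stage 2: one task counts only the messages in its own partition.
    public static Map<String, Integer> countPerTask(List<Student> partition) {
        Map<String, Integer> counts = new HashMap<>();
        for (Student s : partition) {
            counts.merge(s.departmentName, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        List<Student> input = new ArrayList<>();
        input.add(new Student("s1", "Math"));
        input.add(new Student("s2", "Physics"));
        input.add(new Student("s3", "Math"));

        List<List<Student>> partitions = repartitionByDepartment(input, 3);
        for (int task = 0; task < partitions.size(); task++) {
            System.out.println("task " + task + ": "
                    + countPerTask(partitions.get(task)));
        }
    }
}
```

Because every message for a given departmentName lands in a single partition after stage 1, the task that owns that partition sees all of that department's students, so its local count is the complete count.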

Re: questions of partition and task of Samza

2015-10-24 Thread Selina Tech
Hi, Yan: Thanks a lot for your reply. You mentioned "if you give the msgs the same partition key". Does that mean the same partition key value or the same partition key attribute name? By "primary key" I meant the "key" in public KeyedMessage(java.lang.String topic, K key, V message) or

questions of partition and task of Samza

2015-10-21 Thread Selina Tech
Hi, All: The Samza documentation says "Each task consumes data from one partition for each of the job’s input streams." Does this mean that if the data one job processes is not all in one partition, the result will be wrong? Assuming my Samza input data on the Kafka topic "input" is p
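
The quoted sentence from the documentation can be read as: task number N consumes partition N of every input stream that has such a partition. The plain-Java sketch below illustrates that reading only. It is not Samza code, the `assign` helper is hypothetical, and the second stream name `other` and both partition counts are made up for the example.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class TaskAssignmentSketch {
    // Given stream name -> partition count, return task number -> the
    // "stream/partition" labels that task would consume, assuming one task
    // per partition number (an assumption for illustration).
    public static Map<Integer, List<String>> assign(Map<String, Integer> partitionCounts) {
        Map<Integer, List<String>> tasks = new TreeMap<>();
        for (Map.Entry<String, Integer> stream : partitionCounts.entrySet()) {
            for (int p = 0; p < stream.getValue(); p++) {
                tasks.computeIfAbsent(p, k -> new ArrayList<>())
                     .add(stream.getKey() + "/" + p);
            }
        }
        return tasks;
    }

    public static void main(String[] args) {
        // Hypothetical partition counts; "other" is an invented second stream.
        Map<String, Integer> streams = new LinkedHashMap<>();
        streams.put("input", 4);
        streams.put("other", 2);

        for (Map.Entry<Integer, List<String>> task : assign(streams).entrySet()) {
            System.out.println("task " + task.getKey() + " <- " + task.getValue());
        }
    }
}
```

Under this reading a task never sees messages from other partitions of the same stream, which is why an aggregate over some attribute is only complete per task when all messages sharing that attribute value are in one partition.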