Re: Periodic cleanup of unused local stores

2016-09-08 Thread santhosh venkat
Hi Navina, Thanks for the review and the comments. Please find my replies inline. 1. "It is always very useful to provide more context to the reader, esp. in explaining what the different terms mean (like host-affinity, tombstone etc.) and how it relates to the problem being described." >> Updated

Re: Review Request 51142: SAMZA-967: HDFS System Consumer

2016-09-08 Thread Hai Lu
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/51142/ --- (Updated Sept. 9, 2016, 1:34 a.m.) Review request for samza, Chris Pettitt, Yi

Re: Review Request 51142: SAMZA-967: HDFS System Consumer

2016-09-08 Thread Hai Lu
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/51142/ --- (Updated Sept. 9, 2016, 1:32 a.m.) Review request for samza, Chris Pettitt, Yi

Re: Review Request 51142: SAMZA-967: HDFS System Consumer

2016-09-08 Thread Hai Lu
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/51142/ --- (Updated Sept. 9, 2016, 1:30 a.m.) Review request for samza, Chris Pettitt, Yi

Re: Question about Samza Metrcis

2016-09-08 Thread 舒琦
Hi Navina, Thanks for your clear description, I understand more now. ShuQi 舒琦 Address: 6F, Unit 1, Building A4, Lugu Enterprise Plaza, 27 Wenxuan Road, Yuelu District, Changsha Website: http://www.eefung.com Weibo: http://weibo.com/eefung Postal code: 410013 Tel: 400-677-0986 Fax: 0731-88519609 > On Sept. 9, 2016, at 04:19, Navina Ramesh wrote: > > Hi ShuQi, > > Auto-creation of streams

Re: Periodic cleanup of unused local stores

2016-09-08 Thread Navina R
(Replying again since it seems to have bounced off most inboxes): Hi Santhosh, Thanks for picking SAMZA-656. This is long overdue and will help make our host-affinity based solution more robust. I have a couple of thoughts on your design proposal. 1. It is always very useful to provide more cont

Re: Question about Samza Metrcis

2016-09-08 Thread Navina Ramesh
Hi ShuQi, Auto-creation of streams depends on your Kafka server configuration. In case of coordinator stream and checkpoint stream, samza explicitly creates a stream with 1 partition before publishing to it. This doesn't apply for metrics. So, if auto-creation is turned off in kafka server, then y

Re: Review Request 51346: SAMZA-974 - Support finite datasources in Samza that have a notion of End-Of-Stream

2016-09-08 Thread Xinyu Liu
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/51346/#review148229 --- samza-core/src/main/scala/org/apache/samza/system/SystemConsumers

Re: Review Request 50174: SAMZA-977: User doc for samza multithreading

2016-09-08 Thread Navina Ramesh
> On Aug. 31, 2016, 12:54 a.m., Navina Ramesh wrote: > > docs/learn/documentation/versioned/jobs/configuration-table.html, line 357 > > Is it too late to comment on the config key pattern? Traditionally, we

Re: Review Request 50174: SAMZA-977: User doc for samza multithreading

2016-09-08 Thread Navina Ramesh
> On Aug. 30, 2016, 1 a.m., Xinyu Liu wrote: > > docs/learn/documentation/versioned/jobs/configuration-table.html, line 368 > > oh, actually the job.container.single.thread.mode means using the old > > runl

Re: Review Request 51346: SAMZA-974 - Support finite datasources in Samza that have a notion of End-Of-Stream

2016-09-08 Thread Xinyu Liu
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/51346/#review148210 --- samza-core/src/main/java/org/apache/samza/task/AsyncRunLoop.java

Review Request 51726: SAMZA-1005: Refactor class instantiation code to a helper class.

2016-09-08 Thread Branislav Cogic
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/51726/ --- Review request for samza. Bugs: SAMZA-1005 https://issues.apache.org/jira/b

How to handling RecordTooLargeException while sending message to kafka system at 0.10.1

2016-09-08 Thread Liu Bo
Hi group, We ran into a RecordTooLargeException while sending messages to the outgoing Kafka system. We solved this in Samza 0.10.0. Now it's back in 0.10.1... Here are the details. We are analyzing crawled web pages, and some messages grow beyond the max.request.size after new features are added to them. Our