Re: Using processor API via DSL

2019-04-15 Thread Alessandro Tagliapietra
Thank you Bruno, I'll look into those. However, the average is just a simple thing I'm trying right now to get an initial windowing flow working. In the future I'll probably still need the actual values for other calculations. We won't have more than 60 elements per window for sure. So far to not

Re: Using processor API via DSL

2019-04-15 Thread Bruno Cadonna
Hi Alessandro, Have a look at this Kafka Usage Pattern for computing averages without using an ArrayList. https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Stream+Usage+Patterns#KafkaStreamUsagePatterns-Howtocomputean(windowed)average? The advantages of this pattern over the ArrayList appr
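
A minimal sketch of the idea behind averaging without an ArrayList, assuming the aggregate keeps only a running count and sum per window. The class and method names (RunningAverage, empty, add, average) are hypothetical and are not taken from the linked wiki page; plain Java stands in for the Kafka Streams initializer and aggregator so the example runs without any Kafka dependency.

```java
// Hypothetical accumulator: the names here are illustrative only.
final class RunningAverage {
    final long count;
    final double sum;

    RunningAverage(long count, double sum) {
        this.count = count;
        this.sum = sum;
    }

    // Plays the role of the initializer: an empty aggregate for a new window.
    static RunningAverage empty() {
        return new RunningAverage(0L, 0.0);
    }

    // Plays the role of the aggregator: fold one more value into the aggregate.
    RunningAverage add(double value) {
        return new RunningAverage(count + 1, sum + value);
    }

    // Final mapping step: turn the aggregate into the average.
    // Returning 0.0 for an empty window is an assumption of this sketch.
    double average() {
        return count == 0 ? 0.0 : sum / count;
    }

    public static void main(String[] args) {
        double[] values = {10.0, 20.0, 30.0};
        RunningAverage acc = RunningAverage.empty();
        for (double v : values) {
            acc = acc.add(v);
        }
        System.out.println(acc.average()); // prints 20.0
    }
}
```

The state stored per window stays at two numbers regardless of how many records arrive, which is the main difference from collecting every value in a list.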

Re: Using processor API via DSL

2019-04-15 Thread Alessandro Tagliapietra
Sorry, but this turned out to be harder than I thought. To get the custom aggregation working I need an ArrayList of all the values in the window. So far my aggregate DSL method creates an ArrayList in the initializer and adds each value to the list in the aggregator. Then I think I'll have to provide
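
The list-based approach described in this message can be sketched roughly as follows, in plain Java rather than against the Kafka Streams API. The names (ListWindowAggregate, init, add, average) are hypothetical; init stands in for the initializer and add for the aggregator. Returning 0.0 for an empty list is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: mirrors the initializer/aggregator shape, no Kafka dependency.
final class ListWindowAggregate {

    // Initializer role: each window starts with an empty list.
    static List<Double> init() {
        return new ArrayList<>();
    }

    // Aggregator role: append the new value and return the same list.
    static List<Double> add(List<Double> acc, double value) {
        acc.add(value);
        return acc;
    }

    // Custom calculation over all values collected for one window.
    static double average(List<Double> values) {
        if (values.isEmpty()) {
            return 0.0;
        }
        double sum = 0.0;
        for (double v : values) {
            sum += v;
        }
        return sum / values.size();
    }

    public static void main(String[] args) {
        List<Double> window = init();
        add(window, 1.0);
        add(window, 2.0);
        add(window, 6.0);
        System.out.println(average(window)); // prints 3.0
    }
}
```

Keeping the raw values makes other per-window calculations possible later, at the cost of storing and serializing the whole list for every window.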

Re: Partition Strategy in Kafka Stream

2019-04-15 Thread Matthias J. Sax
As explained on https://issues.apache.org/jira/browse/KAFKA-8207, setting a custom partition assignor is not allowed in Kafka Streams. There is work in progress to address the reported issue. Unfortunately, I am not aware of a workaround for now. -Matthias On 4/10/19 11:35 PM, Neeraj Bhatt wrote

Re: Using processor API via DSL

2019-04-15 Thread Alessandro Tagliapietra
Thank you Bruno and Matthias, I've modified the transformer to implement the ValueTransformerWithKey interface and everything is working fine. I now have to window the data and manually aggregate each window's data, since I have to compute some averages and sum of differences. So far I'm just having some iss

OutOfMemoryError when restarting my Kafka Streams application

2019-04-15 Thread Kun Song
Hi community, I have encountered a Kafka Streams problem recently and have posted it on Stack Overflow [1]. Could anyone help me with this? Thank you in advance :) [1] https://stackoverflow.com/questions/55690662/outofmemoryerror-when-restart-my-kafka-streams-appplication

Re: Kafka tuning - consultant work

2019-04-15 Thread Stephen Boesch
Please refrain from using this list as a job board. Thank you. On Mon, Apr 15, 2019 at 07:00, Manoj Murumkar < manoj.murum...@gmail.com> wrote: > Damian, > > Let me know when we can talk. I have done extensive work on Kafka and run > a boutique consulting firm that specializes in this work. Let

Re: Kafka tuning - consultant work

2019-04-15 Thread Manoj Murumkar
Damian, Let me know when we can talk. I have done extensive work on Kafka and run a boutique consulting firm that specializes in this work. Let me know when it's a good time to talk. I am located in the San Francisco Bay Area and can do a bit of travel, if needed. Thanks, Manoj (650) 417.5847 > On A

Kafka tuning - consultant work

2019-04-15 Thread Damian Martinez
Hi all, we are looking for one experienced developer with hands-on experience installing and configuring Kafka in cloud and hybrid environments. This service is part of the backbone for data processing and analytics information. *Primary Skills*: Kafka *Duration:* 3 Months (possibility of extensio

kafka broker server uneven CPU utilization

2019-04-15 Thread Parth Gandhi
Hi, we have a 3-node Kafka cluster with 25+ topics and 300+ partitions spread across these servers. The load is evenly distributed among the partitions; however, we are observing that 2 broker servers are running at more than 60% CPU whereas the third one is running at just about 10%. How do we

Re: Method to check if the log-cleaner of a Kafka broker is running or not

2019-04-15 Thread Vincent Maurin
Hi, there are a couple of metrics and logs produced by the log cleaner: https://github.com/apache/kafka/blob/1.0/core/src/main/scala/kafka/log/LogCleaner.scala You can try to monitor those (metrics can be fetched with JMX, and for the logs you can tweak the log4j properties). Don't forget that the log
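
As a rough illustration of fetching metrics over JMX, the sketch below lists MBean names matching a pattern using the standard javax.management API. It queries the MBean server of the JVM it runs in; reading from a separate broker process would need a remote JMX connection, which is not shown. The class name MBeanLister is hypothetical, and the default pattern `kafka.log:type=LogCleaner,*` is an assumption about how the cleaner's metrics are registered, so verify it against your broker version.

```java
import java.lang.management.ManagementFactory;
import java.util.Set;
import javax.management.MBeanServer;
import javax.management.MalformedObjectNameException;
import javax.management.ObjectName;

// Hypothetical helper for listing MBeans by name pattern.
final class MBeanLister {

    // Returns the names of all registered MBeans that match the given pattern.
    static Set<ObjectName> find(MBeanServer server, String pattern) {
        try {
            return server.queryNames(new ObjectName(pattern), null);
        } catch (MalformedObjectNameException e) {
            throw new IllegalArgumentException("Bad MBean pattern: " + pattern, e);
        }
    }

    public static void main(String[] args) {
        // Local platform MBean server only; a broker in another process
        // would have to be reached through a remote JMX connector instead.
        MBeanServer server = ManagementFactory.getPlatformMBeanServer();
        // Assumed pattern for the log cleaner metrics; pass another as args[0].
        String pattern = args.length > 0 ? args[0] : "kafka.log:type=LogCleaner,*";
        for (ObjectName name : find(server, pattern)) {
            System.out.println(name);
        }
    }
}
```

Run outside a broker JVM, the default pattern simply matches nothing; passing a pattern such as `java.lang:type=Runtime` is a quick way to check that the lookup itself works.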