Just a quick remark about memory and number of slots: with your configuration 
of 30 slots but only ~20gb of RAM each processing slot does not have a lot of 
memory to work with. For batch programs this can be a problem. I would suggest 
to use less but bigger slots, even if the number of cores is very high on your 
machine.

Best,
Aljoscha
> On 22. Jun 2017, at 17:36, Greg Hogan <c...@greghogan.com> wrote:
> 
> Some documentation on application profiling with Flink 1.3 (can be manually 
> inserted into the scripts for Flink 1.2):
>   
> https://ci.apache.org/projects/flink/flink-docs-release-1.3/monitoring/application_profiling.html
>  
> <https://ci.apache.org/projects/flink/flink-docs-release-1.3/monitoring/application_profiling.html>
> 
> 
>> On Jun 22, 2017, at 9:24 AM, Stefan Richter <s.rich...@data-artisans.com 
>> <mailto:s.rich...@data-artisans.com>> wrote:
>> 
>> Hi,
>> 
>> the answer highly depends on what you job is doing and there is no 
>> information about that. Also what is your target in performance? Are you 
>> using batch or streaming? If you feel like the performance is lower than 
>> expected, I suggest that you do some profiling to figure out the hotspots.
>> For example, you could see that your job spends most time in type 
>> serialization, which is a common bottleneck. In this case, maybe you can 
>> write a faster custom serializer. Or rewriting the job (e.g. use early 
>> aggregation where possible etc.) can yield much more performance improvement 
>> then tuning magic numbers with no further knowledge about your job.
>> 
>> Best,
>> Stefan 
>> 
>>> Am 22.06.2017 um 12:08 schrieb Samim Ahmed <samim1...@gmail.com 
>>> <mailto:samim1...@gmail.com>>:
>>> 
>>> Hi All,
>>> 
>>> This query regarding the flink performance improvement .
>>> 
>>> Flink Configuration:
>>> using flink in clustor mode with 3 salves and a master configuration
>>> slots used 30 (as the system has 30 core)
>>> task manager memory 30GB
>>> parallelism used : 30
>>> jobmanager.heap.mb: 20480
>>> taskmanager.heap.mb: 20480
>>> taskmanager.numberOfTaskSlots: 30
>>> taskmanager.network.numberOfBuffers: 20000
>>> 
>>> Input info:
>>> Input file : 1ROP(5min) data with 3333 Nodes and 665K eps
>>> Total number of events :: 199498294 
>>> 
>>> Observation :
>>> Total time taken to complete the task = 6m24s
>>> 
>>> Can you please suggest what else I need to modify to get the high 
>>> performance in terms of lese execution time. Thanks in advance
>>> 
>>> 
>>> -- 
>>> Regards,
>>> Samim Ahmed 
>>> Mumbai
>>> 09004259232

Reply via email to