Re: Resource allocation on YARN

2020-02-27 Thread Abhishek S
With respect to the configs for YARN, I feel my recommendations should work fine. Regarding the job configs hopefully the following help (Note: these numbers are workload and environment dependent): *cluster-manager.container.memory.mb: * According to our benchmarks a 2G, 1CPU container can do:

Re: Resource allocation on YARN

2020-02-25 Thread Malcolm McFarland
Hey Abhishek, I appreciate the feedback. I'm not seeing any actual errors -- our cluster has been running pretty reliably for several months now -- but we're revisiting our resource allocation, and I was wondering if anybody out there, maybe even at Confluent or LinkedIn, has advice about good rul

Re: Resource allocation on YARN

2020-02-25 Thread Abhishek S
Hey Malcolm, **YARN** *yarn.nodemanager.resource.memory-mb (Amount of physical memory, in MB, that can be allocated for containers)* The value for this depends on if there are any other side-car applications on the machine that the node-manager runs on. eg. on your 32GB machine - if other apps on

Re: Resource allocation on YARN

2020-02-24 Thread Malcolm McFarland
On another, similar theme, has anybody tried running Samza on Hadoop 2.8.5? I'm experimenting with it right now, and can't get it to recognize the CPU core configuration. I'm curious if anybody knows about an API change between 2.7.x and 2.8.x in how applications are requested. What would the effe