Re: spark driver with OOM due to org.apache.spark.status.ElementTrackingStore

2022-08-02 Thread Qian Sun
Hi Jason, LiveUI initializes ElementTrackingStore with InMemoryStore, so it has OOM risk.

/**
 * Create an in-memory store for a live application.
 */
def createLiveStore(
    conf: SparkConf,
    appStatusSource: Option[AppStatusSource] = None): AppStatusStore = {
  val store = new ElementTracking
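Since the live store keeps its UI data on the driver heap, one common way to bound its growth on a long-running driver is to lower the `spark.ui.retained*` limits, which the status listener uses as thresholds for evicting old jobs, stages, tasks, and SQL executions. The sketch below is a minimal illustration, not a confirmed fix for this thread: the message preview is cut off before it names any settings, so the choice of keys and all values here are assumptions to be tuned against your own heap dumps.

```scala
import org.apache.spark.SparkConf

// Illustrative values only (assumptions, not taken from the thread).
// Lower limits mean less UI history is kept on the driver heap.
val conf = new SparkConf()
  .set("spark.ui.retainedJobs", "200")            // default 1000
  .set("spark.ui.retainedStages", "200")          // default 1000
  .set("spark.ui.retainedTasks", "20000")         // default 100000
  .set("spark.ui.retainedDeadExecutors", "20")    // default 100
  .set("spark.sql.ui.retainedExecutions", "200")  // default 1000
```

The same keys can be passed as `--conf key=value` to `spark-submit`. Eviction runs asynchronously once a threshold is exceeded, so the store can briefly hold more entries than the configured limit.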

spark driver with OOM due to org.apache.spark.status.ElementTrackingStore

2022-08-02 Thread Jason Jun
Hi there, We have a Spark driver running 24x7, and we are continuously getting OOM in the Spark driver every 10 days. After analyzing a heap dump, I found that org.apache.spark.status.ElementTrackingStore holds 85% of the heap, as in this image: [image: image.png] I found these parameters would be the root cause

Re: Setting spark.kubernetes.driver.connectionTimeout, spark.kubernetes.submission.connectionTimeout to default spark.network.timeout

2022-08-02 Thread Pralabh Kumar
Hi Dongjoon, Thanks for replying and clarifying. Below are the errors in Spark 3.2 on K8s that occurred because of the timeout. io.fabric8.kubernetes.client.KubernetesClientException: Operation: [create] for kind: [ConfigMap] with name: [null] in namespace: [xyz] failed. at io.fabric8.kub