Re: dynamic executor scalling spark on kubernetes client mode

2020-05-12 Thread Steven Stetzler
the conversation but are keen where I can find >> information regarding dynamic allocation on kubernetes. As far as I know >> the docs just point to future work. >> >> Thanks a lot, >> Roland >> >> >> >> Am 12.05.2020 um 09:25 schrieb Steven Stet

Re: dynamic executor scalling spark on kubernetes client mode

2020-05-12 Thread Steven Stetzler
Hi all, I am interested in this as well. My use-case could benefit from dynamic executor scaling but we are restricted to using client mode since we are only using Spark shells. Could anyone help me understand the barriers to getting dynamic executor scaling to work in client mode on Kubernetes?

Re: Fitting only the intercept for LinearRegression

2020-03-21 Thread Steven Stetzler
Hi Eugen, You should be able to do this without the LinearRegression API. I believe for a linear regression model ( https://en.wikipedia.org/wiki/Simple_linear_regression) [image: image.png] the best estimator for the intercept will be [image: image.png] where \overline{y} is the average of the ta

Re: Can't get Spark to interface with S3A Filesystem with correct credentials

2020-03-04 Thread Steven Stetzler
To successfully read from S3 using s3a, I've had to also set ``` spark.hadoop.fs.s3a.impl=org.apache.hadoop.fs.s3a.S3AFileSystem ``` in addition to `spark.hadoop.fs.s3a.access.key` and `spark.hadoop.fs.s3a.secret.key`. I've also needed to ensure Spark has access to the AWS SDK jar. I have downloade

Spark on k8s: Mount config map in executor

2019-08-26 Thread Steven Stetzler
insight the userbase can offer for this issue. Thanks, Steven Stetzler

Re: [Beginner] Run compute on large matrices and return the result in seconds?

2019-07-11 Thread Steven Stetzler
Hi Gautham, I am a beginner spark user too and I may not have a complete understanding of your question, but I thought I would start a discussion anyway. Have you looked into using Spark's built in Correlation function? ( https://spark.apache.org/docs/latest/ml-statistics.html) This might let you

Re: Problem running Spark on Kubernetes: Certificate error

2018-12-20 Thread Steven Stetzler
t > spark.kubernetes.authenticate.driver.caCertFile > to the path of your CA certificate on your local disk, spark-submit will > create a secret that contains that certificate file and use that > certificate to configure TLS for the driver pod’s communication with the > API server.

Problem running Spark on Kubernetes: Certificate error

2018-12-13 Thread Steven Stetzler
Hello, I am following the tutorial here ( https://spark.apache.org/docs/latest/running-on-kubernetes.html) to get spark running on a Kubernetes cluster. My Kubernetes cluster is hosted with Digital Ocean's kubernetes cluster manager. I have change the KUBECONFIG environment variable to point to my