Do you have dynamic resource allocation enabled?

On Wednesday, July 17, 2019, zenglong chen <czlong.kel...@gmail.com> wrote:

> Hi,all,
>         My standalone mode has two slaves.When I submit my job,the
> localhost slave is working well,but second slave do add and remove executor
> action always!The log are below:
>        2019-07-17 10:51:38,889 INFO client.StandaloneAppClient$ClientEndpoint:
> Executor updated: app-20190717105135-0008/2 is now EXITED (Command exited
> with code 1)
> 2019-07-17 10:51:38,890 INFO cluster.StandaloneSchedulerBackend: Executor
> app-20190717105135-0008/2 removed: Command exited with code 1
> 2019-07-17 10:51:38,890 INFO storage.BlockManagerMasterEndpoint: Trying
> to remove executor 2 from BlockManagerMaster.
> 2019-07-17 10:51:38,890 INFO storage.BlockManagerMaster: Removal of
> executor 2 requested
> 2019-07-17 10:51:38,891 INFO 
> cluster.CoarseGrainedSchedulerBackend$DriverEndpoint:
> Asked to remove non-existent executor 2
> 2019-07-17 10:51:38,892 INFO client.StandaloneAppClient$ClientEndpoint:
> Executor added: app-20190717105135-0008/3 on 
> worker-20190717093045-172.22.9.179-40573
> (172.22.9.179:40573) with 8 core(s)
> 2019-07-17 10:51:38,892 INFO cluster.StandaloneSchedulerBackend: Granted
> executor ID app-20190717105135-0008/3 on hostPort 172.22.9.179:40573 with
> 8 core(s), 12.0 GB RAM
> 2019-07-17 10:51:38,893 INFO client.StandaloneAppClient$ClientEndpoint:
> Executor updated: app-20190717105135-0008/3 is now RUNNING
> 2019-07-17 10:51:40,521 INFO client.StandaloneAppClient$ClientEndpoint:
> Executor updated: app-20190717105135-0008/3 is now EXITED (Command exited
> with code 1)
> 2019-07-17 10:51:40,521 INFO cluster.StandaloneSchedulerBackend: Executor
> app-20190717105135-0008/3 removed: Command exited with code 1
> 2019-07-17 10:51:40,521 INFO client.StandaloneAppClient$ClientEndpoint:
> Executor added: app-20190717105135-0008/4 on 
> worker-20190717093045-172.22.9.179-40573
> (172.22.9.179:40573) with 8 core(s)
> 2019-07-17 10:51:40,521 INFO storage.BlockManagerMaster: Removal of
> executor 3 requested
> 2019-07-17 10:51:40,521 INFO 
> cluster.CoarseGrainedSchedulerBackend$DriverEndpoint:
> Asked to remove non-existent executor 3
> 2019-07-17 10:51:40,521 INFO storage.BlockManagerMasterEndpoint: Trying
> to remove executor 3 from BlockManagerMaster.
> 2019-07-17 10:51:40,521 INFO cluster.StandaloneSchedulerBackend: Granted
> executor ID app-20190717105135-0008/4 on hostPort 172.22.9.179:40573 with
> 8 core(s), 12.0 GB RAM
> 2019-07-17 10:51:40,523 INFO client.StandaloneAppClient$ClientEndpoint:
> Executor updated: app-20190717105135-0008/4 is now RUNNING
>
>
> And the slave output are below:
>        19/07/17 10:47:12 INFO ExecutorRunner: Launch command:
> "/home/ubuntu/data/jdk/jre/bin/java" "-cp" "/home/ubuntu/spark-2.4.3/
> conf/:/home/ubuntu/spark-2.4.3/jars/*" "-Xmx12288M"
> "-Dspark.driver.port=40335" 
> "org.apache.spark.executor.CoarseGrainedExecutorBackend"
> "--driver-url" "spark://CoarseGrainedScheduler@iZk1a7vdbutmi6eluaskecZ:40335"
> "--executor-id" "18" "--hostname" "172.22.9.179" "--cores" "8" "--app-id"
> "app-20190717104645-0007" "--worker-url" "spark://Worker@172.22.9.179:
> 40573"
> 19/07/17 10:47:13 INFO Worker: Executor app-20190717104645-0007/18
> finished with state EXITED message Command exited with code 1 exitStatus 1
> 19/07/17 10:47:13 INFO ExternalShuffleBlockResolver: Clean up non-shuffle
> files associated with the finished executor 18
> 19/07/17 10:47:13 INFO ExternalShuffleBlockResolver: Executor is not
> registered (appId=app-20190717104645-0007, execId=18)
> 19/07/17 10:47:13 INFO Worker: Asked to launch executor
> app-20190717104645-0007/19 for ph_user_pre_level
> 19/07/17 10:47:13 INFO SecurityManager: Changing view acls to: ubuntu
> 19/07/17 10:47:13 INFO SecurityManager: Changing modify acls to: ubuntu
> 19/07/17 10:47:13 INFO SecurityManager: Changing view acls groups to:
> 19/07/17 10:47:13 INFO SecurityManager: Changing modify acls groups to:
> 19/07/17 10:47:13 INFO SecurityManager: SecurityManager: authentication
> disabled; ui acls disabled; users  with view permissions: Set(ubuntu);
> groups with view permissions: Set(); users  with modify permissions:
> Set(ubuntu); groups with modify permissions: Set()
> 19/07/17 10:47:14 INFO ExecutorRunner: Launch command:
> "/home/ubuntu/data/jdk/jre/bin/java" "-cp" "/home/ubuntu/spark-2.4.3/
> conf/:/home/ubuntu/spark-2.4.3/jars/*" "-Xmx12288M"
> "-Dspark.driver.port=40335" 
> "org.apache.spark.executor.CoarseGrainedExecutorBackend"
> "--driver-url" "spark://CoarseGrainedScheduler@iZk1a7vdbutmi6eluaskecZ:40335"
> "--executor-id" "19" "--hostname" "172.22.9.179" "--cores" "8" "--app-id"
> "app-20190717104645-0007" "--worker-url" "spark://Worker@172.22.9.179:
> 40573"
>
> I guest that  may be  "Dspark.driver.port=40335" problem.
> Any suggests will help me a lot!
>

Reply via email to