---------- Forwarded message ---------- From: Angel Angel <areyouange...@gmail.com> Date: Wed, Sep 23, 2015 at 12:24 PM Subject: Executor lost To: user@spark.apache.org
Hello Sir/Madam, I am running the deeplearning example on spark. i have the following configuration 1 Master and 3 slaves My driver program setting is: public static void main(String[] args) { SparkConf conf = new SparkConf().setMaster("spark://hadoopm0:7077")//.set("spark.akka.framSize", "67108864") .setAppName("Mnist Classification Pipeline (Java)") .set("spark.driver.memory", "6g") .set("spark.executor.memory", "6g") .set("spark.akka.timeout", "200000ms"); SparkContext jsc = new SparkContext(conf); SQLContext jsql = new SQLContext(jsc); String imagesPath = "file:///root/Downloads/Database/images-idx1-ubyte"; String labelsPath = "file:///root/Downloads/Database/labels-idx1-ubyte"; Map<String, String> params = new HashMap<String, String>(); params.put("imagesPath", imagesPath); params.put("labelsPath", labelsPath); params.put("recordsPerPartition", "1500");// params.put("maxRecords", "6000");// number of samples However when i run this example i get errors like: *ERROR TaskSchedulerImpl: Lost executor 1 on 165.132.45.46 <http://165.132.45.46>: worker lost* *15/09/23 12:15:31 WARN TaskSetManager: Lost task 0.0 in stage 1.0 (TID 4, 165.132.45.46): ExecutorLostFailure (executor 1 lost)* *15/09/23 12:15:31 WARN TaskSetManager: Lost task 3.0 in stage 1.0 (TID 7, 165.132.45.46): ExecutorLostFailure (executor 1 lost)* How should i resolve this error. thanks in advance. Also error in detail [image: Inline image 2]