Re: task always lost

2014-07-03 Thread Aaron Davidson
The issue you're seeing is not the same as the one you linked to -- your serialized task sizes are very small, and Mesos fine-grained mode doesn't use Akka anyway. The error log you printed seems to be from some sort of Mesos logs, but do you happen to have the logs from the actual executors thems

Re: task always lost

2014-07-02 Thread qingyang li
The executor is always being removed. Someone encountered the same issue: https://groups.google.com/forum/#!topic/spark-users/-mYn6BF-Y5Y
14/07/02 17:41:16 INFO storage.BlockManagerMasterActor: Trying to remove executor 20140616-104524-1694607552-5050-26919-1 from BlockManagerMaster.
14/07/02 17

Re: task always lost

2014-07-01 Thread qingyang li
There is also this one in the warning log:
E0702 11:35:08.869998 17840 slave.cpp:2310] Container 'af557235-2d5f-4062-aaf3-a747cb3cd0d1' for executor '20140616-104524-1694607552-5050-26919-1' of framework '20140702-113428-1694607552-5050-17766-' failed to start: Failed to fetch URIs for container 'af557235-

Re: task always lost

2014-07-01 Thread qingyang li
Here is the log:
E0702 10:32:07.599364 14915 slave.cpp:2686] Failed to unmonitor container for executor 20140616-104524-1694607552-5050-26919-1 of framework 20140702-102939-1694607552-5050-14846-: Not monitored
2014-07-02 1:45 GMT+08:00 Aaron Davidson:
> Can you post the logs from any of t

Re: task always lost

2014-07-01 Thread Aaron Davidson
Can you post the logs from any of the dying executors?
On Tue, Jul 1, 2014 at 1:25 AM, qingyang li wrote:
> i am using mesos0.19 and spark0.9.0 , the mesos cluster is started, when I
> using spark-shell to submit one job, the tasks always lost. here is the
> log:
> --
> 14/07/01 16:24