I think you should see some other errors before that, from NettyBlockTransferService, with a message like "Exception while beginning fetchBlocks". There might be a bit more information there. There is an assortment of possible causes, but first let's just make sure you have all the details from the original cause.
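If the logs are large, a small filter can help pull out those earlier errors. The following is only a rough sketch: the function name `find_fetch_errors` and the `context` parameter are made up for illustration, and it assumes the aggregated logs have already been saved to a text file (for example with `yarn logs -applicationId <appId> > app.log`). It matches on the class name and message mentioned above and keeps a few following lines, which is usually where the stack trace with the underlying cause appears.

```python
from typing import Iterable, List

# Substrings taken from the messages mentioned in this thread.
PATTERNS = [
    "NettyBlockTransferService",
    "Exception while beginning fetchBlocks",
]


def find_fetch_errors(lines: Iterable[str], context: int = 5) -> List[str]:
    """Return each matching line plus up to `context` lines that follow it.

    The following lines are kept because a stack trace normally comes
    right after the ERROR line. (Hypothetical helper, not part of Spark.)
    """
    out: List[str] = []
    remaining = 0  # how many trailing lines we still want to keep
    for raw in lines:
        line = raw.rstrip("\n")
        if any(p in line for p in PATTERNS):
            out.append(line)
            remaining = context
        elif remaining > 0:
            out.append(line)
            remaining -= 1
    return out
```

Something like `print("\n".join(find_fetch_errors(open("app.log"))))` would then show only the relevant parts of the log; a plain `grep -A 5 NettyBlockTransferService app.log` does much the same job.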
On Fri, Mar 20, 2015 at 8:49 AM, Eric Friedman <eric.d.fried...@gmail.com> wrote:
> My job crashes with a bunch of these messages in the YARN logs.
>
> What are the appropriate steps in troubleshooting?
>
> 15/03/19 23:29:45 ERROR shuffle.RetryingBlockFetcher: Exception while beginning fetch of 10 outstanding blocks (after 3 retries)
>
> 15/03/19 23:29:45 ERROR storage.ShuffleBlockFetcherIterator: Failed to get block(s) from <host>:<port>