Hey all, I was wondering if anyone could point me where to start debugging the following error:
ERROR Dropping SparkListenerEvent because no remaining room in event queue. This likely means one of the SparkListeners is too slow and cannot keep up with the rate at which tasks are being started by the scheduler. This happens when I call the fit method of Count Vectorizer on a fairly small dataset (< 20 GB). Running on a cluster with 5 nodes (c3.8xlarge), Spark 2.1, and Hadoop 2.7. If there is anything else that would be helpful to know just let me know and I can include it. Best, Patrick