Hey all,

I was wondering if anyone could point me to where I should start
debugging the following error:

ERROR Dropping SparkListenerEvent because no remaining room in event
queue. This likely means one of the SparkListeners is too slow and
cannot keep up with the rate at which tasks are being started by the
scheduler.
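
As I read the message, events go into a fixed-size queue and are discarded
when a listener drains it more slowly than the scheduler fills it. Below is
a toy Python sketch of that behavior, just to make the failure mode
concrete. It is not Spark's code: BoundedEventQueue, simulate, and all the
numbers are made up for illustration.

```python
from collections import deque


class BoundedEventQueue:
    """Toy stand-in for a listener bus: fixed capacity, drops when full."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.events = deque()
        self.dropped = 0

    def post(self, event):
        # Producer side: never blocks; a full queue means the event is lost.
        if len(self.events) >= self.capacity:
            self.dropped += 1
            return False
        self.events.append(event)
        return True

    def drain(self, max_events):
        # Consumer side: a slow listener handles only a few events per turn.
        handled = 0
        while self.events and handled < max_events:
            self.events.popleft()
            handled += 1
        return handled


def simulate(capacity, produced_per_tick, consumed_per_tick, ticks):
    """Return how many events were dropped over the given number of ticks."""
    queue = BoundedEventQueue(capacity)
    for tick in range(ticks):
        for i in range(produced_per_tick):
            queue.post((tick, i))
        queue.drain(consumed_per_tick)
    return queue.dropped


if __name__ == "__main__":
    # Consumer keeps up: nothing should be dropped.
    print(simulate(capacity=10, produced_per_tick=5, consumed_per_tick=5, ticks=100))
    # Consumer falls behind: drops begin once the queue fills.
    print(simulate(capacity=10, produced_per_tick=5, consumed_per_tick=2, ticks=10))
```

In this sketch, drops go away if either the capacity is raised or the
consumer is made faster. Whether that maps onto my job is exactly what I am
unsure about.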

This happens when I call the fit method of CountVectorizer on a fairly
small dataset (< 20 GB).

I'm running on a 5-node cluster (c3.8xlarge) with Spark 2.1 and Hadoop 2.7.
If any other details would be helpful, let me know and I can include them.

Best,
Patrick
