RE: Spark SQL step with many tasks takes a long time to begin processing

2016-02-16 Thread Dukek, Dillon
360-316-9309 Email: dillon.du...@t-mobile.com

From: Teng Qiu [mailto:teng...@gmail.com]
Sent: Tuesday, February 16, 2016 12:11 PM
To: Dukek, Dillon
Cc: user@spark.apache.org
Subject: Re: Spark SQL step with many tasks takes a long time to begin processing

I believe this is a known issue for u

Spark SQL step with many tasks takes a long time to begin processing

2016-02-16 Thread Dukek, Dillon
Hello, I have been working on a project that allows a BI tool to query roughly 25 TB of application event data from 2015 using the Thrift server and Spark SQL. In general, the jobs that are submitted have a step that submits many tasks, on the order of hundreds of thousands, and is equal to the num