The SQL plan of each micro-batch in the Spark UI (SQL tab) has links to the
actual Spark jobs that ran in the micro-batch. From that you can drill down
into the stage information. I agree that it's not there as a nice per-stream
table like in the Streaming tab, but all the information is present if you
dig for it.
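
For the per-batch numbers that the SQL tab does not summarize, here is a rough sketch of pulling them out of the progress report that StreamingQuery.lastProgress returns (mentioned further down in this thread). The field names (batchId, numInputRows, inputRowsPerSecond, processedRowsPerSecond, durationMs) follow the progress JSON shown in the Structured Streaming guide. The helper name summarize_progress, the fallingBehind flag, and the sample values are made up for illustration and are not part of Spark.

```python
from typing import Any, Dict


def summarize_progress(progress: Dict[str, Any]) -> Dict[str, Any]:
    """Pick out the per-batch numbers most useful for spotting a slow batch."""
    durations = progress.get("durationMs", {})
    input_rate = progress.get("inputRowsPerSecond", 0.0)
    processed_rate = progress.get("processedRowsPerSecond", 0.0)
    return {
        "batchId": progress.get("batchId"),
        "numInputRows": progress.get("numInputRows", 0),
        "inputRowsPerSecond": input_rate,
        "processedRowsPerSecond": processed_rate,
        # Total wall-clock time of the trigger, in milliseconds.
        "triggerExecutionMs": durations.get("triggerExecution"),
        # Time spent actually running the micro-batch, in milliseconds.
        "addBatchMs": durations.get("addBatch"),
        # Heuristic only (an assumption, not a Spark metric): data is
        # arriving faster than the query is processing it.
        "fallingBehind": input_rate > processed_rate,
    }


# Hand-written sample shaped like a progress report; the values are invented.
sample = {
    "batchId": 42,
    "numInputRows": 6000,
    "inputRowsPerSecond": 1200.0,
    "processedRowsPerSecond": 800.0,
    "durationMs": {"addBatch": 4500, "getBatch": 20, "triggerExecution": 5000},
}

summary = summarize_progress(sample)
print(summary)
```

With a running PySpark query you would, assuming lastProgress gives you a dict (it is None until the first batch completes), call summarize_progress(query.lastProgress). If addBatchMs dominates, the SQL tab entry for that batch is the place to drill into the jobs and stages.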

On Wed, Jun 27, 2018 at 4:14 PM, swetha kasireddy <swethakasire...@gmail.com> wrote:

> Thanks TD, but the SQL plan does not seem to provide any information on
> which stage is taking longer, or help identify bottlenecks in the
> various stages. Spark Kafka Direct used to provide information about the
> various stages in a micro-batch and the time taken by each stage. Is there
> a way to find stage-level information such as the time taken by each stage,
> shuffle read/write data, etc.? Do you have any documentation on how to use
> the SQL tab for troubleshooting?
>
> On Wed, Jun 20, 2018 at 6:07 PM, Tathagata Das <tathagata.das1...@gmail.com> wrote:
>
>> Also, you can get information about the last progress made (input rates,
>> etc.) from StreamingQuery.lastProgress, StreamingQuery.recentProgress, and
>> by using a StreamingQueryListener.
>> It's all documented:
>> https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html#monitoring-streaming-queries
>>
>> On Wed, Jun 20, 2018 at 6:06 PM, Tathagata Das <tathagata.das1...@gmail.com> wrote:
>>
>>> Structured Streaming does not maintain a queue of batches like DStreams.
>>> DStreams used to cut off batches at a fixed interval and put them in a
>>> queue, and a different thread processed the queued batches. In contrast,
>>> Structured Streaming simply cuts off and immediately processes a batch
>>> after the previous batch finishes. So the question about queue size and
>>> lag does not apply to Structured Streaming.
>>>
>>> That said, there is no UI for Structured Streaming. You can see the SQL
>>> plans for each micro-batch in the SQL tab.
>>>
>>> On Wed, Jun 20, 2018 at 12:12 PM, SRK <swethakasire...@gmail.com> wrote:
>>>
>>>> Hi,
>>>>
>>>> How do we get information like lag and queued-up batches in Structured
>>>> Streaming? The following API does not seem to give any information about
>>>> lag and queued-up batches the way DStreams did.
>>>>
>>>> https://spark.apache.org/docs/2.2.1/api/java/org/apache/spark/streaming/scheduler/BatchInfo.html
>>>>
>>>> Thanks!
