Re: Spark streaming error - Query terminated with exception: assertion failed: Invalid batch: a#660,b#661L,c#662,d#663,,… 26 more fields != b#1291L

Shixiong(Ryan) Zhu Mon, 01 Apr 2019 09:28:08 -0700

Could you try to use $”a” rather than df(“a”)? The latter one sometimes
doesn’t work.


On Thu, Mar 21, 2019 at 10:41 AM kineret M <kiner...@gmail.com> wrote:

> I try to read a stream using my custom data source (v2, using spark 2.3),
> and it fails *in the second iteration* with the following exception while
> reading prune columns:Query [id=xxx, runId=yyy] terminated with
> exception: assertion failed: Invalid batch: a#660,b#661L,c#662,d#663,,...
> 26 more fields != b#1291L
>
> Datafream creation:
>
> val df = sparkSession.readStream.format("myV2Source").load("/")
> val df1 = df.filter(df("a") >= "-1").select("b")
>
> Stream execution:
>
>       val streamingQuery = df1
>         .writeStream
>         .format("console")
>         .trigger(Trigger.ProcessingTime("10 seconds"))
>         .outputMode(OutputMode.Append())
>         .start()
>
>     streamingQuery.awaitTermination()
>
> if I remove the select (i.e. val df1 = df.filter(df("a") >= "-1")), it
> works fine.
>
> Any idea why?
>
-- 

Best Regards,
Ryan

Re: Spark streaming error - Query terminated with exception: assertion failed: Invalid batch: a#660,b#661L,c#662,d#663,,… 26 more fields != b#1291L

Reply via email to