Hi,
I managed to make mine work using the *foreachBatch* function in writeStream.
"foreach" performs custom write logic on each row and "foreachBatch"
performs custom write logic on each micro-batch, here through the SendToBigQuery function. foreachBatch(SendToBigQuery) expects a function taking 2 parameters: first, the micro-batch DataFrame, and second, the unique id of that micro-batch.
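A minimal sketch of how that wiring can look, assuming the Google spark-bigquery-connector is on the classpath; the table name, the temporary GCS bucket, and the "rate" source are placeholders, not the real job:

from pyspark.sql import SparkSession, DataFrame

def SendToBigQuery(df: DataFrame, batch_id: int) -> None:
    # Each micro-batch arrives as an ordinary DataFrame, so the normal
    # batch writer applies; table and bucket names are placeholders.
    (df.write
       .format("bigquery")
       .option("table", "my_dataset.my_table")
       .option("temporaryGcsBucket", "my-temp-bucket")
       .mode("append")
       .save())

spark = SparkSession.builder.appName("send_to_bigquery").getOrCreate()

# Stand-in streaming source; replace with the real readStream definition.
streaming_df = spark.readStream.format("rate").load()

query = (streaming_df.writeStream
         .foreachBatch(SendToBigQuery)
         .start())

query.awaitTermination()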
Thanks Jungtaek.
I am stuck on how to add rows to BigQuery. The batch Spark API in PySpark does it fine; however, here we are talking about Structured Streaming with PySpark. This is my code, which reads and displays the data on the console fine:
class MDStreaming:
    def __init__(self, spark_session, spark_context):
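(The snippet is cut off above. A minimal runnable sketch of such a console-printing job, with the "rate" source and the option values only as stand-ins for the real input, might look like:)

from pyspark.sql import SparkSession

class MDStreaming:
    def __init__(self, spark_session, spark_context):
        self.spark = spark_session
        self.sc = spark_context

    def start(self):
        # "rate" is only a built-in demo source standing in for the real feed.
        df = self.spark.readStream.format("rate").option("rowsPerSecond", 1).load()
        return (df.writeStream
                  .outputMode("append")
                  .format("console")
                  .start())

if __name__ == "__main__":
    spark = SparkSession.builder.appName("MDStreaming").getOrCreate()
    job = MDStreaming(spark, spark.sparkContext)
    job.start().awaitTermination()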
If your code doesn't require "end-to-end exactly-once", then you could leverage foreachBatch, which enables you to use a batch sink.
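For example, a tiny sketch of that pattern with a plain Parquet batch sink (the path and the source are placeholders):

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("foreach_batch_demo").getOrCreate()

# Any streaming DataFrame works; "rate" is just a built-in demo source.
events = spark.readStream.format("rate").load()

def write_batch(df, batch_id):
    # Inside foreachBatch the micro-batch is an ordinary DataFrame,
    # so a regular batch sink (Parquet here) can be reused unchanged.
    df.write.mode("append").parquet("/tmp/foreach_batch_demo")

query = events.writeStream.foreachBatch(write_batch).start()
query.awaitTermination()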
If your code requires "end-to-end exactly-once", then well, that's a different story. I'm not familiar with BigQuery and have no idea how its sink is implemented, but