Is there any way to set the socket timeout on the underlying AWS client
for the Kinesis requests made by spark-streaming-kinesis-asl?

Currently, after a driver restart we get socket timeouts, which appear
to default to about 120 seconds, and they cause a large backlog. We'd
like to shorten the timeout to 10 seconds, or to something closer to the
batch time.

Also, the Spark job should skip the mapToPair and transform stages on
batches that have 0 events, but when the driver is restarted all stages
are executed anyway, which causes these intermittent socket timeouts.

Thanks,
Heji
