Exceeded Checkpoint tolerable failure

2022-12-07 Thread Madan D via user
Hello All, I am seeing below issue after I upgraded from 1.9.0 to 1.14.2 while publishing messages to pub sub which is causing frequent job restart and slow processing. Can you please help me. `Caused by: org.apache.flink.util.FlinkRuntimeException: Exceeded checkpoint tolerable failure thresho

Re: How to set disableChaining like streaming multiple INSERT statements in a StatementSet ?

2022-12-07 Thread yuxia
Could you please post the image of the running job graph in Flink UI? Best regards, Yuxia 发件人: "hjw" 收件人: "User" 发送时间: 星期四, 2022年 12 月 08日 上午 12:05:00 主题: How to set disableChaining like streaming multiple INSERT statements in a StatementSet ? Hi, I create a StatementSet that contain

Deterministic co-results

2022-12-07 Thread Salva Alcántara
It's well-known that Flink does not provide any guarantees on the order in which a CoProcessFunction (or one of its multiple variations) processes its two inputs [1]. I wonder then what is the current best practice/recommended approach for cases where one needs deterministic results in presence of:

Re: Accessing kafka message key from a KafkaSource

2022-12-07 Thread Yaroslav Tkachenko
Hi Noel, It's definitely possible. You need to implement a custom KafkaRecordDeserializationSchema: its "deserialize" method gives you a ConsumerRecord as an argument so that you can extract Kafka message key, headers, timestamp, etc. Then pass that when you create a KafkaSource via "setDeseriali

SNI issue

2022-12-07 Thread Jean-Damien Hatzenbuhler via user
Hello, When using the job manager API with an https proxy that uses SNI in front to route the traffic, I get an issue because the flink cli doesn't use the SNI when calling in https the API. Did other user face this issue ? Regards

How to set disableChaining like streaming multiple INSERT statements in a StatementSet ?

2022-12-07 Thread hjw
Hi, I create a StatementSet that contains multiple INSERT statements. I found that multiple INSERT tasks will be organized in a operator chain when StatementSet.execute() is invoked. How to set disableChaining like streaming multiple INSERT statements in a StatementSet api ? env: Fli

Re: ZLIB Vulnerability Exposure in Flink statebackend RocksDB

2022-12-07 Thread Yanfei Lei
Hi Vidya Sagar, Thanks for bringing this up. The RocksDB state backend defaults to Snappy[1]. If the compression option is not specifically configured, this vulnerability of ZLIB has no effect on the Flink application for the time being. *> is there any plan in the coming days to address this? *

Detecting when a job is "caught up"

2022-12-07 Thread Kurtis Walker via user
I’ve got a Flink job that uses a HybridSource. It reads a bunch of S3 files and then transitions to a Kafka Topic where it stays processing live data. Everything gets written to a warehouse where users build and run reports. In takes about 36 hours to read data from the beginning before it’s

Re: Cleanup for high-availability.storageDir

2022-12-07 Thread Alexis Sarda-Espinosa
I see, thanks for the details. I do mean replacing the job without stopping it terminally. Specifically, I mean updating the container image with one that contains an updated job jar. Naturally, the new version must not break state compatibility, but as long as that is fulfilled, the job should be

Re: Cleanup for high-availability.storageDir

2022-12-07 Thread Alexis Sarda-Espinosa
Hi Matthias, Then the explanation is likely that the job has not reached a terminal state. I was testing updates *without* savepoints (but with HA), so I guess that never triggers automatic cleanup. Since, in my case, the job will theoretically never reach a terminal state with this configuration

Accessing kafka message key from a KafkaSource

2022-12-07 Thread Noel OConnor
Hi, I'm using a kafka source to read in messages from kafka into a datastream. However I can't seem to access the key of the kafka message in the datastream. Is this even possible ? cheers Noel