1) ValueState can only return a non-null value if a value for the same key (in your case, "x.id") has previously been stored via update(). Have you double-checked that this is the case?
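For illustration, a minimal sketch of that behaviour (hypothetical class and state names, to be applied after a keyBy): value() returns null for the first element of a key and only returns a stored value for later elements of the same key.

import org.apache.flink.api.common.functions.RichFlatMapFunction;
import org.apache.flink.api.common.state.ValueState;
import org.apache.flink.api.common.state.ValueStateDescriptor;
import org.apache.flink.configuration.Configuration;
import org.apache.flink.util.Collector;

public class FirstSeenFn extends RichFlatMapFunction<String, String> {

    private transient ValueState<Boolean> seen;

    @Override
    public void open(Configuration config) {
        seen = getRuntimeContext().getState(
                new ValueStateDescriptor<>("seen", Boolean.class));
    }

    @Override
    public void flatMap(String value, Collector<String> out) throws Exception {
        if (seen.value() == null) { // first element for this key
            seen.update(true);
            out.collect(value);
        }
        // any later element with the same key reads a non-null state here
    }
}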

2) Checkpointing does not alleviate the need to restart all operators; it alleviates having to reprocess all data. It is expected that the entire pipeline is restarted, not just the failed operator. In a nutshell this happens because:
a) downstream operators would otherwise receive duplicate messages (all messages from the last checkpoint until the operator failure), and
b) upstream operators need to reproduce the data to process.
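
For completeness, a minimal sketch of the configuration this recovery model assumes (the interval and restart values are arbitrary):

import org.apache.flink.api.common.restartstrategy.RestartStrategies;
import org.apache.flink.api.common.time.Time;
import org.apache.flink.streaming.api.CheckpointingMode;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
// take a checkpoint every 10s; on any task failure the whole job is reset
// to the latest completed checkpoint and the sources rewind to its offsets
env.enableCheckpointing(10_000);
env.getCheckpointConfig().setCheckpointingMode(CheckpointingMode.EXACTLY_ONCE);
// all operators are restarted together, up to 3 times, 10s apart
env.setRestartStrategy(RestartStrategies.fixedDelayRestart(3, Time.seconds(10)));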

On 11/07/2021 00:35, Marzi K wrote:
Hi All,

I have a simple POC project to understand Flink state management and integration with Kafka. Thanks in advance for helping me understand and resolve the following issues.

I have a FlinkKafkaConsumer that reads payloads written by a separate FlinkKafkaProducer. The final 3 operators in my pipeline are keyed and stateful, saving the content of the passed payload at 3 different stages. To verify checkpointing and the correctness of the ValueStates in my code, I kill one TaskManager manually and, as expected, the job restarts on a second TM.
But I have noticed the following:

(1) When I print out valueState.value(), it always shows as null, even though I can see some files and _metadata being saved in the checkpointing dir. I suspected it might be due to Kryo serialization, which kicks in because JsonNode.class is processed as a generic type. So I changed my state object to a POJO, but I am still getting null.
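
(For reference, my understanding is that Flink only avoids Kryo when it can analyse the state class as a POJO: a public class with a public no-arg constructor and public or getter/setter-accessible fields. A rough sketch of the kind of class I tried, with made-up field names:)

// POJO that Flink's type extractor should accept without falling back to Kryo:
// public class, public no-arg constructor, public fields
public class PayloadState {

    public String id;
    public String rawJson;

    public PayloadState() {} // required no-arg constructor

    public PayloadState(String id, String rawJson) {
        this.id = id;
        this.rawJson = rawJson;
    }
}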

(2) When the job restarts, a few of the payloads start from the very beginning of the pipeline and keep repeating over and over. I noticed this happens because one of the intermediate operators (the S3 upload) starts failing after the restart on the second TM. This raised the question: when one operator fails, shouldn't the failed message only retry the failed operator rather than restart from the beginning of the pipeline? If so, does this further suggest that the operator checkpoints are not happening properly and the message therefore has to start from the very beginning of the pipeline?

Including some semi-pseudo-code for reference:

_In Main class:_

//operators methodA and methodB don't have ValueState
DataStream<JsonNode> payloadStream =
  env.addSource(kafkaConsumer)
    .map((payload) -> methodA(payload))
    .map((payload) -> methodB(payload))
    .keyBy((payload) -> payload.get("x").get("id").asText());

DataStream<Tuple2<JsonNode, JsonNode>> responseStream =
  payloadStream
    .flatMap(new RichFlatMapA()).name("Post Payload1").uid("Post Payload1")
    .keyBy((jsonNodeTuple) -> jsonNodeTuple.f0.get("x").get("id").asText())
    .flatMap(new RichFlatMapB()).name("S3 upload").uid("S3 upload")
    .keyBy((jsonNodeTuple) -> jsonNodeTuple.f0.get("x").get("id").asText())
    .flatMap(new RichFlatMapC()).name("Post Payload2").uid("Post Payload2");
_In one of the operators, where I'd like to make the payload JSON stateful:_

import org.apache.flink.api.common.functions.RichFlatMapFunction;
import org.apache.flink.api.common.state.ValueState;
import org.apache.flink.api.common.state.ValueStateDescriptor;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.configuration.Configuration;
import org.apache.flink.util.Collector;
import com.fasterxml.jackson.databind.JsonNode;

public class RichFlatMapA extends RichFlatMapFunction<JsonNode, Tuple2<JsonNode, JsonNode>> {

    private transient ValueState<JsonNode> payload;

    @Override
    public void open(Configuration config) {
        // the descriptor takes the state name and the type in one constructor
        payload = getRuntimeContext().getState(
                new ValueStateDescriptor<>("saved payload", JsonNode.class));
    }

    @Override
    public void flatMap(JsonNode jsonNode, Collector<Tuple2<JsonNode, JsonNode>> collector) throws Exception {
        JsonNode previous = payload.value(); // this is what always comes back null

        if (previous != null) {
            payload.clear();
        } else {
            payload.update(jsonNode); // store the incoming payload, not the (null) local copy
        }
        httpPost(jsonNode, collector);
    }
}

Thank you,
Marzi

