Hi all! I would like to start a discussion on FLIP-384: Introduce TraceReporter and use it to create checkpointing and recovery traces [1].
This proposal intends to improve observability of Flink's Checkpointing and Recovery/Initialization operations, by adding support for reporting traces from Flink. In the future, reporting traces can be of course used for other use cases and also by users. There are also two other follow up FLIPS, FLIP-385 [2] and FLIP-386 [3], which expand the basic functionality introduced in FLIP-384 [1]. Please let me know what you think! Best, Piotr Nowojski [1] https://cwiki.apache.org/confluence/display/FLINK/FLIP-384%3A+Introduce+TraceReporter+and+use+it+to+create+checkpointing+and+recovery+traces [2] https://cwiki.apache.org/confluence/display/FLINK/FLIP-385%3A+Add+OpenTelemetryTraceReporter+and+OpenTelemetryMetricReporter [3] https://cwiki.apache.org/confluence/display/FLINK/FLIP-386%3A+Support+adding+custom+metrics+in+Recovery+Spans