Hi everyone,

We would like to start a discussion thread on "FLIP-118: Improve
Flink’s ID system"[1].

This FLIP mainly discusses the following issues, target to enhance the
readability of IDs in log and help user to debug in case of failures:

- Enhance the readability of the string literals of IDs. Most of them
are hashcodes, e.g. ExecutionAttemptID, which do not provide much
meaningful information and are hard to recognize and compare for
users.
- Log the ID’s lineage information to make debugging more convenient.
Currently, the log fails to always show the lineage information
between IDs. Finding out relationships between entities identified by
given IDs is a common demand, e.g., slot of which AllocationID is
assigned to satisfy slot request of with SlotRequestID. Absence of
such lineage information, it’s impossible to track the end to end
lifecycle of an Execution or a Task now, which makes debugging
difficult.

Key changes proposed in the FLIP are as follows:

- Add location information to distributed components
- Add topology information to graph components
- Log the ID’s lineage information
- Expose the identifier of distributing component to user

Please find more details in the FLIP wiki document [1]. Looking forward to
your feedbacks.

[1] https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=148643521

Best,
Yangze Guo

Reply via email to