Hi everyone, We would like to start a discussion thread on "FLIP-118: Improve Flink’s ID system"[1].
This FLIP mainly discusses the following issues, target to enhance the readability of IDs in log and help user to debug in case of failures: - Enhance the readability of the string literals of IDs. Most of them are hashcodes, e.g. ExecutionAttemptID, which do not provide much meaningful information and are hard to recognize and compare for users. - Log the ID’s lineage information to make debugging more convenient. Currently, the log fails to always show the lineage information between IDs. Finding out relationships between entities identified by given IDs is a common demand, e.g., slot of which AllocationID is assigned to satisfy slot request of with SlotRequestID. Absence of such lineage information, it’s impossible to track the end to end lifecycle of an Execution or a Task now, which makes debugging difficult. Key changes proposed in the FLIP are as follows: - Add location information to distributed components - Add topology information to graph components - Log the ID’s lineage information - Expose the identifier of distributing component to user Please find more details in the FLIP wiki document [1]. Looking forward to your feedbacks. [1] https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=148643521 Best, Yangze Guo