[ https://issues.apache.org/jira/browse/SPARK-51704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yang Jie reassigned SPARK-51704: -------------------------------- Assignee: zhoubin > Eliminate unnecessary collect operation > --------------------------------------- > > Key: SPARK-51704 > URL: https://issues.apache.org/jira/browse/SPARK-51704 > Project: Spark > Issue Type: Improvement > Components: SQL > Affects Versions: 4.0.0, 3.5.5, 4.1.0 > Reporter: zhoubin > Assignee: zhoubin > Priority: Minor > Labels: pull-request-available > > Change the `TreeNode` operation from `{{{}collect`{}}} to > `{{{}collectFirst`,{}}} when it is supposed to find the first qualified node > rather than apply function through out all the node and it's child, this > function should not take any effect or make any change to the node which > applied for. > Any scenarios satisfy below factors: > # Apply certain operation recursively on the `TreeNode` including itself > # The final purpose is to find the first qualified node in a pre order way > # The final purpose is to verify that there is at least one node which > satisfy the requirement > # Partial function applied should not effect the node or other related > Especially, the last factor should be reviewed carefully. > -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org