zhangfengcdt opened a new pull request, #2546: URL: https://github.com/apache/sedona/pull/2546
## Did you read the Contributor Guide? - Yes, I have read the [Contributor Rules](https://sedona.apache.org/latest/community/rule/) and [Contributor Development Guide](https://sedona.apache.org/latest/community/develop/) ## Is this PR related to a ticket? - Yes, and the PR name follows the format `[GH-XXX] my subject`. Closes #<issue_number> ## What changes were proposed in this PR? - Add `ST_Collect_Aggr` as a new spatial aggregate function - Collects all geometries in a column into a multi-geometry (MultiPoint, MultiLineString, MultiPolygon, or GeometryCollection) - Unlike `ST_Union_Aggr`, this function does not dissolve boundaries - it simply collects geometries - Add `ST_Collect_Aggr` class in `AggregateFunctions.scala` - Add DataFrame API support in `st_aggregates.scala` - Register function in Spark SQL catalog - Add Python API in `st_aggregates.py` - Add Scala and Python tests - Add documentation ## How was this patch tested? - Scala unit tests for various geometry types (points, polygons, mixed) - Test GROUP BY functionality - Test null handling (nulls are skipped) - Test duplicate preservation (unlike ST_Union_Aggr) - DataFrame API tests (Scala and Python) ## Did this PR include necessary documentation updates? - Yes, I have updated the documentation. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
