GitHub user adaros92 added a comment to the discussion: Is there a way to do 
spatial partitioning like we can with RDD using DataFrame API?

Yeah, that's what I thought, but I've been using version 1.7 and doing a simple 
ST_Intersects join between a points dataset and a waterbodies dataset. When I 
look at the explain plan, I don't see a spatial index. The only way I get one is 
to broadcast one of the datasets; then I do see the index, as in the image 
below. We are just reading regular Parquet with WKB geometries, which we convert 
with ST_GeomFromWKB. To get both the spatial index and spatial partitioning, I 
need to convert to RDD and then back.

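```
from sedona.core.enums import GridType, IndexType

# Quadtree-partition the lakes, then reuse that partitioner for the waypoints.
lakes_rdd.spatialPartitioning(GridType.QUADTREE, 200)
waypoints_rdd.spatialPartitioning(lakes_rdd.getPartitioner())

# Build a quadtree index on the spatially partitioned lakes RDD.
lakes_rdd.buildIndex(IndexType.QUADTREE, True)
```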
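
For context, here is a rough sketch of the two routes described above: the broadcast join in the DataFrame API, and the full DataFrame -> SpatialRDD -> DataFrame round trip. It is only an illustration under assumptions: the function names, the `geom` column name, and the `points_df` / `lakes_df` inputs are hypothetical, and the import paths follow the `sedona.core` / `sedona.utils` layout, which may need adjusting for your release. Imports are kept inside the functions so the definitions load even where Sedona is not installed.

```python
def broadcast_intersects_join(points_df, lakes_df, geom_col="geom"):
    """DataFrame route: broadcast the lakes side of an ST_Intersects join.

    Assumes both DataFrames already hold a geometry column named `geom_col`
    (hypothetical name), e.g. produced with ST_GeomFromWKB.
    """
    from pyspark.sql.functions import broadcast, expr

    return points_df.alias("p").join(
        broadcast(lakes_df).alias("l"),
        expr(f"ST_Intersects(l.{geom_col}, p.{geom_col})"),
    )


def rdd_round_trip_join(spark, points_df, lakes_df, geom_col="geom",
                        num_partitions=200):
    """RDD route: partition and index as SpatialRDDs, join, return a DataFrame."""
    from sedona.core.enums import GridType, IndexType
    from sedona.core.spatialOperator import JoinQuery
    from sedona.utils.adapter import Adapter

    # DataFrame -> SpatialRDD
    lakes_rdd = Adapter.toSpatialRdd(lakes_df, geom_col)
    points_rdd = Adapter.toSpatialRdd(points_df, geom_col)

    # Compute dataset boundaries before spatial partitioning.
    lakes_rdd.analyze()
    points_rdd.analyze()

    # Same partitioning and indexing steps as in the snippet above.
    lakes_rdd.spatialPartitioning(GridType.QUADTREE, num_partitions)
    points_rdd.spatialPartitioning(lakes_rdd.getPartitioner())
    lakes_rdd.buildIndex(IndexType.QUADTREE, True)

    # Flat pair join: use the index, count boundary intersections as matches.
    pairs = JoinQuery.SpatialJoinQueryFlat(points_rdd, lakes_rdd, True, True)

    # SpatialRDD pairs -> DataFrame
    return Adapter.toDf(pairs, spark)
```

Calling `.explain()` on the result of `broadcast_intersects_join(points_df, lakes_df)` is one way to check whether the plan shows the index.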

<img width="1196" height="841" alt="Screenshot 2025-12-10 at 5 34 44 PM" 
src="https://github.com/user-attachments/assets/d253872f-87d0-4413-9cf9-bfd16a7b6071"
 />


GitHub link: 
https://github.com/apache/sedona/discussions/2554#discussioncomment-15224768
