jiayuasu commented on PR #50:
URL: https://github.com/apache/sedona-spatialbench/pull/50#issuecomment-3398778031

@asinghvi17 We intentionally avoided using GeoParquet in SpatialBench v0.1.0 for the following reasons:

1. Parquet Geo type adoption: We decided to wait until the Parquet Geo type gains broader adoption. Our goal is to use the Parquet Geo type instead of GeoParquet 1.0 / 1.1. The Sedona team is one of the main driving forces behind GeoParquet / Parquet Geo, and we are currently implementing Parquet Geo support in multiple languages, including Rust. We prefer Parquet Geo because it supports both Geometry and Geography types, which provides better flexibility than GeoParquet.

2. Avoiding spatial pruning effects: We intentionally skipped writing spatial statistics in v0.1.0 to avoid the impact of Parquet’s data pruning. The spatial ordering of features within files can significantly affect pruning performance, and we didn’t want this factor to influence benchmark comparability. Handling data sorting and spatial locality is planned for a future release.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
