Kontinuation opened a new pull request, #2297: URL: https://github.com/apache/sedona/pull/2297
## Did you read the Contributor Guide? - Yes, I have read the [Contributor Rules](https://sedona.apache.org/latest/community/rule/) and [Contributor Development Guide](https://sedona.apache.org/latest/community/develop/) ## Is this PR related to a ticket? - Yes, and the PR name follows the format `[GH-XXX] my subject`. Closes #2250. ## What changes were proposed in this PR? This PR copies ParquetFileFormat source code from Apache Spark 3.5.6 with minor modifications. This is to avoid depending on rapidly changing internal Spark APIs related to ParquetFileFormat. GeoParquetFileFormat used to depend on the ParquetFileFormat class of Spark, and it has caused lots of problems on Databricks, so we have to copy the code here to avoid the dependency. See related issues: - https://github.com/apache/sedona/issues/2250 - https://github.com/apache/sedona/issues/1905 - https://github.com/apache/sedona/issues/1869 - https://github.com/apache/sedona/issues/996 ## How was this patch tested? - Passing existing unit tests - [TODO] Test on Databricks ## Did this PR include necessary documentation updates? - No, this PR does not affect any public API so no need to change the documentation. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
