jiayuasu opened a new pull request, #2528: URL: https://github.com/apache/sedona/pull/2528
## Did you read the Contributor Guide? - Yes, I have read the [Contributor Rules](https://sedona.apache.org/latest/community/rule/) and [Contributor Development Guide](https://sedona.apache.org/latest/community/develop/) ## Is this PR related to a ticket? - Yes, and the PR name follows the format `[GH-XXX] my subject`. Closes #2489 ## What changes were proposed in this PR? This pull request improves the reliability and user experience of downloading large dependencies in the Docker installation scripts for Spark, Zeppelin, Sedona, and related tools. The main changes introduce a helper function to show throttled progress updates during downloads, switch some downloads to faster mirrors with checksum verification, and enhance error handling and feedback for users. **Enhanced download feedback and reliability:** * Added a `download_with_progress` helper function to `docker/install-spark.sh` and `docker/install-zeppelin.sh` that downloads files in the background and prints progress updates every 5 seconds, including final size and error messages. This improves user visibility into long downloads and makes troubleshooting easier. [[1]](diffhunk://#diff-750c969ccb006f6f5c4553cad3487d236f71976672d55a4c6e34e26c82a691f4R27-R89) [[2]](diffhunk://#diff-4cbc7691181a9715b9e7685b9ad33fc81b0f99bcae7a17a8fa79172c6ccef626L26-R69) * Replaced direct `curl` calls with the new progress-enabled function for downloading Spark, Zeppelin, Hadoop AWS, and AWS SDK jars, ensuring consistent feedback and error handling across installations. [[1]](diffhunk://#diff-750c969ccb006f6f5c4553cad3487d236f71976672d55a4c6e34e26c82a691f4R27-R89) [[2]](diffhunk://#diff-4cbc7691181a9715b9e7685b9ad33fc81b0f99bcae7a17a8fa79172c6ccef626L26-R69) **Performance and integrity improvements:** * Changed Spark download in `docker/install-spark.sh` to use the Lyra Hosting mirror for faster downloads, while verifying the archive's integrity using a SHA-512 checksum from the official Apache archive. This ensures both speed and security. **User experience improvements:** * Added progress bar and error display options to `curl` commands for Sedona, gresearch spark extension, and GeoTools jar downloads in `docker/install-sedona.sh`, providing clearer feedback during installations. ## How was this patch tested? Passed local build ## Did this PR include necessary documentation updates? - No, this PR does not affect any public API so no need to change the documentation. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
