That's a good suggestion Fokko.
It would avoid maintaining one more docker image. We can update the
quickstart to use the docker image provided by Spark.

- Ajantha

On Tue, Dec 10, 2024 at 4:08 PM Fokko Driesprong <fo...@apache.org> wrote:

> Hey Ajantha,
>
> Thanks for bringing this up, we should both remove the vendor reference
> and bring this back up to date. My preference would be to rely on the Spark
> image <https://hub.docker.com/r/apache/spark> provided by the Apache
> Spark project, similar to what we do for the Hive
> <https://iceberg.apache.org/hive-quickstart/> quickstart. We should be
> able to load all the Iceberg-specific JARs through the spark.jars.packages
> configuration <https://spark.apache.org/docs/3.5.1/configuration.html>.
>
> Kind regards,
> Fokko
>
> Op di 10 dec 2024 om 11:16 schreef Ajantha Bhat <ajanthab...@gmail.com>:
>
>> The quickstart <https://iceberg.apache.org/spark-quickstart/> page is a
>> critical touchpoint for new users and plays a key role in driving project
>> adoption.
>> Currently, it references *tabulario/spark-iceberg* and
>> *tabulario/iceberg-rest*
>>
>> We’ve already replaced *tabulario/iceberg-rest* with the
>> community-maintained Docker image, *apache/iceberg-rest-fixture*, based
>> on the REST TCK fixture.
>>
>> However, *tabulario/spark-iceberg* seems outdated, and doesn't use the
>> latest Iceberg version.
>> To enhance the user experience and keep the quickstart aligned with
>> project standards, I suggest hosting it either under the /docker folder in
>> the Iceberg repository
>> or as a subproject called *apache/iceberg-playground* where users can
>> contribute to maintain other docker images.
>>
>> The quickstart page should ideally reference images maintained by the
>> community rather than vendor-specific open-source projects.
>>
>> Thoughts?
>>
>> - Ajantha
>>
>

Reply via email to