Re: S3 example in Java

Ryan Blue Wed, 24 Jun 2020 12:35:11 -0700

Correct. You should just need to set the warehouse path so that the new
table is created in S3. You can also create a table with a specific path.


On Wed, Jun 24, 2020 at 11:39 AM Chen Song <[email protected]> wrote:

> Thanks Ryan/Edgar.
>
> I understand that I need to create a table via HiveCatalog. I already
> successfully set up a hive megastore for this and successfully created an
> Iceberg with that. I meant to create an Iceberg backed by data files
> created on S3. Sorry if my question was unclear. It seems like that these
> are already supported out of the box?
> Is it just a matter to change hive warehouse to a path like `s3://...`?
>
> Chen
>
> On Wed, Jun 24, 2020 at 2:00 PM Ryan Blue <[email protected]>
> wrote:
>
>> Chen,
>>
>> We have a getting started page that covers creating a table using the
>> HiveCatalog: https://iceberg.apache.org/getting-started/#creating-a-table
>>
>> I agree with Edgar that you probably want to start with a HiveCatalog
>> table. By default, tables are created under your warehouse path. If that's
>> an S3 location, then it will use the S3 file system that is available. You
>> can set the normal Hive warehouse config property to change the path. You
>> can also specify the path directly by setting "location".
>>
>> One more thing to note about using the Hive catalog vs a table location:
>> you _can_ use a table location without a metastore with S3. The trade-off
>> is that concurrent writes will not be atomic. If you want to just try it
>> out and don't need concurrent writes, then just using an S3 path is a
>> simple way to get started.
>>
>> rb
>>
>> On Wed, Jun 24, 2020 at 10:36 AM Edgar Rodriguez
>> <[email protected]> wrote:
>>
>>> Hi Chen,
>>>
>>> Since S3 does not have atomic rename operation, for create/write/read of
>>> tables in S3 currently the only way to do it is via the HiveCatalog
>>> implementation which requires a Hive metastore with Lock support to provide
>>> the atomic commit required by Iceberg. You can alternatively write your
>>> custom Catalog implementation in which you set up your custom atomic commit
>>> mechanism as shown in http://iceberg.apache.org/custom-catalog/.
>>>
>>> Cheers,
>>>
>>> On Wed, Jun 24, 2020 at 6:12 AM Chen Song <[email protected]>
>>> wrote:
>>>
>>>> Hi
>>>>
>>>> Are there any Java examples to create/write/read tables backed by S3? I
>>>> tried to search in the documentation and github but did not find anything.
>>>>
>>>> Thanks
>>>> Chen
>>>>
>>>>
>>>
>>> --
>>> Edgar R
>>>
>>
>>
>> --
>> Ryan Blue
>> Software Engineer
>> Netflix
>>
>
>
> --
> Chen Song
>
>

-- 
Ryan Blue
Software Engineer
Netflix

Re: S3 example in Java

Reply via email to