Sounds good!

-Jack

On Wed, Dec 6, 2023 at 10:16 AM Tim Brown <t...@onehouse.ai> wrote:

> Hi Ryan,
>
> Apologies for the noise.
>
> Jack and Walaa, let's move any conversations to the Discussions
> <https://github.com/onetable-io/onetable/discussions> board on github for
> the project. Also feel free to reach out to me directly if you prefer.
>
> I'm personally looking forward to learning more about the Rest Catalog and
> making sure we integrate well with it from OneTable.
>
> -Tim Brown
>
> On Wed, Dec 6, 2023 at 11:55 AM Ryan Blue <b...@tabular.io> wrote:
>
>> I'm not sure that this is the right place for a discussion about the
>> merits of their approach.
>>
>> This list is for Iceberg development. I encourage anyone interested to
>> follow up on the appropriate incubator list rather than here.
>>
>> I also think it's debatable whether advertising other projects is helpful
>> or wanted here, but I'd rather not add to the noise either way.
>>
>> Ryan
>>
>> On Tue, Dec 5, 2023 at 8:36 PM Jack Ye <yezhao...@gmail.com> wrote:
>>
>>> I recently did an analysis of the OneTable project, overall it made me a
>>> bit confused.
>>>
>>> From an end user's perspective, no one really wants to use all these 3
>>> formats, and most companies do not have the engineering resources to
>>> maintain a stack of all these 3 formats. Eventually people pick one and
>>> just stick with it.
>>>
>>> If the goal is to provide a converter, then individual communities have
>>> developed different tools, such as Delta's Uniform, Iceberg's snapshot and
>>> migrate procedures, Hudi's bootstrap methods. The advantage of those tools
>>> is that the specific community knows the best way to convert a foreign data
>>> source to its native format, and can declare compatibility and fail
>>> whenever necessary. It is not bounded to the expressiveness of an internal
>>> data model like OneTable, OneField, OneSchema, etc.
>>>
>>> If the goal is format unification, at least for me being in the Iceberg
>>> community with a bit bias, a more straightforward way to achieve the goal
>>> is to extend the feature of "Iceberg external tables", where we can map
>>> Hive, Delta, Hudi and other table formats directly to Iceberg format behind
>>> a REST catalog, and make that readable. This is kind of related to a
>>> recent email thread I sent regarding the EXTERNAL/MANAGED syntax
>>> <https://lists.apache.org/thread/ohqfvhf4wofzkhrvff1lxl58blh432o6>. And
>>> linking back to this thread, that essentially makes Iceberg the unified
>>> format, and we are actually pretty close to achieving that. With this
>>> approach, you get not just conversion, you can (1) not do physical metadata
>>> conversion but directly convert table metadata at runtime to Iceberg data
>>> model, (2) query all the tables using a single unified Iceberg connector in
>>> all supported engines, and (3) it is a very standardized external table
>>> concept that all database system folks immediately understand.
>>>
>>> This makes me feel that we are trying to make OneTable a new table
>>> format without saying it is a new table format. Although the Apache
>>> Incubation proposal clearly says "OneTable is NOT a new table format", it
>>> is hard for me to envision a long-term roadmap that does not eventually
>>> make it a table format, with connectors and data maintenance features built
>>> directly against this internal model, which is kind of feels like what the
>>> commercial entity OneHouse is trying to do right now, but maybe I am wrong.
>>>
>>> What do you think?
>>>
>>> Best,
>>> Jack Ye
>>>
>>> On Tue, Dec 5, 2023 at 3:30 PM Jesús Camacho Rodríguez <
>>> jcama...@apache.org> wrote:
>>>
>>>> Currently, there is no established group discussions. The project was
>>>> recently open-sourced, and communication is currently done through GitHub.
>>>> (If the project is accepted into the ASF incubator, mailing lists will be
>>>> created). If you're interested in regular meetings, feel free to suggest it
>>>> to the community on GitHub.
>>>>
>>>> Thanks,
>>>> Jesús
>>>>
>>>>
>>>> On 2023/12/05 06:30:38 Gaurav Agarwal wrote:
>>>> > HI
>>>> > Thanks for this mail , I would like to know is there any group
>>>> discussion
>>>> > also happened or any call to discuss the issues.
>>>> >
>>>> > thanks
>>>> >
>>>> >
>>>> > On Tue, Dec 5, 2023 at 9:29 AM Walaa Eldin Moustafa <
>>>> wa.moust...@gmail.com>
>>>> > wrote:
>>>> >
>>>> > > Thanks Jesus for sharing OneTable. Looks like it touches upon some
>>>> of the
>>>> > > topics we discussed in the Rise of Table Formats panel at VLDB
>>>> > > <https://ceur-ws.org/Vol-3462/CDMS18.pdf> back in September. I was
>>>> > > browsing through the source code, and I ran into the OneField
>>>> > > <
>>>> https://github.com/onetable-io/onetable/blob/main/api/src/main/java/io/onetable/model/schema/OneField.java>
>>>> class
>>>> > > and noticed it has support for default values, which is good, but
>>>> in the
>>>> > > Iceberg spec, there are two default values
>>>> > > <https://iceberg.apache.org/spec/#default-values> (more details in
>>>> the
>>>> > > spec and respective PR). I was pointing this out as an example of
>>>> small
>>>> > > nuances that can differ from one format to another and was
>>>> wondering how
>>>> > > OneTable is planning to bridge them?
>>>> > >
>>>> > > Thanks,
>>>> > > Walaa.
>>>> > >
>>>> > >
>>>> >
>>>>
>>>
>>
>> --
>> Ryan Blue
>> Tabular
>>
>

Reply via email to