>
> Maybe we could update the time-based partition functions to be applied to
> a long column directly. It would treat that column like a timestamp in
> milliseconds. Would that work? I need to think more about the implications
> of doing that, but I don't think that we currently have an issue with
> extending the supported source column types like that.


By update do you mean adding new functions like HOUR_FROM_UNIX_EPOCH_MILLIS
or overloading the definition to accept Long values and treat them as
milliseconds. The former seems more reasonable to me. The latter I think
has many of the same draw-backs raised on the other thread.  IMO, both
aren't super pleasing from a long term maintainability perspective.

Cheers,
Micah

On Tue, Sep 10, 2024 at 8:50 AM rdb...@gmail.com <rdb...@gmail.com> wrote:

> Maybe we could update the time-based partition functions to be applied to
> a long column directly. It would treat that column like a timestamp in
> milliseconds. Would that work? I need to think more about the implications
> of doing that, but I don't think that we currently have an issue with
> extending the supported source column types like that.
>
> On Mon, Sep 9, 2024 at 9:05 PM Manu Zhang <owenzhang1...@gmail.com> wrote:
>
>> Hi all,
>>
>> I'd like to bump this thread since we don't want to allow long to
>> timestamp promotion in V3
>> <https://lists.apache.org/thread/79y866zdhs2fmyv0nsfq3xvdsmqh7h8c>.
>> What other options do we have?
>>
>> Thanks,
>> Manu
>>
>> On Fri, Apr 5, 2024 at 12:09 AM Jean-Baptiste Onofré <j...@nanthrax.net>
>> wrote:
>>
>>> Ah yes, milestone is fine. Thanks !
>>>
>>> All good.
>>>
>>> Regards
>>> JB
>>>
>>> On Thu, Apr 4, 2024 at 5:12 PM Eduard Tudenhoefner <edu...@tabular.io>
>>> wrote:
>>> >
>>> > There is the V3 Spec milestone where it's tracked (amongst other
>>> things).
>>> >
>>> > On Thu, Apr 4, 2024 at 9:44 AM Jean-Baptiste Onofré <j...@nanthrax.net>
>>> wrote:
>>> >>
>>> >> Hi Eduard,
>>> >>
>>> >> Thanks for the update ! It makes sense to me.
>>> >>
>>> >> Maybe a GH label with spec or v3_spec would help to see what is
>>> planned for v3 ?
>>> >>
>>> >> Regards
>>> >> JB
>>> >>
>>> >> On Thu, Apr 4, 2024 at 9:36 AM Eduard Tudenhoefner <edu...@tabular.io>
>>> wrote:
>>> >> >
>>> >> > Type promotion from Long to Timestamp is on the roadmap for the V3
>>> Spec, so that would be the preferred way.
>>> >> >
>>> >> > On Wed, Apr 3, 2024 at 10:38 AM Jean-Baptiste Onofré <
>>> j...@nanthrax.net> wrote:
>>> >> >>
>>> >> >> Hi Manu
>>> >> >>
>>> >> >> TIMESTAMP_LONG type promotion could be the easiest way, it would
>>> work
>>> >> >> with the existing transform.
>>> >> >>
>>> >> >> Would it work for you?
>>> >> >>
>>> >> >> Regards
>>> >> >> JB
>>> >> >>
>>> >> >> On Wed, Apr 3, 2024 at 5:56 AM Manu Zhang <owenzhang1...@gmail.com>
>>> wrote:
>>> >> >> >
>>> >> >> > Hi all,
>>> >> >> >
>>> >> >> > We have source data with a timestamp field in LONG type to land
>>> in an Iceberg table. We want to partition the table with the timestamp
>>> field such that we can query recent data more efficiently. However, LONG is
>>> not supported as the source type of time-based transform (hour, day, etc)
>>> >> >> >
>>> >> >> > I find the previous discussion
>>> https://github.com/apache/iceberg/issues/417 and Ryan suggested two
>>> solutions
>>> >> >> >
>>> >> >> > 1. type promotion from LONG to TIMESTAMP
>>> >> >> > 2. custom transform
>>> >> >> >
>>> >> >> > As I understand it, neither solution has already been
>>> implemented yet. Is there any progress in either direction? Which solution
>>> does the community prefer? Any other suggestions are also appreciated.
>>> >> >> >
>>> >> >> > Thanks,
>>> >> >> > Manu
>>>
>>

Reply via email to