> > Maybe we could update the time-based partition functions to be applied to > a long column directly. It would treat that column like a timestamp in > milliseconds. Would that work? I need to think more about the implications > of doing that, but I don't think that we currently have an issue with > extending the supported source column types like that.
By update do you mean adding new functions like HOUR_FROM_UNIX_EPOCH_MILLIS or overloading the definition to accept Long values and treat them as milliseconds. The former seems more reasonable to me. The latter I think has many of the same draw-backs raised on the other thread. IMO, both aren't super pleasing from a long term maintainability perspective. Cheers, Micah On Tue, Sep 10, 2024 at 8:50 AM rdb...@gmail.com <rdb...@gmail.com> wrote: > Maybe we could update the time-based partition functions to be applied to > a long column directly. It would treat that column like a timestamp in > milliseconds. Would that work? I need to think more about the implications > of doing that, but I don't think that we currently have an issue with > extending the supported source column types like that. > > On Mon, Sep 9, 2024 at 9:05 PM Manu Zhang <owenzhang1...@gmail.com> wrote: > >> Hi all, >> >> I'd like to bump this thread since we don't want to allow long to >> timestamp promotion in V3 >> <https://lists.apache.org/thread/79y866zdhs2fmyv0nsfq3xvdsmqh7h8c>. >> What other options do we have? >> >> Thanks, >> Manu >> >> On Fri, Apr 5, 2024 at 12:09 AM Jean-Baptiste Onofré <j...@nanthrax.net> >> wrote: >> >>> Ah yes, milestone is fine. Thanks ! >>> >>> All good. >>> >>> Regards >>> JB >>> >>> On Thu, Apr 4, 2024 at 5:12 PM Eduard Tudenhoefner <edu...@tabular.io> >>> wrote: >>> > >>> > There is the V3 Spec milestone where it's tracked (amongst other >>> things). >>> > >>> > On Thu, Apr 4, 2024 at 9:44 AM Jean-Baptiste Onofré <j...@nanthrax.net> >>> wrote: >>> >> >>> >> Hi Eduard, >>> >> >>> >> Thanks for the update ! It makes sense to me. >>> >> >>> >> Maybe a GH label with spec or v3_spec would help to see what is >>> planned for v3 ? >>> >> >>> >> Regards >>> >> JB >>> >> >>> >> On Thu, Apr 4, 2024 at 9:36 AM Eduard Tudenhoefner <edu...@tabular.io> >>> wrote: >>> >> > >>> >> > Type promotion from Long to Timestamp is on the roadmap for the V3 >>> Spec, so that would be the preferred way. >>> >> > >>> >> > On Wed, Apr 3, 2024 at 10:38 AM Jean-Baptiste Onofré < >>> j...@nanthrax.net> wrote: >>> >> >> >>> >> >> Hi Manu >>> >> >> >>> >> >> TIMESTAMP_LONG type promotion could be the easiest way, it would >>> work >>> >> >> with the existing transform. >>> >> >> >>> >> >> Would it work for you? >>> >> >> >>> >> >> Regards >>> >> >> JB >>> >> >> >>> >> >> On Wed, Apr 3, 2024 at 5:56 AM Manu Zhang <owenzhang1...@gmail.com> >>> wrote: >>> >> >> > >>> >> >> > Hi all, >>> >> >> > >>> >> >> > We have source data with a timestamp field in LONG type to land >>> in an Iceberg table. We want to partition the table with the timestamp >>> field such that we can query recent data more efficiently. However, LONG is >>> not supported as the source type of time-based transform (hour, day, etc) >>> >> >> > >>> >> >> > I find the previous discussion >>> https://github.com/apache/iceberg/issues/417 and Ryan suggested two >>> solutions >>> >> >> > >>> >> >> > 1. type promotion from LONG to TIMESTAMP >>> >> >> > 2. custom transform >>> >> >> > >>> >> >> > As I understand it, neither solution has already been >>> implemented yet. Is there any progress in either direction? Which solution >>> does the community prefer? Any other suggestions are also appreciated. >>> >> >> > >>> >> >> > Thanks, >>> >> >> > Manu >>> >>