Sure, you could implement that in a catalog, either one that you plug in or
contribute it to the Iceberg HiveCatalog.

On Mon, Feb 1, 2021 at 3:01 AM luochong.lxf <luochong....@gmail.com> wrote:

> Hi Ryan,
>
> Generally we add createTime and modifiedTime in table schema. However, due
> to historical reasons some hive tables do not have createTime and
> modifiedTime. When these hive tables are transformed to iceberg tables, we
> hope createTime and transient_lastDdl can be retained, so we can still do
> data expiration and track table activities. Once snapshots expire, we can
> not get these time info from iceberg. It seems the only way to solve the
> problem I mentioned above is to modify these hive table schema and rewrite
> them. Do you think so?
>
> Thanks
>
> luochong.lxf
> luochong....@gmail.com
>
> <https://maas.mail.163.com/dashi-web-extend/html/proSignature.html?ftlId=1&name=luochong.lxf&uid=luochong.lxf%40gmail.com&iconUrl=https%3A%2F%2Fmail-online.nosdn.127.net%2Fqiyelogo%2FdefaultAvatar.png&items=%5B%22luochong.lxf%40gmail.com%22%5D>
> 签名由 网易邮箱大师 <https://mail.163.com/dashi/dlpro.html?from=mail81> 定制
>
> On 01/29/2021 02:40,Ryan Blue<rb...@netflix.com.INVALID>
> <rb...@netflix.com.INVALID> wrote:
>
> Chong,
>
> Once snapshots expire, I don't think that there is a way to recover the
> time that a given partition was created.
>
> Can you explain more about what you're trying to do? When we age off data,
> we use the age of the records themselves, not the age from metadata. In
> other words, we use the logical timestamp from a row to expire it, not the
> timestamp when it was added to the table. You might consider doing that as
> well. I think it is probably a better way to ensure compliance.
>
> rb
>
> On Thu, Jan 28, 2021 at 9:42 AM chong luo <luochong....@gmail.com> wrote:
>
>> Hi Iceberg Devs
>>
>>
>> I’m currently working on delete expired table and partition in iceberg.
>> However, I can not find table/partition creation time, it seems iceberg
>> only stores snapshot creation time. In hive, transient_lastDdlTime,
>> createTime and lastAccessTime are stored in metastore. With time metadata,
>> we can know when table is changed and track related jobs.
>>
>>
>> Is there any way to get the time metadata mentioned above in the current
>> implementation of iceberg?
>>
>>
>>
>> Thanks.
>>
>
>
> --
> Ryan Blue
> Software Engineer
> Netflix
>
>

-- 
Ryan Blue
Software Engineer
Netflix

Reply via email to