Table statistics collection was added in HIVE-33 (numRows, rawDataSize,
etc.). Is there any way to define and gather your own custom statistics?

For example, given table A with columns x, y, z, I want to gather
count(distinct x, y) as a statistic that would be stored in the metastore.

I know there are column-level statistics that include an approximate
distinct count for a single column, but that doesn't help with my use case
above, where two columns are involved.
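
To show why the single-column numbers are not enough, here is a toy sketch
in plain Python (not Hive code; the rows and the helper name
distinct_counts are made up for illustration). Both data sets have the same
per-column distinct counts, but a different count(distinct x, y):

```python
# Toy rows standing in for table A (columns x, y, z); the values are made up.
rows_a = [
    (1, "a", 10),
    (1, "b", 20),
    (2, "a", 30),
    (2, "b", 40),
]
rows_b = [
    (1, "a", 10),
    (1, "a", 20),
    (2, "b", 30),
    (2, "b", 40),
]


def distinct_counts(rows):
    """Return (ndv_x, ndv_y, ndv_xy) for a list of (x, y, z) tuples."""
    ndv_x = len({x for x, _, _ in rows})            # distinct values of x
    ndv_y = len({y for _, y, _ in rows})            # distinct values of y
    ndv_xy = len({(x, y) for x, y, _ in rows})      # distinct (x, y) pairs
    return ndv_x, ndv_y, ndv_xy


print(distinct_counts(rows_a))
print(distinct_counts(rows_b))
```

So the per-column counts only bound the pair count (somewhere between
max(ndv_x, ndv_y) and ndv_x * ndv_y); they can't tell me the actual value.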

Thanks,
Alex.
