Csaba,
These are some very thoughtful suggestions and I can see that some
recommendations in this area would be useful.
Our focus in our DataSketches team is really on the sketching algorithms
and designing the core sketches to be very high performing, robust,
accurate, and easy to integrate (e.g.
Hi!
This came up while trying to ensure HLL sketch interoperability between
Apache Hive and Apache Impala.
Currently in Hive the following types are not supported by ds_hll_sketch():
- BOOLEAN
- SMALLINT
- DECIMAL
- TIMESTAMP
- DATE
These types vary in complexity and usefulness, e.g. B