Hi I am looking for high performance count distinct solution on Hive Query.
Regular count distinct is very slow but if I use probabilistic count distinct has more error percentage (if the number of records are small). Is there is any solution to have exact count distinct but using low memory and without error? Thanks and Regards Prabakaran.N