Sihua Zhou created FLINK-9474:
---------------------------------

             Summary: Introduce an approximate version of "count distinct"
                 Key: FLINK-9474
                 URL: https://issues.apache.org/jira/browse/FLINK-9474
             Project: Flink
          Issue Type: New Feature
          Components: Table API & SQL
    Affects Versions: 1.5.0
            Reporter: Sihua Zhou
            Assignee: Sihua Zhou

We can implement an approximate version of "count distinct" based on the "Elastic Bloom Filter". It could be very fast because we no longer need to query the state, and its accuracy should be configurable, e.g. 95% or 98%.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
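
For illustration only, here is a minimal sketch of the general idea behind a Bloom-filter-based approximate "count distinct". It uses a plain fixed-size Bloom filter, not the "Elastic Bloom Filter" named in the issue, and it is not Flink code: the class name `BloomDistinctCounter` and its parameters `expected_items` and `false_positive_rate` are hypothetical. A value is counted only when the filter has not seen it before, so false positives make the result a slight undercount, and a lower false-positive rate gives higher accuracy at the cost of more memory.

```python
import hashlib
import math


class BloomDistinctCounter:
    """Approximate distinct counter backed by a fixed-size Bloom filter (illustrative sketch)."""

    def __init__(self, expected_items, false_positive_rate):
        # Standard Bloom filter sizing: m = -n * ln(p) / (ln 2)^2, k = (m / n) * ln 2.
        self.num_bits = max(
            1, math.ceil(-expected_items * math.log(false_positive_rate) / (math.log(2) ** 2))
        )
        self.num_hashes = max(1, round(self.num_bits / expected_items * math.log(2)))
        self.bits = bytearray((self.num_bits + 7) // 8)
        self.count = 0

    def _positions(self, value):
        # Derive k bit positions from two 64-bit hashes (double hashing).
        digest = hashlib.sha256(repr(value).encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, value):
        """Record a value; return True if it was counted as new."""
        seen = True
        for pos in self._positions(value):
            byte, mask = pos // 8, 1 << (pos % 8)
            if not self.bits[byte] & mask:
                seen = False  # at least one bit was unset, so the value is definitely new
                self.bits[byte] |= mask
        if not seen:
            self.count += 1
        return not seen


counter = BloomDistinctCounter(expected_items=10_000, false_positive_rate=0.02)
for user in ["alice", "bob", "alice", "carol", "bob"]:
    counter.add(user)
print(counter.count)  # approximate number of distinct users
```

Only the bit array and one integer are kept, so no per-value state lookup is needed. The trade-off is that the count can never exceed the true number of distinct values but may fall below it as the filter fills up.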