Re: Question about Datasketches HLL as replacement of Clearspring in Apache Cassandra

2025-03-12 Thread Lee Rhodes
Štefan, I noticed that I didn't directly answer your question whether any kind of merge is possible between the CS sketch and our HLL sketch. The answer is no. There are several reasons "merge" is impossible: - The CS HyperLogLogPlus sketch uses MurmurHash3 (like we do) but they implemented

Re: [I] NULL handling in sketch functions [datasketches-bigquery]

2025-03-12 Thread via GitHub
AlexanderSaydakov commented on issue #145: URL: https://github.com/apache/datasketches-bigquery/issues/145#issuecomment-2718778337 This issue is slightly different. I addressed aggregate functions here: #140 Now aggregate functions always produce a sketch. Input of all nulls results in

Re: [I] Consider adding theta_sketch_agg_int64_lgk() [datasketches-bigquery]

2025-03-12 Thread via GitHub
will-lauer commented on issue #144: URL: https://github.com/apache/datasketches-bigquery/issues/144#issuecomment-2718720509 One way to think about it is that information could potentially be lost at multiple places: building the original sketches and each level of merging the sketches. By

[I] NULL handling in sketch functions [datasketches-bigquery]

2025-03-12 Thread via GitHub
nikunjbhartia opened a new issue, #145: URL: https://github.com/apache/datasketches-bigquery/issues/145 Can we handle nulls passed as sketches in sketch functions more gracefully ? Current error messages are confusing and unless we dig deeper into a sql query and intermediate data, its d