Hi everyone,

We have a significantly large dataset that is already represented by theta 
sketches, across a variety of dimensions.
The theta sketch is chosen because the type of queries that are presented often 
involve many set operations, to filter down the data according to various 
dimension combinations.
Recently, there is a compelling use case to adopt tuple sketches for some of 
the properties of the data.  These tuple sketches would also need to be 
filtered according to the same query criteria as the theta sketches.
My initial assumption is that we would need to replicate all of our existing 
data as tuple sketches to accommodate this filtering.  Since tuple sketches are 
an extension of the theta idea, is it accurate to regard a tuple sketch where 
the summary mode
Is ALWAYS_ONE as isomorphic the theta sketch, in accuracy and behaviour?
Converting our entire dataset over to tuple sketches is not an option, and I 
also am unconvinced that a theta sketch can be easily converted to a tuple 
sketch on the fly, using the aforementioned summary property.

Would anyone be able to verify my assumptions in this case, namely:
- the theta sketch and tuple sketch can not be combined
- the tuple sketch cannot be converted from and to a theta sketch
- it is necessary to replicate an entire dataset into tuple sketches to use 
common set operations across the same dimensions

Thank you for open-sourcing this library, and for any help with regard to the 
above,
David

Reply via email to