Hi,

My name is Matt and I'm a data engineer at Spotify. I'm testing out trying
Data Sketches with Postgres, and running into some performance issues. I'm
seeing merge times much slower than what I'm seeing in the docs here
<https://datasketches.apache.org/docs/Theta/ThetaMergeSpeed.html> (millions
of sketches/sec).

In my case, I've pre-computed many sketches, inserted then into PG, then
I'm running queries in PG and doing the merging there. My hunch is that
there's something wrong with my Postgres configs, which I've tried tweaking
extensively but haven't been able to improve query time.

My question is if anyone knows what type of performance can be expected in
Postgres and if anyone has any examples/tips in general from their
implementations.

Also, this is my first message to this list, so please let me know if I
should be directing it anywhere else!

Thanks!!
Matt



*Matthew Z. Farkas*

Data Science @ Spotify
MS Northwestern University, BS Georgia Tech

m: (770) 337-2709
e: mfarka...@gmail.com

<https://www.linkedin.com/in/matthewzfarkas>

Reply via email to