Hello Apache DataSketches community,

We are pleased to have released DataSketches-python 5.0.1. (This happened
in early February but I realized I never sent an announcement email.)

This is the first stand-alone python release and comes with a number of
changes:
* Vastly improved API documentation via sphinx
* More pythonic API:
  * __copy__ instead of C++-style copy constructors
  * __str__ takes no arguments in all cases
* Moved from pybind11 to nanobind, which also requires c++17 internally
* Added new exact proportional to size sampling sketch, ebpps_sketch
* Fixed slowness with theta intersection
* Fixed incompatibility with Java's serialized empty frequent items sketch
* Removed default Kernel for density_sketch (GaussianKernel is still
provided)
* Removed extra (largely transparent to users) wrappers around
density_sketch and tuple_sketch

The official download page:
https://datasketches.apache.org/docs/Community/Downloads.html

The release tag on github:
https://github.com/apache/datasketches-python/releases/tag/5.0.1

The python version is also available via pypi:
https://pypi.org/project/datasketches/

For those interested in why this was 5.0.1, there was a packaging issue
with 5.0.0 causing the pypi binaries to be invalid in certain cases. There
are no code differences between 5.0.0 and 5.0.1, only GitHub Actions
workflow changes.

Thank you,
Jon Malkin
jmal...@apache.org

Reply via email to