Re: [Rd] Package compression benchmarks for zstd vs gzip

2025-01-15 Thread Simon Urbanek
Heather, thanks, now fixed (datasets was using a numeric value for compress= instead of the compression name, so it picked zstd instead of gzip; the switch order is now kept the same).
Cheers,
Simon
> On Jan 15, 2025, at 10:21 PM, Heather Turner wrote:
>
> With the changes to add zstd support

Re: [Rd] Package compression benchmarks for zstd vs gzip

2025-01-15 Thread Heather Turner
With the changes to add zstd support yesterday, the build of R-devel is failing when zstd is not present, even though the docs say that zstd is optional. The error comes in building the datasets package, see e.g. https://github.com/r-devel/r-svn/actions/runs/12760693086/job/35566530112. Best wi

Re: [Rd] Package compression benchmarks for zstd vs gzip

2025-01-12 Thread Simon Urbanek
I think the first step would have to be to add zstd support to R. zstd is a bit controversial (as shown by the community blowback over the changes you mentioned) and their build system (calling it that is being very generous) is a mess, so it would require a bit of testing, but it is doable. That sa

Re: [Rd] Package compression benchmarks for zstd vs gzip

2025-01-12 Thread Ivan Krylov via R-devel
On Sat, 11 Jan 2025 16:05:46 -0800 Henrik Bengtsson wrote:
> It's probably also worth looking at package compression with 'xz'
> compression. In [1], Mike FC has a graph where 'bzip2' and 'xz' seem
> to give the best compression ratios, at least for RDS files.
'bzip2' can be surprisingly good on

Re: [Rd] Package compression benchmarks for zstd vs gzip

2025-01-11 Thread Henrik Bengtsson
Can't speak for Jeroen, but it sounds like it's worth adding support for tar.zstd package files, just like how tar.gz, tar.xz, and tar.bzip2 are currently supported. I'd also argue for supporting zstd compression throughout R, including adding zstdfile(), support for saveRDS(..., compress = "zstd"), a

Re: [Rd] Package compression benchmarks for zstd vs gzip

2025-01-11 Thread Avraham Adler
zstd is accessible within R using the archive package [1]. I use it all the time when saving large objects, using code I adapted from [2]. Is your suggestion to import the libraries/source code into base?
[1] https://CRAN.R-project.org/package=archive
[2] https://coolbutuseless.github.io/2018/10/

[Rd] Package compression benchmarks for zstd vs gzip

2025-01-10 Thread Jeroen Ooms
Many distros and browsers these days use zstd as the preferred compression method. For example, if you unpack a .deb or .rpm file on Debian or Fedora there is a zstd archive inside. It is claimed that zstd offers improved compression over gzip, but (unlike lzma) it has comparable decompression speed.
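
The trade-off described above (compression ratio versus decompression speed) can be checked with a small benchmark. The following is only a rough sketch in Python, not the method used in this thread: the function name `benchmark` and the sample data are made up for illustration, and the zstd entry is included only if the interpreter ships the `compression.zstd` module (assumed to be Python 3.14 or later). Real package tarballs will behave differently from synthetic input.

```python
import bz2
import gzip
import lzma
import time


def benchmark(data: bytes) -> dict:
    """Return {codec: (compression_ratio, decompress_seconds)}.

    Hypothetical helper for illustration; each codec runs at its
    library default level, so the numbers are only indicative.
    """
    codecs = {
        "gzip": (gzip.compress, gzip.decompress),
        "bzip2": (bz2.compress, bz2.decompress),
        "xz": (lzma.compress, lzma.decompress),
    }
    try:
        # Assumption: only available on newer Python versions (3.14+).
        from compression import zstd
        codecs["zstd"] = (zstd.compress, zstd.decompress)
    except ImportError:
        pass  # zstd is skipped when the module is absent

    results = {}
    for name, (compress, decompress) in codecs.items():
        packed = compress(data)
        start = time.perf_counter()
        restored = decompress(packed)
        elapsed = time.perf_counter() - start
        # Sanity check: the round trip must reproduce the input.
        assert restored == data
        results[name] = (len(data) / len(packed), elapsed)
    return results


if __name__ == "__main__":
    # Synthetic, fairly repetitive sample; replace with real file contents.
    sample = b"".join(b"row %d,value %d\n" % (i, i * i % 97) for i in range(50000))
    for codec, (ratio, seconds) in benchmark(sample).items():
        print(f"{codec:6s} ratio={ratio:6.2f} decompress={seconds * 1000:8.3f} ms")
```

A single timed run like this is noisy. For anything beyond a quick look, repeat each measurement several times and use files that resemble actual package contents.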