kylebarron opened a new issue, #978: URL: https://github.com/apache/datafusion-python/issues/978
**Describe the bug**
Currently, the default parameters for writing Parquet specify _no compression_:

https://github.com/apache/datafusion-python/blob/79c22d6d6c0809e7e93a0a23249baa516dbd8d6f/python/datafusion/dataframe.py#L620-L625

Since most high-level end users rely on the default parameters, I think this is a very bad footgun: users will end up with uncompressed files and think DataFusion is doing something wrong.

**Expected behavior**
Parquet files should be written with some compression by default. I'd suggest ZSTD.
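Until the default changes, a caller can request compression explicitly. The following is a minimal sketch, not the project's recommended fix: `write_parquet_compressed` and `size_on_disk` are hypothetical helper names, the level `4` is an arbitrary choice, and the sketch assumes `DataFrame.write_parquet` accepts `compression` and `compression_level` keyword arguments as in the source linked above.

```python
from pathlib import Path


def size_on_disk(path):
    """Return the total bytes at `path`, whether it is a single file
    or a directory of part files."""
    p = Path(path)
    if p.is_file():
        return p.stat().st_size
    return sum(f.stat().st_size for f in p.rglob("*") if f.is_file())


def write_parquet_compressed(df, path, codec="zstd", level=4):
    """Hypothetical helper: write `df` to Parquet with an explicit codec.

    Assumption: `df` is a datafusion DataFrame whose `write_parquet`
    takes `compression` and `compression_level` keyword arguments.
    Passing both explicitly avoids relying on the library default.
    """
    df.write_parquet(str(path), compression=codec, compression_level=level)
    # Report the resulting size so it can be compared with a default write.
    return size_on_disk(path)
```

Calling `write_parquet_compressed(df, "out.parquet")` and comparing the returned size with `size_on_disk` of a file written by a plain `df.write_parquet(...)` call should show how much the default costs for a given dataset.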