kylebarron opened a new issue, #978: URL: https://github.com/apache/datafusion-python/issues/978
**Describe the bug** Currently, the default parameters when writing to Parquet are _no compression_ https://github.com/apache/datafusion-python/blob/79c22d6d6c0809e7e93a0a23249baa516dbd8d6f/python/datafusion/dataframe.py#L620-L625 Since most high-level end users use the default parameters, I think this is a very bad footgun, and users will think Datafusion is doing something wrong. **To Reproduce** **Expected behavior** Parquet files should be written with some compression by default. I'd suggest ZSTD. **Additional context** Add any other context about the problem here. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
