kylebarron opened a new issue, #978:
URL: https://github.com/apache/datafusion-python/issues/978

   **Describe the bug**
   
   Currently, the default parameters for writing Parquet files apply _no compression_:
   
   
https://github.com/apache/datafusion-python/blob/79c22d6d6c0809e7e93a0a23249baa516dbd8d6f/python/datafusion/dataframe.py#L620-L625
   
   Since most high-level end users rely on the default parameters, I think this is a serious footgun: users will see unexpectedly large files and assume DataFusion is doing something wrong.
   
   **To Reproduce**
   
   **Expected behavior**
   
   Parquet files should be written with some compression by default. I'd 
suggest ZSTD.
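
Until the default changes, a caller can request compression explicitly. The sketch below is only an illustration: it assumes `DataFrame.write_parquet` accepts `compression` and `compression_level` keyword arguments (check this against the installed version), and the helper name `write_parquet_zstd` is hypothetical, not part of datafusion-python.

```python
from pathlib import Path
from typing import Any, Optional, Union


def write_parquet_zstd(
    df: Any,
    path: Union[str, Path],
    compression_level: Optional[int] = None,
) -> None:
    """Write `df` to Parquet, requesting ZSTD instead of relying on the default.

    Assumption: `df.write_parquet(path, compression=..., compression_level=...)`
    exists with these keyword names; this helper is hypothetical.
    """
    # Pass the codec explicitly so the output does not depend on the library default.
    df.write_parquet(
        str(path),
        compression="zstd",
        compression_level=compression_level,
    )
```

With a real DataFrame this would be called as `write_parquet_zstd(df, "out.parquet")`; leaving `compression_level` as `None` defers the level choice to the library.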
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

