Uwe L. Korn created ARROW-357:
---------------------------------
Summary: Default Parquet chunk_size of 64k is too small
Key: ARROW-357
URL: https://issues.apache.org/jira/browse/ARROW-357
Project: Apache Arrow
Issue Type: Bug
Affects Versions: 0.1.0
Reporter: Uwe L. Korn
Assignee: Uwe L. Korn
The current default size of RowGroups when writing Parquet files is 64k rows.
This produces a lot of RowGroups given large files which are then really
ineffective to read.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)