iemejia opened a new pull request, #576:
URL: https://github.com/apache/parquet-format/pull/576
## Summary
Fix errors, grammar, and cross-document inconsistencies in the core Parquet
format documentation (README, Encodings, Compression, LogicalTypes).
## Changes
### README.md
- Fix repetition level value for non-nested columns (1 -> 0)
- Update defunct Twitter Code of Conduct links to ASF
- Fix plural agreement ("encoded values is" -> "are")
- Hyphenate compound adjectives ("32 bit" -> "32-bit", etc.)
- Normalize GZIP casing; capitalize proper nouns (RCFile, Avro)
### Encodings.md
- Fix "bitwidth of each block" -> "each miniblock" (DELTA_BINARY_PACKED)
- Remove misleading "always preferred" claim for DELTA_LENGTH_BYTE_ARRAY
- Fix "at at time" -> "at a time"
- Fix BIT_PACKED tense ("will be replaced" -> already replaced)
- Fix PLAIN BOOLEAN link to reference RLE/bit-packing hybrid section
- Hyphenate compound adjectives; "can not" -> "cannot"
### Compression.md
- Fix ZSTD RFC reference (8478 -> 8878)
- Fix Snappy description to match parallel construction
- Remove double space; fix comma splice
### LogicalTypes.md
- Fix embedded types ordering contradiction
- Add nanosecond to TIME precision description
- Remove invalid `<tr colspan=3>` from logical-type tables
- Align DECIMAL precision/scale wording with parquet.thrift
- Fix NaNs casing; add Oxford commas
- "can not" -> "cannot"; grammar fixes throughout
## Validation
No semantic/behavioral changes to the format specification. All fixes are
documentation-only.
_Split from #572 for easier review._
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]