iemejia opened a new pull request, #573:
URL: https://github.com/apache/parquet-format/pull/573
## Summary
Fix typos, grammar, and comment inconsistencies in the canonical Thrift
schema definition.
## Changes
- Fix typos: "to be be", "documention", "not necessary"
- Remove off-by-one in DataPageHeaderV2 `is_compressed` comment
- Fix article agreement ("a element" -> "an element", "a OffsetIndex" -> "an
OffsetIndex")
- Disambiguate `compressed_page_size` comment in PageLocation (it includes
the header; the field of the same name on PageHeader does not)
- Fix "edges interpolation" -> "edge interpolation" in Geospatial comments
- Capitalize proper nouns: Hive, Pig; normalize GZIP casing
- Add terminal periods for consistency
- Clarify that BIT_PACKED is superseded by RLE (cross-reference Encodings.md)
- Add missing space before parenthesis in frameworks list
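
For reviewers, the `compressed_page_size` distinction noted above can be sketched roughly as follows. This is a minimal illustration, not code from the schema or this PR: the Python dataclasses are hypothetical stand-ins for the Thrift structs (only the one field is shown), and `serialized_header_len` and `page_location_size` are assumed names for the byte length of the serialized page header and a helper that combines the two sizes.

```python
from dataclasses import dataclass


@dataclass
class PageHeader:
    # Hypothetical stand-in for the Thrift PageHeader struct (one field shown).
    compressed_page_size: int  # page body bytes only, header excluded


@dataclass
class PageLocation:
    # Hypothetical stand-in for the Thrift PageLocation struct (one field shown).
    compressed_page_size: int  # serialized header bytes plus page body bytes


def page_location_size(header: PageHeader, serialized_header_len: int) -> int:
    """Return the size a PageLocation entry would record for this page."""
    return serialized_header_len + header.compressed_page_size


# Example values are made up: a 1000-byte page body behind a 24-byte header.
header = PageHeader(compressed_page_size=1000)
location = PageLocation(compressed_page_size=page_location_size(header, 24))
```

The two fields share a name but differ by the length of the serialized header, which is why the comment on PageLocation needed to say so explicitly.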
## Validation
Thrift definition compiles cleanly after all changes. No semantic/behavioral
changes to the format specification.
_Split from #572 for easier review._
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]