This is an automated email from the ASF dual-hosted git repository.

gangwu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git


The following commit(s) were added to refs/heads/master by this push:
     new e845e41  Add test file with empty DataPage v2 (#74)
e845e41 is described below

commit e845e41789ec8d3aa6317e6464cdbbca987bf91d
Author: Enrico Minack <[email protected]>
AuthorDate: Thu Apr 17 03:24:35 2025 +0200

    Add test file with empty DataPage v2 (#74)
---
 data/README.md                                 |   3 ++-
 data/datapage_v2_empty_datapage.snappy.parquet | Bin 0 -> 413 bytes
 2 files changed, 2 insertions(+), 1 deletion(-)

diff --git a/data/README.md b/data/README.md
index d9ab77c..8da8e3e 100644
--- a/data/README.md
+++ b/data/README.md
@@ -56,7 +56,8 @@
 | old_list_structure.parquet | Single LIST<LIST<INT32>> column with legacy 
two-level list structure. See [old_list_structure.md](old_list_structure.md) |
 | repeated_primitive_no_list.parquet | REPEATED INT32 and BYTE_ARRAY fields 
without LIST annotation. See 
[note](#REPEATED-primitive-fields-with-no-LIST-annotation) |
 | map_no_value.parquet | MAP with null values, MAP with INT32 keys and no 
values, and LIST<INT32> column with same values as the MAP keys. See 
[map_no_value.md](map_no_value.md) |
-| page_v2_empty_compressed.parquet | An INT32 column with DataPageV2, all 
values are null, the zero-sized data is compressed using ZSTD |
+| page_v2_empty_compressed.parquet | An INT32 column with DataPageV2, all 
values are null, the zero-sized data is compressed using ZSTD. This is a valid 
non-zero bytes ZSTD stream that uncompresses into 0 bytes. |
+| datapage_v2_empty_datapage.snappy.parquet | A compressed FLOAT column with 
DataPageV2, a single row, value is null, the file uses Snappy compression, but 
there is no data for uncompression (see [related 
issue](https://github.com/apache/arrow-rs/issues/7388)). The zero bytes must 
not be attempted to be uncompressed, as this is an invalid Snappy stream. |
 | unknown-logical-type.parquet | A file containing a column annotated with a 
LogicalType whose identifier has been set to an abitrary high value to check 
the behaviour of an old reader reading a file written by a new writer 
containing an unsupported type (see [related 
issue](https://github.com/apache/arrow/issues/41764)). |
 | int96_from_spark.parquet | Single column of (deprecated) int96 values that 
originated as Apache Spark microsecond-resolution timestamps. Some values are 
outside the range typically representable by 64-bit nanosecond-resolution 
timestamps. See [int96_from_spark.md](int96_from_spark.md) for details. |
 
diff --git a/data/datapage_v2_empty_datapage.snappy.parquet 
b/data/datapage_v2_empty_datapage.snappy.parquet
new file mode 100644
index 0000000..30d6fa7
Binary files /dev/null and b/data/datapage_v2_empty_datapage.snappy.parquet 
differ

Reply via email to