rluvaton opened a new issue, #17828:
URL: https://github.com/apache/datafusion/issues/17828

   I found a parquet file that datafusion-cli failed to read and opened a bug 
in:
   - https://github.com/apache/arrow-rs/issues/8495
   
   and added the file in:
   - https://github.com/apache/parquet-testing/pull/96
   
   I can't find it but does the CI run basic sanity on all the files in 
https://github.com/apache/parquet-testing data directory 
(https://github.com/apache/parquet-testing/tree/a3d96a65e11e2bbca7d22a894e8313ede90a33a3/data)
   
   like simple:
   ```bash
   find parquet-testing/data/ -name "*.parquet" -type f | while read file; do
     echo "Processing: $file"
     datafusion-cli --command "select * from '$file'"
   done
   ```
   
   for sanity test


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to