niebayes opened a new issue, #15456: URL: https://github.com/apache/datafusion/issues/15456
### Describe the bug Originally posted at https://github.com/clflushopt/tpchgen-rs/issues/73 **Describe the bug** Using datafusion-cli to read csv files generated by tpchgen-cli sometimes fails. This issue was originally posted at https://github.com/clflushopt/tpchgen-rs/pull/66#issuecomment-2754881275 **To Reproduce** Steps to reproduce the behavior: 1. Git clone the latest of the main branch (rev: https://github.com/clflushopt/tpchgen-rs/commit/cb325ad8a1754c21aa00708bf4302b1899d8bef4) 2. Build with cargo: `cargo build --release` 3. Generate csv files in a dedicated directory: `./target/release/tpchgen-cli -f csv -o gen` 4. Install the latest release of `datafusion-cli`: `cargo install datafusion-cli` 5. Ensure `datafusion-cli` is of version `46.0.1` - Start datafusion-cli by running `datafusion-cli` in your terminal. - Run sql `select version()` which should print a message containing `46.0.1`. 6. Use `datafusion-cli` to read the `part.csv` file by running sql `select * from './gen/part.csv'`. Note be sure to not add extra limit clause. 7. The datafusion-cli should report an error like: `Arrow error: Parser error: Error while parsing value p_partkey for column 0 at line 24597` **Expected behavior** All csv files generated by tpchgen-cli should be readable by datafusion-cli. **Screenshots** <img width="874" alt="Image" src="https://github.com/user-attachments/assets/53dce417-9709-4786-a777-b587ec35673b" /> **Environment (please complete the following information):** - OS: Darwin Mac 24.0.0 Darwin Kernel Version 24.0.0: Mon Aug 12 20:52:18 PDT 2024; root:xnu-11215.1.10~2/RELEASE_ARM64_T8122 arm64 - Compiler Version: 1.85.1 ### To Reproduce _No response_ ### Expected behavior _No response_ ### Additional context _No response_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org