niebayes opened a new issue, #15456:
URL: https://github.com/apache/datafusion/issues/15456

   ### Describe the bug
   
   Originally posted at https://github.com/clflushopt/tpchgen-rs/issues/73
   
   **Describe the bug**
   
   Using datafusion-cli to read csv files generated by tpchgen-cli sometimes 
fails.
   This issue was originally posted at 
https://github.com/clflushopt/tpchgen-rs/pull/66#issuecomment-2754881275
   
   **To Reproduce**
   Steps to reproduce the behavior:
   1. Git clone the latest of the main branch (rev: 
https://github.com/clflushopt/tpchgen-rs/commit/cb325ad8a1754c21aa00708bf4302b1899d8bef4)
   2. Build with cargo: `cargo build --release`
   3. Generate csv files in a dedicated directory: 
`./target/release/tpchgen-cli -f csv -o gen`
   4. Install the latest release of `datafusion-cli`: `cargo install 
datafusion-cli`
   5. Ensure `datafusion-cli` is of version `46.0.1`
     - Start datafusion-cli by running `datafusion-cli` in your terminal.
     - Run sql `select version()` which should print a message containing 
`46.0.1`.
   6. Use `datafusion-cli` to read the `part.csv` file by running sql `select * 
from './gen/part.csv'`. Note be sure to not add extra limit clause.
   7. The datafusion-cli should report an error like: `Arrow error: Parser 
error: Error while parsing value p_partkey for column 0 at line 24597` 
   
   **Expected behavior**
   All csv files generated by tpchgen-cli should be readable by datafusion-cli.
   
   **Screenshots**
   
   <img width="874" alt="Image" 
src="https://github.com/user-attachments/assets/53dce417-9709-4786-a777-b587ec35673b";
 />
   
   **Environment (please complete the following information):**
    - OS: Darwin Mac 24.0.0 Darwin Kernel Version 24.0.0: Mon Aug 12 20:52:18 
PDT 2024; root:xnu-11215.1.10~2/RELEASE_ARM64_T8122 arm64
    - Compiler Version: 1.85.1
   
   ### To Reproduce
   
   _No response_
   
   ### Expected behavior
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to