Omega359 commented on issue #19425:
URL: https://github.com/apache/datafusion/issues/19425#issuecomment-3702556483

   On my machine it takes till 8G to encounter the buffer fill error:
   ```
   bruce@fedora:~/dev/datafusion$ datafusion-cli -m 1G -c "SELECT \"UserID\", 
extract(minute FROM to_timestamp_seconds(\"EventTime\")) AS m, 
\"SearchPhrase\", COUNT(*) FROM 'benchmarks/data/hits.parquet' GROUP BY 
\"UserID\", m, \"SearchPhrase\" ORDER BY COUNT(*) DESC LIMIT 10;"
   DataFusion CLI v51.0.0
   
   thread 'tokio-runtime-worker' (59949) panicked at 
/home/bruce/.cargo/registry/src/index.crates.io-1949cf8c6b5b557f/datafusion-physical-plan-51.0.0/src/aggregates/group_values/multi_group_by/bytes_view.rs:466:53:
   range end index 2034086 out of range for slice of length 4481
   note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
   
   thread 'tokio-runtime-worker' (59940) panicked at 
/home/bruce/.cargo/registry/src/index.crates.io-1949cf8c6b5b557f/datafusion-physical-plan-51.0.0/src/aggregates/group_values/multi_group_by/bytes_view.rs:466:53:
   range end index 2069630 out of range for slice of length 66557
   
   thread 'tokio-runtime-worker' (59936) panicked at 
/home/bruce/.cargo/registry/src/index.crates.io-1949cf8c6b5b557f/datafusion-physical-plan-51.0.0/src/aggregates/group_values/multi_group_by/bytes_view.rs:466:53:
   range end index 1653741 out of range for slice of length 81826
   
   thread 'tokio-runtime-worker' (59946) panicked at 
/home/bruce/.cargo/registry/src/index.crates.io-1949cf8c6b5b557f/datafusion-physical-plan-51.0.0/src/aggregates/group_values/multi_group_by/bytes_view.rs:466:53:
   range end index 2030313 out of range for slice of length 636
   Error: Join Error
   caused by
   External error: task 84 panicked with message "range end index 2069630 out 
of range for slice of length 66557"
   bruce@fedora:~/dev/datafusion$ datafusion-cli -m 4G -c "SELECT \"UserID\", 
extract(minute FROM to_timestamp_seconds(\"EventTime\")) AS m, 
\"SearchPhrase\", COUNT(*) FROM 'benchmarks/data/hits.parquet' GROUP BY 
\"UserID\", m, \"SearchPhrase\" ORDER BY COUNT(*) DESC LIMIT 10;"
   DataFusion CLI v51.0.0
   
   thread 'tokio-runtime-worker' (60084) panicked at 
/home/bruce/.cargo/registry/src/index.crates.io-1949cf8c6b5b557f/datafusion-physical-plan-51.0.0/src/aggregates/group_values/multi_group_by/bytes_view.rs:466:53:
   range end index 2077728 out of range for slice of length 825
   note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
   Error: Join Error
   caused by
   External error: task 44 panicked with message "range end index 2077728 out 
of range for slice of length 825"
   bruce@fedora:~/dev/datafusion$ datafusion-cli -m 8G -c "SELECT \"UserID\", 
extract(minute FROM to_timestamp_seconds(\"EventTime\")) AS m, 
\"SearchPhrase\", COUNT(*) FROM 'benchmarks/data/hits.parquet' GROUP BY 
\"UserID\", m, \"SearchPhrase\" ORDER BY COUNT(*) DESC LIMIT 10;"
   DataFusion CLI v51.0.0
   Error: Arrow error: Io error: failed to fill whole buffer
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to