Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-17 Thread via GitHub
parthchandra commented on PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#issuecomment-3085617517 Merged. Thank you for the reviews @andygrove @comphead @Kontinuation! -- This is an automated message from the Apache Git Service. To respond to the message, please log on

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-17 Thread via GitHub
parthchandra merged PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-17 Thread via GitHub
parthchandra commented on PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#issuecomment-3085613885 @comphead @Kontinuation @drexler-sky Opened a tracking issue for fs-hdfs issues: https://github.com/apache/datafusion-comet/issues/2034

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-17 Thread via GitHub
parthchandra commented on PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#issuecomment-3084912625 > > reminds me another problem with fs-hdfs. > > The `HdfsErr` returned by `fs-hdfs` read functions does not contain JVM stack traces. If there's a read failure caused by

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-17 Thread via GitHub
parthchandra commented on code in PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#discussion_r2213926681 ## native/hdfs/src/object_store/hdfs.rs: ## @@ -88,19 +88,33 @@ impl HadoopFileSystem { fn read_range(range: &Range, file: &HdfsFile) -> Result {

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-17 Thread via GitHub
comphead commented on PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#issuecomment-3084571582 > This patch looks good to me, and it reminds me another problem with fs-hdfs. > > The `HdfsErr` returned by `fs-hdfs` read functions does not contain JVM stack traces.

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-16 Thread via GitHub
comphead commented on code in PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#discussion_r2212009586 ## native/hdfs/src/object_store/hdfs.rs: ## @@ -88,19 +88,33 @@ impl HadoopFileSystem { fn read_range(range: &Range, file: &HdfsFile) -> Result {

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-16 Thread via GitHub
parthchandra commented on PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#issuecomment-3081817670 @Kontinuation @andygrove @comphead, updated based on review comments

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-16 Thread via GitHub
parthchandra commented on code in PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#discussion_r2210968794 ## native/hdfs/src/object_store/hdfs.rs: ## @@ -88,19 +88,18 @@ impl HadoopFileSystem { fn read_range(range: &Range, file: &HdfsFile) -> Result {

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-16 Thread via GitHub
Kontinuation commented on PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#issuecomment-3079205816 > Sorry @Kontinuation if I check your references https://github.com/datafusion-contrib/fs-hdfs/blob/8c03c5ef0942b75abc79ed673931355fa9552131/c_src/libhdfs/hdfs.c#L1564C15-L1

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-16 Thread via GitHub
Kontinuation commented on code in PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#discussion_r2210729874 ## native/hdfs/src/object_store/hdfs.rs: ## @@ -141,13 +140,15 @@ impl ObjectStore for HadoopFileSystem { let file_status = file.get_file_sta

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-16 Thread via GitHub
Kontinuation commented on code in PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#discussion_r2210716251 ## native/hdfs/src/object_store/hdfs.rs: ## @@ -88,19 +88,18 @@ impl HadoopFileSystem { fn read_range(range: &Range, file: &HdfsFile) -> Result {

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-16 Thread via GitHub
comphead commented on code in PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#discussion_r2210658711 ## native/hdfs/src/object_store/hdfs.rs: ## @@ -88,19 +88,18 @@ impl HadoopFileSystem { fn read_range(range: &Range, file: &HdfsFile) -> Result {

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-15 Thread via GitHub
andygrove commented on code in PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#discussion_r2209180081 ## native/hdfs/src/object_store/hdfs.rs: ## @@ -88,19 +88,18 @@ impl HadoopFileSystem { fn read_range(range: &Range, file: &HdfsFile) -> Result {

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-15 Thread via GitHub
codecov-commenter commented on PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#issuecomment-3076413162 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/2031?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

Re: [PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-15 Thread via GitHub
parthchandra commented on PR #2031: URL: https://github.com/apache/datafusion-comet/pull/2031#issuecomment-3076378047 @Kontinuation, fyi

[PR] fix: hdfs read into buffer fully [datafusion-comet]

2025-07-15 Thread via GitHub
parthchandra opened a new pull request, #2031: URL: https://github.com/apache/datafusion-comet/pull/2031 The `get` and `read_range` methods in the hdfs object store implementation do not always read all of the data requested, because the underlying hdfs call may return fewer bytes than requested. ht
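
The short-read problem described in the pull request can be illustrated with a minimal sketch. This is not the code from the PR: it uses the standard `std::io::Read` trait as a stand-in for the hdfs read call, and `read_fully` and `ShortReader` are hypothetical names introduced only for this example. The idea is to keep calling the underlying read until the buffer is full or the source reports end of data.

```rust
use std::io::{self, Read};

/// Hypothetical helper: reads until `buf` is full or the source reports
/// end of data, and returns the number of bytes actually read.
/// A single `read` call may legally return fewer bytes than requested,
/// so the call is repeated on the unfilled tail of the buffer.
fn read_fully<R: Read>(src: &mut R, buf: &mut [u8]) -> io::Result<usize> {
    let mut filled = 0;
    while filled < buf.len() {
        match src.read(&mut buf[filled..]) {
            Ok(0) => break, // end of data before the buffer was filled
            Ok(n) => filled += n,
            Err(e) if e.kind() == io::ErrorKind::Interrupted => continue,
            Err(e) => return Err(e),
        }
    }
    Ok(filled)
}

/// Hypothetical reader that returns at most `chunk` bytes per call,
/// used here only to imitate a source that produces short reads.
struct ShortReader<'a> {
    data: &'a [u8],
    chunk: usize,
}

impl<'a> Read for ShortReader<'a> {
    fn read(&mut self, buf: &mut [u8]) -> io::Result<usize> {
        let n = self.chunk.min(buf.len()).min(self.data.len());
        buf[..n].copy_from_slice(&self.data[..n]);
        self.data = &self.data[n..];
        Ok(n)
    }
}
```

A caller that needs an exact range can compare the returned count with the requested length and treat a shortfall as an error, rather than assuming that one read call filled the buffer.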