Re: dataset write stucks on ThrottledAsyncTaskSchedulerImpl

2023-07-28 Thread Weston Pace
> How many io threads are writing concurrently in a single write_dataset > call? With the default options, and no partitioning, it will only use 1 I/O thread. This is because we do not write to a single file in parallel. If you change FileSystemDatasetWriteOptions::max_rows_per_file then you may

Re: [DISC] ADBC 1.1.0 API revision

2023-07-28 Thread David Li
A late update here: I've merged most of the proposed changes and implementations for ADBC 1.1.0 into a branch [1]. The remaining work is to finish up the C++ implementation and Python bindings, after which I'll propose a vote, hopefully in the next 2 weeks. Any feedback on the API proposal so fa

Re: dataset write stucks on ThrottledAsyncTaskSchedulerImpl

2023-07-28 Thread Wenbo Hu
Hi, Thanks for your detailed explanation, I made some experiment today. Before experiment, 1. To limit the resources used by the server, I use docker, which uses cgroups. But "free" does not respect the resource limit inside the container. 2. I measured the write speed on the host by "dd if=/de