Re: Flight benchmark question

2020-06-17 Thread Yibo Cai
On 6/17/20 8:33 PM, David Li wrote:

Re: Flight benchmark question

2020-06-17 Thread David Li
Hey Yibo, Thanks for investigating this! This is a great writeup. There was a PR recently to let clients set gRPC options like this, so it can be enabled on a case-by-case basis: https://github.com/apache/arrow/pull/7406 So we could add that to the benchmark or suggest it in documentation. I thi

Re: Flight benchmark question

2020-06-17 Thread Chengxin Ma
Hi Yibo, Your discovery is impressive. Did you consider the `num_streams` parameter [1] as well? If I understood correctly, this parameter is used for setting the conceptual concurrent streams between the client and the server, while `num_threads` is used for setting the size of the thread p

Re: Flight benchmark question

2020-06-16 Thread Yibo Cai
Found a way to achieve reasonable benchmark results with multiple threads. Diff pasted below for a quick review or try. Tested on E5-2650, with this change: num_threads = 1, speed = 1996; num_threads = 2, speed = 3555; num_threads = 4, speed = 5828. When running `arrow_flight_benchmark`, I find there
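To put the three figures above in perspective, here is a minimal sketch that computes speedup and per-thread efficiency relative to the single-thread run. The numbers are copied from the message; the function name `scaling` and the dictionary `measured` are illustrative only and are not part of the benchmark, and the speed unit is left unstated because the message does not give one.

```python
# Throughput figures copied from the message above (unit not stated there).
measured = {1: 1996, 2: 3555, 4: 5828}


def scaling(results):
    """Return {threads: (speedup, efficiency)} relative to the smallest thread count.

    speedup    = speed / baseline speed
    efficiency = speedup / (threads / baseline threads)
    """
    base_threads = min(results)
    base_speed = results[base_threads]
    out = {}
    for threads, speed in sorted(results.items()):
        speedup = speed / base_speed
        efficiency = speedup / (threads / base_threads)
        out[threads] = (speedup, efficiency)
    return out


if __name__ == "__main__":
    for threads, (speedup, eff) in scaling(measured).items():
        print(f"num_threads={threads}: speedup={speedup:.2f}x, efficiency={eff:.0%}")
```

With these inputs the speedup comes out at roughly 1.78x for two threads and 2.92x for four, so throughput now rises with the thread count, though less than linearly.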

Re: Flight benchmark question

2020-06-15 Thread Wes McKinney
On Mon, Jun 15, 2020 at 8:43 AM Antoine Pitrou wrote: > > > On 15/06/2020 at 15:36, Wes McKinney wrote: > > > > When you have only a single server, all the gRPC traffic goes through > > a common port and is handled by a common server, so if both client and > > server are roughly IO bound you are

Re: Flight benchmark question

2020-06-15 Thread Antoine Pitrou
On 15/06/2020 at 15:36, Wes McKinney wrote: > > When you have only a single server, all the gRPC traffic goes through > a common port and is handled by a common server, so if both client and > server are roughly IO bound you aren't going to get better performance > by hitting the server with m

Re: Flight benchmark question

2020-06-15 Thread Wes McKinney
We had a _very_ similar discussion in April https://lists.apache.org/thread.html/rd2aa01f460dd1092c60d1ba75087c2ce87c81ac543a246549b4713fb%40%3Cdev.arrow.apache.org%3E When you have only a single server, all the gRPC traffic goes through a common port and is handled by a common server, so if both

Flight benchmark question

2020-06-15 Thread Yibo Cai
I'm evaluating the Flight benchmark [1] on a single host and have run into a problem I'd like help with. The Flight benchmark has a "num_threads" parameter [1] to set the "number of concurrent gets". Counter-intuitively, setting it to larger values drops performance, "arrow-flight-benchmark --num_threads=1"