Lets say am using 4 machines with 3gb ram. My data is customers records with 5 columns each in two tables with 0.5 million records. I want to perform join query on these two tables.
Thanks, Udbhav Agarwal From: Akhil Das [mailto:[email protected]] Sent: 13 March, 2015 12:16 PM To: Udbhav Agarwal Cc: [email protected] Subject: Re: spark sql performance The size/type of your data, and your cluster configuration would be fine i think. Thanks Best Regards On Fri, Mar 13, 2015 at 12:07 PM, Udbhav Agarwal <[email protected]<mailto:[email protected]>> wrote: Thanks Akhil, What more info should I give so we can estimate query time in my scenario? Thanks, Udbhav Agarwal From: Akhil Das [mailto:[email protected]<mailto:[email protected]>] Sent: 13 March, 2015 12:01 PM To: Udbhav Agarwal Cc: [email protected]<mailto:[email protected]> Subject: Re: spark sql performance That totally depends on your data size and your cluster setup. Thanks Best Regards On Thu, Mar 12, 2015 at 7:32 PM, Udbhav Agarwal <[email protected]<mailto:[email protected]>> wrote: Hi, What is query time for join query on hbase with spark sql. Say tables in hbase have 0.5 million records each. I am expecting a query time (latency) in milliseconds with spark sql. Can this be possible ? Thanks, Udbhav Agarwal
