Hbase is 0.98.0 Phoenix is 4.0
On Sat, Sep 6, 2014 at 12:04 PM, Vikas Agarwal <vi...@infoobjects.com> wrote: > Yes, that is why it is a trouble for me. However, on contrary, HBase shell > is also on the same machine and same environment, so if it is an issue of > resource (CPU or memory) it should have affected the HBase too, but HBase > is able to give me results within 0.0150 seconds. :( > > No, I haven't tested it outside AWS. I guess, it should not be the case > due to much better performance by native HBase query on HBase shell. > > > On Sat, Sep 6, 2014 at 11:59 AM, James Taylor <jamestay...@apache.org> > wrote: > >> Something is up in your environment. What version of Phoenix and HBase >> are you using and in what environment? Have you tried this locally, >> outside of AWS to compare? >> >> Take a look at our perf numbers, generated more-or-less daily, and >> which run over more data that what you're testing against: >> http://phoenix-bin.github.io/client/performance/phoenix-20140904095313.htm >> >> Some of these are point queries and they take in the neighborhood of >> 0.01 seconds. >> >> Thanks, >> James >> >> On Fri, Sep 5, 2014 at 10:48 PM, Vikas Agarwal <vi...@infoobjects.com> >> wrote: >> > Missed to mention that count query (posted in my last mail) is also >> taking >> > very long time to return the count. >> > >> > >> > On Sat, Sep 6, 2014 at 11:17 AM, Vikas Agarwal <vi...@infoobjects.com> >> > wrote: >> >> >> >> As I mentioned, schema is nothing but bunch of fields (some being >> >> integers, longs and text) along with primary key (row key) and I am >> making >> >> simple query to get result for a particular primary key, nothing more >> than >> >> that. >> >> >> >> 0: jdbc:phoenix:localhost> SELECT count(1) FROM table_name; >> >> >> >> +------------+ >> >> >> >> | COUNT(1) | >> >> >> >> +------------+ >> >> >> >> | 4667515 | >> >> >> >> +------------+ >> >> >> >> 1 row selected (132.11 seconds) >> >> >> >> >> >> >> >> On Sat, Sep 6, 2014 at 11:09 AM, Puneet Kumar Ojha >> >> <puneet.ku...@pubmatic.com> wrote: >> >>> >> >>> If you can share the schema,data type,cardinality of each dimension >> and >> >>> usual queries, I can help to design a schema with performance of less >> than 1 >> >>> sec using Phoenix. >> >>> >> >>> >> >>> >> >>> Thanks >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> ------ Original message------ >> >>> >> >>> From: James Taylor >> >>> >> >>> Date: Sat, Sep 6, 2014 10:15 AM >> >>> >> >>> To: user; >> >>> >> >>> Subject:Re: Phoenix response time >> >>> >> >>> >> >>> >> >>> Vikas, >> >>> Please post your schema and query. >> >>> Thanks, >> >>> James >> >>> >> >>> On Fri, Sep 5, 2014 at 9:18 PM, Vikas Agarwal <vi...@infoobjects.com> >> >>> wrote: >> >>> > Ours is also a single node setup right now and as of now there are >> less >> >>> > than >> >>> > 1 million rows which is expected to grow around 100m at minimum. >> >>> > >> >>> > I am aware of secondary indexes but when I am querying on >> primary/row >> >>> > key, >> >>> > why would it take so much time? >> >>> > >> >>> > I am directly querying using sqlline for Phoenix and hbase shell for >> >>> > HBase >> >>> > query. I am not expecting to do any fine tuning for such small >> dataset. >> >>> > I am >> >>> > assumimg a minimum performance level out of the box. >> >>> > >> >>> > On Friday, September 5, 2014, yeshwanth kumar < >> yeshwant...@gmail.com> >> >>> > wrote: >> >>> >> >> >>> >> hi vikas, >> >>> >> >> >>> >> we used phoenix on a 4 core/23Gb machine, as a single node setup. >> >>> >> used HDP 2.1 >> >>> >> our table has 50-70M rows, >> >>> >> select on that table took less than 2 seconds. >> >>> >> Aggregation queries took less than 8 seconds. >> >>> >> for achieving good performance we created secondary index on the >> >>> >> table. >> >>> >> >> >>> >> make sure you finetuned hbase, >> >>> >> enabling compression on the data makes a difference in response. >> >>> >> if u distribute the data and load over all regions in hbase, >> >>> >> look at the performance tips mentioned in phoenix blog >> >>> >> >> >>> >> -yeshwanth >> >>> >> >> >>> >> >> >>> >> >> >>> >> Cheers, >> >>> >> Yeshwanth >> >>> >> >> >>> >> >> >>> >> >> >>> >> On Fri, Sep 5, 2014 at 5:42 PM, Vikas Agarwal < >> vi...@infoobjects.com> >> >>> >> wrote: >> >>> >>> >> >>> >>> Hi, >> >>> >>> >> >>> >>> Preface: We are testing phoenix using Hortonworks distribution for >> >>> >>> HBase >> >>> >>> on Amazon EC2 instance (r3.large, 2 CPU/15 GB RAM). >> >>> >>> >> >>> >>> With contrast to performance benchmarks, I found Phoenix to be >> very >> >>> >>> slow >> >>> >>> in querying even on primary key or row key. So, tried to increase >> the >> >>> >>> RAM >> >>> >>> for HBase and Phoenix and increasing the CPU and RAM by upgrading >> the >> >>> >>> EC2 >> >>> >>> machine type to r3.xlarge (4 CPU, 30 GB RAM). Results were like >> this: >> >>> >>> >> >>> >>> Time takes in returning result of query on row key: >> >>> >>> With Storm running and very less RAM available: 50 sec >> >>> >>> >> >>> >>> With Storm stopped and RAM available to Phoenix and HBase: 18 sec >> >>> >>> >> >>> >>> With new machine of next higher category (4 CPU and 30 GB RAM): 8 >> sec >> >>> >>> >> >>> >>> Pure HBase query by row key with Storm stopped and (2 CPU, 15 GB >> >>> >>> RAM): >> >>> >>> 0.0150 seconds. :) >> >>> >>> >> >>> >>> So, the difference seems to be many fold of what native HBase is >> >>> >>> providing to us. I am not able to understand how it can be >> possible? >> >>> >>> What I >> >>> >>> am missing here? >> >>> >>> >> >>> >>> -- >> >>> >>> Regards, >> >>> >>> Vikas Agarwal >> >>> >>> 91 – 9928301411 >> >>> >>> >> >>> >>> InfoObjects, Inc. >> >>> >>> Execution Matters >> >>> >>> http://www.infoobjects.com >> >>> >>> 2041 Mission College Boulevard, #280 >> >>> >>> Santa Clara, CA 95054 >> >>> >>> +1 (408) 988-2000 Work >> >>> >>> +1 (408) 716-2726 Fax >> >>> >> >> >>> >> >> >>> > >> >>> > >> >>> > -- >> >>> > Regards, >> >>> > Vikas Agarwal >> >>> > 91 – 9928301411 >> >>> > >> >>> > InfoObjects, Inc. >> >>> > Execution Matters >> >>> > http://www.infoobjects.com >> >>> > 2041 Mission College Boulevard, #280 >> >>> > Santa Clara, CA 95054 >> >>> > +1 (408) 988-2000 Work >> >>> > +1 (408) 716-2726 Fax >> >>> > >> >>> > >> >> >> >> >> >> >> >> >> >> -- >> >> Regards, >> >> Vikas Agarwal >> >> 91 – 9928301411 >> >> >> >> InfoObjects, Inc. >> >> Execution Matters >> >> http://www.infoobjects.com >> >> 2041 Mission College Boulevard, #280 >> >> Santa Clara, CA 95054 >> >> +1 (408) 988-2000 Work >> >> +1 (408) 716-2726 Fax >> > >> > >> > >> > >> > -- >> > Regards, >> > Vikas Agarwal >> > 91 – 9928301411 >> > >> > InfoObjects, Inc. >> > Execution Matters >> > http://www.infoobjects.com >> > 2041 Mission College Boulevard, #280 >> > Santa Clara, CA 95054 >> > +1 (408) 988-2000 Work >> > +1 (408) 716-2726 Fax >> > > > > -- > Regards, > Vikas Agarwal > 91 – 9928301411 > > InfoObjects, Inc. > Execution Matters > http://www.infoobjects.com > 2041 Mission College Boulevard, #280 > Santa Clara, CA 95054 > +1 (408) 988-2000 Work > +1 (408) 716-2726 Fax > > -- Regards, Vikas Agarwal 91 – 9928301411 InfoObjects, Inc. Execution Matters http://www.infoobjects.com 2041 Mission College Boulevard, #280 Santa Clara, CA 95054 +1 (408) 988-2000 Work +1 (408) 716-2726 Fax