Re: Pycassa vs YCSB results.

2013-02-06 Thread Tim Wintle
On Tue, 2013-02-05 at 13:51 -0500, Edward Capriolo wrote: > Without stating the obvious, if you are interested in scale, then why > pick python?. I would (kind of) agree with this point.. If you absolutely need performance here then python isn't the right choice. If, however, you are currently w

Re: Pycassa vs YCSB results.

2013-02-05 Thread Edward Capriolo
Without stating the obvious, if you are interested in scale, then why pick python?. I did want to point out that YCSB is not even the gold standard for benchmarks using cassandra's stress you can get more ops per sec then YCSB. On Tue, Feb 5, 2013 at 1:13 PM, Pradeep Kumar Mantha wrote: > Thanks,

Re: Pycassa vs YCSB results.

2013-02-05 Thread Pradeep Kumar Mantha
Thanks, I will use the multiprocessing package, since I need to scale it to multiple nodes. I will also try to optimize the function calls and use global variables. Thank you very much for your help. On Tue, Feb 5, 2013 at 9:12 AM, aaron morton wrote: > The simple thing to do would be use the

Re: Pycassa vs YCSB results.

2013-02-05 Thread aaron morton
The simple thing to do would be use the multiprocessing package and eliminate all shared state. On a multicore box python threads can run on different cores and battle over obtaining the GIL. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http:/

Re: Pycassa vs YCSB results.

2013-02-05 Thread Tim Wintle
On Tue, 2013-02-05 at 21:38 +1300, aaron morton wrote: > The first thing I noticed is your script uses python threading library, which > is hampered by the Global Interpreter Lock > http://docs.python.org/2/library/threading.html > > You don't really have multiple threads running in parallel, tr

Re: Pycassa vs YCSB results.

2013-02-05 Thread aaron morton
The first thing I noticed is your script uses python threading library, which is hampered by the Global Interpreter Lock http://docs.python.org/2/library/threading.html You don't really have multiple threads running in parallel, try using the multiprocessor library. Cheers -

Re: Pycassa vs YCSB results.

2013-02-04 Thread Pradeep Kumar Mantha
Hi, Could some one please let me know any hints, why the pycassa client(attached) is much slower than the YCSB? is it something to attribute to performance difference between python and Java? or the pycassa api has some performance limitations? I don't see any client statements affecting the pyca

Re: Pycassa vs YCSB results.

2013-01-31 Thread Pradeep Kumar Mantha
Thanks.. Please find the script as attachment. Just re-iterating. Its just a simple python script which submit 4 threads. This script has been scheduled on 8 cores using taskset unix command , thus running 32 threads/node. and then scaling to 16 nodes thanks pradeep On Thu, Jan 31, 2013 at 4:38

Re: Pycassa vs YCSB results.

2013-01-31 Thread Tyler Hobbs
Can you provide the python script that you're using? (I'm moving this thread to the pycassa mailing list ( pycassa-disc...@googlegroups.com), which is a better place for this discussion.) On Thu, Jan 31, 2013 at 6:25 PM, Pradeep Kumar Mantha wrote: > Hi, > > I am trying to benchmark cassandra o