So I changed concurrency to 10 and put all the IPs of the nodes in basho
bench config.
Throughput is now around 1500.


On Wed, Jun 27, 2012 at 4:40 PM, Russell Brown <russell.br...@mac.com>wrote:

>
> On 27 Jun 2012, at 12:09, Yousuf Fauzan wrote:
>
> I used examples/riakc_pb.config
>
> {mode, max}.
>
> {duration, 10}.
>
> {concurrent, 1}.
>
>
> Try upping this. On my local 3 node cluster with 8gb ram and an old, cheap
> quad core per box I'd set concurrency to 10 workers.
>
>
> {driver, basho_bench_driver_riakc_pb}.
>
> {key_generator, {int_to_bin, {uniform_int, 10000}}}.
>
> {value_generator, {fixed_bin, 10000}}.
>
> {riakc_pb_ips, [{<IP of one of the nodes>}]}.
>
>
> I add all the IPs here, one entry per node.
>
>
> {riakc_pb_replies, 1}.
>
> {operations, [{get, 1}, {update, 1}]}.
>
>
> On Wed, Jun 27, 2012 at 4:37 PM, Russell Brown <russell.br...@mac.com>wrote:
>
>>
>> On 27 Jun 2012, at 12:05, Yousuf Fauzan wrote:
>>
>> I did use basho bench on my clusters. It should throughput of around 150
>>
>>
>> Could you share the config you used, please?
>>
>>
>> On Wed, Jun 27, 2012 at 4:24 PM, Russell Brown <russell.br...@mac.com>wrote:
>>
>>>
>>> On 27 Jun 2012, at 11:50, Yousuf Fauzan wrote:
>>>
>>> Its not about the difference in throughput in the two approaches I took.
>>> Rather, the issue is that even 200 writes/sec is a bit on the lower side.
>>> I could be doing something wrong with the configuration because people
>>> are reporting throughputs of 2-3k ops/sec
>>>
>>> If anyone here could guide me in setting up a cluster which would give
>>> such kind of throughput.
>>>
>>>
>>> To get the kind of throughput I use multiple threads / workers. Have you
>>> looked at basho_bench[1], it is a simple, reliable tool to benchmark Riak
>>> clusters?
>>>
>>> Cheers
>>>
>>> Russell
>>>
>>> [1] Basho Bench - https://github.com/basho/basho_bench and
>>> http://wiki.basho.com/Benchmarking.html
>>>
>>>
>>> Thanks,
>>> Yousuf
>>>
>>> On Wed, Jun 27, 2012 at 4:02 PM, Eric Anderson 
>>> <ander...@copperegg.com>wrote:
>>>
>>>> On Jun 27, 2012, at 5:13 AM, Yousuf Fauzan <yousuffau...@gmail.com>
>>>> wrote:
>>>>
>>>> Hi,
>>>>
>>>> I setup a 3 machine riak SM cluster. Each machine used 4GB Ram and riak
>>>> OpenSource SmartMachine Image.
>>>>
>>>> Afterwards I tried loading data by following two methods
>>>> 1. Bash script
>>>> #!/bin/bash
>>>> echo $(date)
>>>> for (( c=1; c<=1000; c++ ))
>>>> do
>>>> curl -s -d 'this is a test' -H "Content-Type: text/plain"
>>>> http://127.0.0.1:8098/buckets/test/keys
>>>> done
>>>> echo $(date)
>>>>
>>>> 2. Python Riak Client
>>>> c=riak.RiakClient("10.112.2.185")
>>>> b=c.bucket("test")
>>>> for i in xrange(10000):o=b.new(str(i), str(i)).store()
>>>>
>>>> For case 1, throughput was 25 writes/sec
>>>> For case 2, throughput was 200 writes/sec
>>>>
>>>> Maybe I am making a fundamental mistake somewhere. I tried the above
>>>> two scripts on EC2 clusters too and still got the same performance.
>>>>
>>>> Please, someone help
>>>>
>>>>
>>>>
>>>> The major difference between these two is the first is executing a
>>>> binary, which has to basically create everything (connection, payload, etc)
>>>> every time through the loop.  The second does not - it creates the client
>>>> once, then iterates over it keeping the same client and presumably the same
>>>> connection as well.  That makes a huge difference.
>>>>
>>>> I would not use curl to do performance testing.  What you probably want
>>>> is something like your python script that will work on many
>>>> threads/processes at once (or fire them up many times).
>>>>
>>>>
>>>> Eric Anderson
>>>> Co-Founder
>>>> CopperEgg
>>>>
>>>>
>>>>
>>>>
>>>>
>>> _______________________________________________
>>> riak-users mailing list
>>> riak-users@lists.basho.com
>>> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
>>>
>>>
>>>
>>
>>
>
>
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to