On 14 Nov 2011, at 11:47, Nitish Sharma wrote:

> Hi,
> This is more sort of a discussion than a question. I am just trying to see 
> the trend in how users import their data to Riak. 
> For the data I am using, I was able to achieve almost 150 records/second with 
> PHP library, and 400 records/second with node.js (fairly new with node; was 
> hitting memory wall when trying to import 1 million records).
> What are some hacks/tricks/tweaks to import large amount of data to Riak?

New keys, new data, straight in for the first time, no fetch before store? I've 
had reasonable results creating a *number* of threads and using the Java Raw PB 
client to write. 

For example, maybe have a 1 or a couple of threads that reads data (from 
Oracle, a file, what-have-you) and puts it on a queue, and have a bunch of 
threads that pull data off the queue, create a riak object and store it. From 
my laptop I've got up to 2500 writes a second like this, and it was just ad 
hoc, throw away code with 4 threads against a small 3 node cluster (running on 
desktops.)

I imagine others on the list have more direct, real world examples?

Cheers

Russell

> 
> Cheers
> Nitish
> _______________________________________________
> riak-users mailing list
> riak-users@lists.basho.com
> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com


_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to