Riak Recap for Dec. 20 - 21

2010-12-22 Thread Mark Phillips
Morning, Afternoon, Evening to All,

For today's Recap we have more Spring/Grails work from Jon Brisbin and
various Q & A.

Enjoy.

On an administrative note, I won't be writing a Recap this Friday on
account of the holiday, but if someone wants to step up and write one,
please feel free. If not, regularly scheduled Recaps will pick back up
on Monday, 12/27.

Happy Holidays to everyone. (Remember to give the gift of Riak this year.)


Mark

Community Manager
Basho Technologies
wiki.basho.com
twitter.com/pharkmillups

Riak Recap for Dec. 20 - 21

1) For any of you Vector Clock fans out there, a project called
"goclock" just surfaced, which aims to provide full vector clock
support for Google's Go. The post references a few Basho blogs on
vector clocks that some of you might recognize.

http://blog.labix.org/2010/12/21/vector-clock-support-for-go

2) More Spring/Grails awesomeness from Jon. Brisbin

Details here --->
http://twitter.com/j_brisbin/status/16970306535034880 and here --->
http://twitter.com/j_brisbin/status/17321732662624257

* Related tweet from @wmacgyver that's worth passing along - (because
we agree :) http://twitter.com/wmacgyver/status/17340803990953985)


3) Q ---  Are there any open issues on ripple that people can work on
who have a good knowledge of Ruby but are new to Riak?

   A --- There sure are. Feel free to pick any one that interest you.
See https://github.com/seancribbs/ripple/issues#issue to get involved.

4)  SamuraiJack__ posted this in #riak on 12/20:

"I thought someone from this channel may find this interesting:
http://joose.it/blog/2010/10/11/introducing-kiokujs/ It has no Riak
backend at the moment, but should be trivial to realize,

5) @peschkaj  gave an interview about why is he spending more of his
time these days on NoSQL DBs. Riak gets a few shout outs.

Blog here ---> 
http://www.brentozar.com/archive/2010/12/jeremiah-peschka-buck-woody-talk-database-communities/

6) This question appeared on Quora yesterday:
http://www.quora.com/Databases/When-would-you-use-Riak-HyperTable-or-MongoDB

(@ericflo took an admirable stab at it but it could definitely use
some fleshing out if anyone is feeling brave. )

___
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com


Java JCA wrapper for Riak

2010-12-22 Thread bruce kissinger
Most of my day-to-day programming involves using a Weblogic application
server.  Weblogic (like many app servers) supports the J2EE Connector
Architecture (JCA) specification as a way to develop a plugin to a
particular external system like a database.  This would allow the
application server to manage connections to Riak and make it more efficient
for programs running within the app server.  That's the theory anyway...

I am experimenting with JCA in general and considered building a JCA wrapper
for Riak.  I searched the message logs and other resources and it doesn't
look like anyone else is doing this.

Is there anyone else that might be interested in testing this when I'm
done?  if it works, I will contribute it.


-- 
Bruce Kissinger
___
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com


Re: timeouts while rebalancing

2010-12-22 Thread Dan Reverri
Hi Colin,

For clusters smaller than 5 nodes a certain amount of vnode shuffling has
been observed. The behavior you've observed seems to match bug 946:
https://issues.basho.com/show_bug.cgi?id=946

I'm not sure how much additional disk space per node would be necessary. The
example provided in the bug report for the 3->4 transition shows that each
node received ~10 new vnodes while giving up ~15 vnodes. The worst case
scenario for any particular node would be receiving all 10 new vnodes before
giving up anything. For a 64 partition system, 10 vnodes would be ~15% of
the stored data set. For example, if you have a 100GB data set with N=3, the
stored data set would be 300GB. %15 of 300GB would be 45GB which means a
single node would need at least 145GB to account for a worst case vnode
churn scenario.

Regarding bitcask versus innostore, both are good backends and production
worthy.

Your plan to migrate to a Riak 0.13 cluster is a good one.

Thanks,
Dan

Daniel Reverri
Developer Advocate
Basho Technologies, Inc.
d...@basho.com


On Tue, Dec 21, 2010 at 7:30 PM, Colin Surprenant <
colin.surpren...@gmail.com> wrote:

> Hi,
>
> My bucket is using the default N=3. When writing, I am using W=1 and
> when reading N=1.
>
> My cluster has settled down now. After the addition of the 4th node,
> one of the node started to use diskspace at a very rapid pace, heading
> quickly toward 100% usage so I had to remove the 4th node. It took a
> few hours for the cluster to settle down. I am back at square one. I
> think at this point it will be easier to lease my 0.10.1 cluster
> as-is, create a new cluster using the latest 0.13 and extract and
> refeed the data into the new cluster without letting it grow out of
> proportion on each node.
>
> What would be a reasonable single node data size to be able to cope
> "seamlessly" with node additions?
> Should I consider using bitcask over innostore, making sure each node
> can hold its keyspace in memory?
>
> Thanks.
> Colin
>
> On Tue, Dec 21, 2010 at 1:08 PM, Dan Reverri  wrote:
> > Hi Collin,
> > I would not expect keys to return not found even during handoff. An
> > individual vnode may return not found if the requested data has not been
> > transferred but the other replicas should be able to satisfy the quorum.
> > What values of N, R, and W are you using?
> > Thanks,
> > Dan
> > Daniel Reverri
> > Developer Advocate
> > Basho Technologies, Inc.
> > d...@basho.com
> >
> >
> > On Mon, Dec 20, 2010 at 5:18 PM, Colin Surprenant
> >  wrote:
> >>
> >> Yup, same version.
> >>
> >> On Mon, Dec 20, 2010 at 7:58 PM, Alexander Sicular 
> >> wrote:
> >> > Did you add the same version of riak to your 0.10.1 cluster? I
> wouldn't
> >> > mismatch...
> >> >
> >> > On Dec 20, 2010, at 3:46 PM, Colin Surprenant wrote:
> >> >
> >> >> Hi,
> >> >>
> >> >> Actually, they're not timeout error but Not Found errors for a bunch
> >> >> of keys that have been stored without error while the cluster is
> >> >> rebalancing.
> >> >>
> >> >> Is it "normal" to see Not Found errors while the cluster is
> >> >> rebalancing? If not, what is my problem here? Is it a problem to
> >> >> insert new keys while the cluster is rebalancing?
> >> >>
> >> >> Again, I cannot find any error report other that riak-admin failing
> >> >> with a timeout as described below.
> >> >>
> >> >> Any help/hints approciated, thanks!
> >> >>
> >> >> Colin
> >> >>
> >> >> On Mon, Dec 20, 2010 at 2:50 PM, Colin Surprenant
> >> >>  wrote:
> >> >>> Hi,
> >> >>>
> >> >>> I just added a 4th node in my 0.10.1 + innostore cluster and I am
> >> >>> seeing all kind of timeouts both for retrieving objects and trying
> to
> >> >>> execute riak-admin status which gives me:
> >> >>>
> >> >>> RPC to 'r...@x.x.x' failed: {'EXIT',
> >> >>> {timeout,
> >> >>>  {gen_server2,call,
> >> >>>   [riak_kv_stat,get_stats]}}}
> >> >>>
> >> >>>
> >> >>> Also, the CPU load has seriously increased on the original 3 nodes.
> >> >>> The data rebalancing is quite slow.
> >> >>> I am not seeing anything wrong in the log files.
> >> >>>
> >> >>> Is this an indication that something is going wrong?
> >> >>>
> >> >>> Thanks,
> >> >>> Colin
> >> >>>
> >> >>
> >> >> ___
> >> >> riak-users mailing list
> >> >> riak-users@lists.basho.com
> >> >> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
> >> >
> >> >
> >>
> >> ___
> >> riak-users mailing list
> >> riak-users@lists.basho.com
> >> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
> >
> >
>
___
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com