Re: Measuring Riak disk usage

2013-04-10 Thread Matthew Von-Maszewski
Paul, There is a "tool" that will dump statistics for a given .sst file, including compression ratio. It is really designed for in-house use, so it can be a pain to build (just being honest, and setting expectations). 1. You have to download and install libsnappy. Just having it embedded withi

Re: Measuring Riak disk usage

2013-04-10 Thread Paul Wagner
So, as far as profiling disk usage of certain newly created objects for future planning, is there a specific query that might return recently created objects and their size on disk? Looking at the contents of .../riak/leveldb I can see each of the nodes and the sorted string tables. I can follow the

Re: Measuring Riak disk usage

2013-04-10 Thread Ben McCann
Cool. Thanks for the explanation. On Wed, Apr 10, 2013 at 10:55 AM, Reid Draper wrote: > > On Apr 10, 2013, at 1:45 PM, Jeremiah Peschka > wrote: > > If you've installed from the apt/yum repository you've installed a single > Riak node on your machine. Riak, though, is configured by default to

Re: Measuring Riak disk usage

2013-04-10 Thread Reid Draper
On Apr 10, 2013, at 1:45 PM, Jeremiah Peschka wrote: > If you've installed from the apt/yum repository you've installed a single > Riak node on your machine. Riak, though, is configured by default to write > data to three servers. If some of those servers aren't available, Riak is > going to

Re: Measuring Riak disk usage

2013-04-10 Thread Jeremiah Peschka
If you've installed from the apt/yum repository you've installed a single Riak node on your machine. Riak, though, is configured by default to write data to three servers. If some of those servers aren't available, Riak is going to write to a different server via hinted handoff[1]. Since you are on

Re: Measuring Riak disk usage

2013-04-10 Thread Evan Vigil-McClanahan
Riak has a virtual node on physical node structure. So all of your virtual nodes are running on a single machine. If you were to add more nodes, some of them would migrate to those new nodes, distributing storage and load around the cluster. On Wed, Apr 10, 2013 at 12:36 PM, Ben McCann wrote:
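
To make the virtual-node point above concrete, here is a minimal sketch of how a fixed set of vnodes could be spread across physical nodes. It is only an illustration: the ring size of 64, the node names, and the round-robin assignment are all assumptions made for this example, and round-robin is a simplification, not Riak's actual claim algorithm.

```python
from collections import Counter

RING_SIZE = 64  # assumed number of vnodes (partitions), for illustration only


def assign_vnodes(ring_size, nodes):
    """Map each vnode index to a physical node, round-robin.

    Hypothetical helper: a simplification, not Riak's real claim logic.
    """
    return {p: nodes[p % len(nodes)] for p in range(ring_size)}


def vnodes_per_node(assignment):
    """Count how many vnodes each physical node owns."""
    return Counter(assignment.values())


# One machine: every vnode lives on the same physical node.
single = vnodes_per_node(assign_vnodes(RING_SIZE, ["node1"]))

# Three machines: the same vnodes are spread across the cluster.
three = vnodes_per_node(assign_vnodes(RING_SIZE, ["node1", "node2", "node3"]))

print(dict(single))
print(dict(three))
```

With a single node, all 64 vnodes (and therefore all replicas) share one disk; with three nodes, each machine holds roughly a third of them.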

Re: Measuring Riak disk usage

2013-04-10 Thread Ben McCann
Sure, will do. I'm still a little confused about how Riak runs on one machine though. Is it running three server nodes or does it run only a single Riak node and store three copies of the data? Thanks, Ben On Wed, Apr 10, 2013 at 10:14 AM, Matthew Von-Maszewski wrote: > Ben, > > The runtime re

Re: Measuring Riak disk usage

2013-04-10 Thread Mark Phillips
Oops. Looks like this is a duplicate (thanks, Mailman). Here's the current thread: http://lists.basho.com/pipermail/riak-users_lists.basho.com/2013-April/011806.html Mark On Tue, Apr 9, 2013 at 9:14 PM, chengas123 wrote: > Hi, > > I'm currently storing data in MongoDB and would like to evaluat

Re: Measuring Riak disk usage

2013-04-10 Thread Matthew Von-Maszewski
Ben, The runtime recovery log ends in "XX.log", where XX is a six-digit number. Its size will vary between 30 MB and 60 MB per vnode directory. My recommendation is that you change the app.config file's default_bucket_props as detailed below. Completely erase the data storage ar

Re: Measuring Riak disk usage

2013-04-10 Thread Ben McCann
Thanks for the help. If I were saving three copies of the data in Riak that would certainly explain it! I installed Riak via the apt repository instructions. Not sure what that does by default. If it's saving

Re: Measuring Riak disk usage

2013-04-10 Thread Matthew Von-Maszewski
Greetings Ben, Also, leveldb stores data in "levels". The very first storage level and the runtime data recovery log are not compressed. That said, I agree with Tom that you are most likely seeing Riak store 3 copies of your data versus only one for mongodb. It is possible to dumb down Riak s

Re: Measuring Riak disk usage

2013-04-10 Thread Tom Santero
Hi Ben, First, allow me to welcome you to the list! Stick around, I think you'll like it here. :) How many nodes of Riak are you running vs how many nodes of Mongo? How much more disk space did Riak take? Riak is designed to run as a cluster of several nodes, utilizing replication to provide resili