CorrugatedIron needs a new home

2014-05-06 Thread Jeremiah Peschka
e we left off, email me at < jeremiah.pesc...@gmail.com> and let me know that you're interested. I'm more than happy to help make this transition happen smoothly and painlessly. [1]: http://github.com/DistributedNonsense/CorrugatedIron [2]: http://buffered.io/ --- Jeremiah Pes

Re: Data modeling a write-intensive comment storage cluster

2014-01-27 Thread Jeremiah Peschka
--- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sun, Jan 26, 2014 at 10:27 PM, fxmy wang wrote: > Thanks for the response, Jeremiah. > > > > > > Then here are my questions: > >

Re: Data modeling a write-intensive comment storage cluster

2014-01-25 Thread Jeremiah Peschka
Responses inline --- sent from a tiny portion of the hive mind... in this case, a phone On Jan 25, 2014 5:16 PM, "fxmy wang" wrote: > > Greetings List, > > I'm a new guy who's only got some experience with RMDBs. So please enlighten me if I'm doing something silly. > > So I'm trying to use Riak f

Re: Maps with multiple buckets

2013-12-17 Thread Jeremiah Peschka
The allowable inputs to an MR map phase include a list of bucket key pairs. If you know your keys in advance the problem is solved. Can you describe a bit more about how you're using MR? Is this an ad hoc query? A predictable report? Time based? --- sent from a tiny portion of the hive mind... in

Re: bucket props question

2013-12-11 Thread Jeremiah Peschka
Based on my understanding of the Bucket Types feature - yes, this feature would solve the problem. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Wed, Dec 11, 2013 at 12:39 PM, Jordan West wrote: > Hi Br

Re: How does riak scale horizontally

2013-11-30 Thread Jeremiah Peschka
A small portion of the data will be moved to the new node. Specifically virtual nodes will be shuffled to assure even data distribution in the cluster. This is in the docs at [1] [1]: http://docs.basho.com/riak/latest/ops/running/nodes/adding-removing/#The-Node-Join-Process --- sent from a tiny p

Re: Worried about the backends to use

2013-11-29 Thread Jeremiah Peschka
Responses inline. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Thu, Nov 28, 2013 at 9:57 PM, Suman Kumar wrote: > Hi, > > We have currently around 53 million key value pairs and each week we would

Re: How to Install Riak in windows Machine

2013-11-16 Thread Jeremiah Peschka
Riak isn't supported on Windows. Your best bet is to install on a VM or use AWS/Azure. --- sent from a tiny portion of the hive mind... in this case, a phone On Nov 16, 2013 9:59 AM, "Prakash Thapa" wrote: > Hi, > I am quite new for Riak, is there any possibility to install Rial in > windows Mac

Re: Forcing Siblings to Occur

2013-11-13 Thread Jeremiah Peschka
s/metadata/data/ - each sibling is a discrete copy of whatever data you've put in it + metadata. In the case of the client side indexes, you're right - the bulk of the increased storage will be from metadata. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server

Re: Yokozuna Schema Changes

2013-11-08 Thread Jeremiah Peschka
s have leap frogged it. Technically you can modify a > schema today but it has to be done by hand and is error prone. > > -Z > > > On Fri, Nov 8, 2013 at 6:42 PM, Jeremiah Peschka < > jeremiah.pesc...@gmail.com> wrote: > >> I notice that YZ issue 130 (support for schema

Yokozuna Schema Changes

2013-11-08 Thread Jeremiah Peschka
I notice that YZ issue 130 (support for schema updates) was created 5 months ago and doesn't have any commits against it right now. Is this still on track to get pushed into the product as part of Riak 2.0 or has no work begun? Thanks --- Jeremiah Peschka - Founder, Brent Ozar Unlimited

Re: Fwd: same 2i key with different value

2013-11-04 Thread Jeremiah Peschka
Yes. You can add any number of arbitrary values to an index. In the C# client, we implement this as a list so you just call object.Index("stuff_bin").Add("thing") --- sent from a tiny portion of the hive mind... in this case, a phone On Nov 4, 2013 5:06 AM, "Louis-Philippe Perron" wrote: > (bump

Re: How does Riak full-text search compare to SolrCloud?

2013-10-30 Thread Jeremiah Peschka
ins aren't there. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Wed, Oct 30, 2013 at 2:48 AM, Andy wrote: > Hi, > > I saw in the announcement of Riak 2.0 that Solr is now integrated into

Re: Updating Counters on Riak 2.0 pre whatever

2013-10-22 Thread Jeremiah Peschka
e both the 1.4 counter API and the CRDT API. We'll deprecate 1.4 counters whenever Basho does the same. There's no information overload for me, but it is nice to see everything in the same place. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera

Re: Updating Counters on Riak 2.0 pre whatever

2013-10-22 Thread Jeremiah Peschka
k to understand what happens when users do crazy things with the C# client Since I'm not a Basho FTE, you can expect these things to happen on the public mailing list rather than in the privacy of a Basho engineering chat room. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP:

Re: Updating Counters on Riak 2.0 pre whatever

2013-10-22 Thread Jeremiah Peschka
t me know if you have any more issues going forward, please. > > Cheers > > Russell > > [1] https://github.com/basho/riak_kv/pull/697 > On 22 Oct 2013, at 19:56, Jeremiah Peschka > wrote: > > > I'm attempting to create a counter on Riak 2.0 built from the devel

Updating Counters on Riak 2.0 pre whatever

2013-10-22 Thread Jeremiah Peschka
{proc_lib,init_p_do_apply,3, [{file,"proc_lib.erl"}, {line,239}]}] --- Jeremiah Peschka - Founder, Brent Ozar Unlimited

Re: Read Before Writes on Distributed Counters

2013-10-17 Thread Jeremiah Peschka
That's why I linked to the video - it's 60 minutes of Cribbs™ brand pedantry. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Thu, Oct 17, 2013 at 10:45 AM, Sean Cribbs wrote: > Since Jeremiah love

Re: Read Before Writes on Distributed Counters

2013-10-17 Thread Jeremiah Peschka
k. Siblings will be created, but they will not be visible to the end user who is reading from the counter. Check out this demo of the new counter types from Sean Cribbs: https://vimeo.com/43903960 --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified

Re: Unit testing, Riak buckets

2013-10-13 Thread Jeremiah Peschka
ets/cleans > the git repository at the beginning of each test. If you're curious, I can > point you to the relevant sections of code that do this. > > > On Sun, Oct 13, 2013 at 8:34 PM, Jeremiah Peschka < > jeremiah.pesc...@gmail.com> wrote: > >> For Corrugate

Re: Unit testing, Riak buckets

2013-10-13 Thread Jeremiah Peschka
using cron jobs to delete yesterday's buckets doesn't sound like a bad idea. Yes, listing buckets is bad in production. No, this isn't production. Therefore: LIST ALL THE THINGS! --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified De

Re: Corrugated Iron for Riak- WCF Performance Issue

2013-10-10 Thread Jeremiah Peschka
NuGet [4] [1]: http://docs.basho.com/riak/latest/ops/building/planning/system-planning/#Hardware [2]: http://markmail.org/message/nsr2an6ztwcjtopb [3]: https://github.com/danielcrenna/metrics-net [4]: http://www.nuget.org/packages/Metrics --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQ

Re: Bucket properties not updated

2013-10-09 Thread Jeremiah Peschka
Could this be a duplicate of bug 660? https://github.com/basho/riak_kv/issues/660 I can reliably reproduce this on a local 5 node stagedevrel cluster on Riak 1.4.2 and develop. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for

Re: are Siblings ordered?

2013-10-07 Thread Jeremiah Peschka
thing like modifying the properties in an existing object. Expect more developments as time allows. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Mon, Oct 7, 2013 at 12:04 PM, Sam Elliott wrote: > Each sibling

Re: are Siblings ordered?

2013-10-07 Thread Jeremiah Peschka
There's no guarantee of return order as far as I know. Since you can't count on clocks anyway... Are you trying to determine which data modifications to apply from multiple siblings? --- sent from a tiny portion of the hive mind... in this case, a phone On Oct 7, 2013 11:40 AM, "Alex Rice" wrote

Re: understanding optimistic concurrency in Riak

2013-10-06 Thread Jeremiah Peschka
Inline: --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Thu, Oct 3, 2013 at 9:19 PM, Alex Rice wrote: > Just trying to paraphrase how I understand it from the Riak docs, plus > helpful feedback from Je

Re: C# client (CI) questions re: IfNotModified

2013-10-02 Thread Jeremiah Peschka
Issue created to track this! https://github.com/DistributedNonsense/CorrugatedIron/issues/180 We definitely need to clean these up. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Wed, Oct 2, 2013 at 5:17 PM

Re: Riak on SAN

2013-10-02 Thread Jeremiah Peschka
not a SAN. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Wed, Oct 2, 2013 at 2:12 PM, John E. Vincent < lusis.org+riak-us...@gmail.com> wrote: > I'm going to take a competing view here. > > S

Re: Riak on SAN

2013-10-01 Thread Jeremiah Peschka
Could you do it? Sure. Should you do it? No. An advantage of Riak is that you can avoid the cost of SAN storage by getting duplication at the machine level rather than rely on your storage vendor to provide it. Running Riak on a SAN also exposes you to the SAN becoming your bottleneck; you only

Re: short length hash key generation for riak keys?

2013-09-30 Thread Jeremiah Peschka
ecimal, or BigInteger numbers based on a combination of time since an epoch, a machine identifier, and an internally cycling identifier that resets every millisecond. I can't lay claim to the idea, but I can lay claim to writing parts of it in C#. [1]: https://github.com/peschkaj/rustflakes --- J

Re: allow_mult vs. 2i

2013-09-27 Thread Jeremiah Peschka
e an update. So if that's the case - no need for > allow_mult. Does that sound right? > > -B. > > > On Wed, Sep 25, 2013 at 6:30 PM, Jeremiah Peschka < > jeremiah.pesc...@gmail.com> wrote: > >> inline. >> >> --- >> Jeremiah Peschka - Fou

Re: allow_mult vs. 2i

2013-09-25 Thread Jeremiah Peschka
inline. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Wed, Sep 25, 2013 at 2:47 PM, Brady Wetherington wrote: > I've built it a solid proof-of-concept system on leveldb, and use some 2i > indexes

Re: Getting X-Riak-Vclock when saving

2013-09-20 Thread Jeremiah Peschka
Depending on your Riak version, and client library support, Riak has a return_head option for put requests. That should return everything but the current object. In other words, it should give you the vector clock without returning the entire object. --- Jeremiah Peschka - Founder, Brent Ozar

Re: VNodes distribution on the ring

2013-09-19 Thread Jeremiah Peschka
cepts/Replication/#Understanding-replication-by-example [2]: http://docs.basho.com/riak/latest/theory/concepts/#Replication [3]: http://littleriakbook.com/#toc_7 --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Thu

Re: Riak and ec2 with vm.args - Node 'r...@ip-10-234-117-74.eu-west-1.compute.internal' not responding to pings.

2013-09-15 Thread Jeremiah Peschka
then ran the following - http://pastebin.com/CaHH3Eve Each node's vm.args is using the IP address for the `-name` parameter. If you're in a VPC, you may want to check the box for "Enable DNS hostname support for instances launched in this VPC." --- Jeremiah Peschka - Foun

Re: secondary index issue

2013-09-15 Thread Jeremiah Peschka
/youmag/index/contentid_bin/1570%2F204004 {"keys":["art3"]} Hope that helps --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sun, Sep 15, 2013 at 7:21 AM, Fabrice Sznajderman wrote: > Hello, &g

Re: Riak and ec2 with vm.args - Node 'r...@ip-10-234-117-74.eu-west-1.compute.internal' not responding to pings.

2013-09-15 Thread Jeremiah Peschka
]: http://docs.basho.com/riak/latest/ops/building/installing/aws-marketplace/ --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sun, Sep 15, 2013 at 1:06 AM, David Montgomery wrote: > Hi, > > I am havi

Re: use Siblings to implement a message queue?

2013-09-12 Thread Jeremiah Peschka
--- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Thu, Sep 12, 2013 at 9:45 AM, Alex Rice wrote: > Hey all, thanks for the feedback, this is interesting! > > re: Those CRDT white papers look pretty compl

Re: Resetting bucket properties

2013-09-11 Thread Jeremiah Peschka
Will do. -- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sep 11, 2013, at 5:38 PM, Sean Cribbs wrote: > Sounds like a bug to me, would you file an issue on riak_kv? > > Sean Cribbs > > On Sep 11

CorrugatedIron v1.4.2 released

2013-09-11 Thread Jeremiah Peschka
otobuf fix, fixed one memory leak in `GetObject` and another in `SetObject` --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop ___ riak-users mailing list riak-users@lists.bash

Resetting bucket properties

2013-09-11 Thread Jeremiah Peschka
ng the values around so that allow_mult = false and LWW = true actually ends up with allow_mult = true and LWW = true: {"props":{"allow_mult":true,"basic_quorum":false,"big_vclock":50,"chash_keyfun":{"mod":"riak_core_util","fun":"chash_std_keyfun"},"dw":"quorum","last_write_wins":true,"linkfun":{"mod":"riak_kv_wm_link_walker","fun":"mapreduce_linkfun"},"n_val":3,"name":"players","notfound_ok":true,"old_vclock":86400,"postcommit":[],"pr":0,"precommit":[],"pw":0,"r":1,"rw":"quorum","search":false,"small_vclock":50,"w":1,"young_vclock":20}} Resetting bucket properties at this point puts the bucket back where it should be. Is this behavior expected or have I missed something subtle? --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop ___ riak-users mailing list riak-users@lists.basho.com http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Re: use Siblings to implement a message queue?

2013-09-11 Thread Jeremiah Peschka
: https://github.com/peschkaj/MoarDT/ The only tricky bit is that you can never be quite sure if you've read everything. Oh, and garbage collecting the CRDT can be tricky if you're using a client generated CRDT - you can end up with a large number of siblings if you aren't carefu

Re: .net corrugatediron q. re: content types

2013-09-11 Thread Jeremiah Peschka
Howdy Alex, That shouldn't be happening. I've created an issue and I'll check into it as soon as I get a chance. You can track the issue here https://github.com/DistributedNonsense/CorrugatedIron/issues/171 --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 200

Re: .net corrugatediron q. re: content types

2013-09-11 Thread Jeremiah Peschka
ontentType = RiakConstants.ContentTypes.ProtocolBuffers; ro.SetObject(testObj); */* or */* var ro = new RiakObject(bucket, key); ro.SetObject(testObj, RiakConstants.ContentTypes.ProtocolBuffers); --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified

Re: dealing with binary keys

2013-09-11 Thread Jeremiah Peschka
. Typically, I just nuke the data in my Riak cluster after running Basho bench, but my use case is slightly different and my cluster gets rebuilt regularly anyway. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On

Re: Missing keys in bucket?

2013-09-03 Thread Jeremiah Peschka
replicated or between the tombstoning and reclamation of the object. [1]: http://lists.basho.com/pipermail/riak-users_lists.basho.com/2011-October/006048.html [2]: http://lists.basho.com/pipermail/riak-users_lists.basho.com/2011-October/006237.html --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MC

Re: Adding Nodes to a Running Cluster

2013-08-25 Thread Jeremiah Peschka
Ah, the second bullet point ("Don't add nodes one at a time if you're adding multiple nodes") clarifies it perfectly. I would've thought otherwise since there are possibilities of false not_found responses during node addition. Thanks for pointing me to that page. --- J

Adding Nodes to a Running Cluster

2013-08-25 Thread Jeremiah Peschka
node cluster, but less problematic with a much larger cluster since a smaller percentage of nodes will participate in the hand off. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop

Re: MapReduce with 2i input filters

2013-08-24 Thread Jeremiah Peschka
Yokozuna seem like a far more effective fit. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sat, Aug 24, 2013 at 8:49 AM, Gints Gailītis wrote: > I guess it's bit hard to read the 3 samples I asked f

Re: VS Sample help

2013-08-20 Thread Jeremiah Peschka
ning. 2) Your Riak installation hasn't been set up to listen on those IP addresses. CorrugatedIron will happily attempt to talk to a tree stump until it times out after exhausting the retry attempts + timeout period. Tree stumps don't have ports ;) --- Jeremiah Peschka - Founder, Bre

Re: VS Sample help

2013-08-19 Thread Jeremiah Peschka
.0", 10017 } ]} The default is to only listen on 127.0.0.1 --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Mon, Aug 19, 2013 at 7:58 AM, wrote: > > > Hello I just got 2 VMs’ setup and can access via

CorrugatedIron v1.4.1 released

2013-08-18 Thread Jeremiah Peschka
ps://github.com/DistributedNonsense/CorrugatedIron/issues/158>) - it's now possible to pass in a full RiakObject and have CorrugatedIron generate the correct RiakObjectId for deletion [1]: http://www.nuget.org/packages/CorrugatedIron/ [2]: http://github.com/DistributedNonsense/CorrugatedIro

Re: vm.args change for 15% to 80% improvement in leveldb

2013-08-14 Thread Jeremiah Peschka
Thanks. Some other databases suggest limiting certain settings based on NUMA affinity, but I wasn't sure how Erlang managed scheduling and foreign memory access. The clarification is much appreciated. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Clo

Re: vm.args change for 15% to 80% improvement in leveldb

2013-08-14 Thread Jeremiah Peschka
Final question - does NUMA matter? e.g. quad socket system with 8 cores per socket + HT should it be +S 8:8 or +S 32:32 ? --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Wed, Aug 14, 2013 at 4:14 AM, Matthew

Re: vm.args change for 15% to 80% improvement in leveldb

2013-08-13 Thread Jeremiah Peschka
plicitly saying something. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Tue, Aug 13, 2013 at 5:38 PM, Matthew Von-Maszewski wrote: > ** The following is copied from Basho's leveldb wiki page: > > htt

Re: different versions in upgrade

2013-08-13 Thread Jeremiah Peschka
From http://docs.basho.com/riak/latest/ops/running/rolling-upgrades/ it looks like you should upgrade to 1.3.2 and then 1.4.1 Depending on how badly you need the extra capacity, it would probably be better to start by upgrading all nodes and then adding the new one. -- Jeremiah Peschka

Re: Practical Riak cluster choices in AWS (number of nodes? AZ's?)

2013-08-12 Thread Jeremiah Peschka
(more if you use crappier instances). It's easier and cheaper to spin up more Riak nodes than it is to tune AWS infrastructure. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Mon, Aug 12, 2013 at 11:3

Re: Practical Riak cluster choices in AWS (number of nodes? AZ's?)

2013-08-11 Thread Jeremiah Peschka
at latency you're willing to introduce into your application. Were I in your position, I'd stand up a 5 node cluster in US-WEST-2 and be done with it. I'd consider Riak EE for my HA/DR solution once the business decides that off-site HA/DR is something it wants/needs. --- Jeremiah

Re:

2013-08-11 Thread Jeremiah Peschka
Glad to hear that worked for you! --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sat, Aug 10, 2013 at 4:44 PM, Jeremy Ong wrote: > Just a quick follow up, I did this exact approach and things went > sm

Re:

2013-08-09 Thread Jeremiah Peschka
. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Fri, Aug 9, 2013 at 7:23 PM, Jeremy Ong wrote: > Hi Riak users, > > I'm wondering what the best approach to this is. The scenario is that > I have mo

Re: Nailing down the bucket design?

2013-08-07 Thread Jeremiah Peschka
Responses inline. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Wed, Aug 7, 2013 at 2:28 AM, Maksymilian Strzelecki wrote: > Hi. I've read somewhere around the Internet that Riak benefits its >

Re: Unit testing persistence

2013-08-02 Thread Jeremiah Peschka
Responses inline. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Wed, Jul 31, 2013 at 9:41 PM, Wagner Camarao wrote: > Hi all ~ > > Great meetup today - looking forward to upgrading to 1.4 > > I

Re: Riak 1.4 - fastest way to count all records in bucket (100+ millions)

2013-08-01 Thread Jeremiah Peschka
st with LevelDB). [1]: http://basho.com/counters-in-riak-1-4/ --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Thu, Aug 1, 2013 at 12:01 PM, Christian Rosnes < christian.ros...@gmail.com> wrote: > > &g

Re: Error starting Yokozuna

2013-07-29 Thread Jeremiah Peschka
using `make stagedevrel` and this worked for me. [1]: https://github.com/basho/yokozuna/pull/127 [2]: https://github.com/basho/yokozuna/pull/136 --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sun, Jul 28, 2013 at

Re: Querying multiple 2i via Intersections

2013-07-27 Thread Jeremiah Peschka
ws which data types to index. See Using Search[1] and Search Schema[2] for implementation details [1]: http://docs.basho.com/riak/latest/dev/using/search/ [2]: http://docs.basho.com/riak/latest/dev/advanced/search-schema/ --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 200

Re: Querying multiple 2i via Intersections

2013-07-27 Thread Jeremiah Peschka
. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sat, Jul 27, 2013 at 11:16 AM, Guido Medina wrote: > Rohman, > > I think the reason for this is that the cluster will have to do the whole > intersecti

Re: [ANN] Yokozuna 0.7.0

2013-07-26 Thread Jeremiah Peschka
Also - if nobody on list or in IRC is able to help you today, I'll try to spin up an AWS instance and get a build going. Which version of CentOS are you using? --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop O

Re: [ANN] Yokozuna 0.7.0

2013-07-26 Thread Jeremiah Peschka
Well, to be fair I believe that Yokozuna also requires Riak 1.3 or higher and/or a version of CentOS that is newer than 5.2. That being said, I build everything with R15B03, so I may not be the most reliable source. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP

Re: [ANN] Yokozuna 0.7.0

2013-07-26 Thread Jeremiah Peschka
Have you followed the "Installing Erlang" instructions[1]? They include a reference on how to get Erlang R15B01 up and running on your machine if you have to build Erlang from source. [1]: http://docs.basho.com/riak/1.3.2/tutorials/installation/Installing-Erlang/ --- Jeremiah Peschka

CorrugatedIron and Mono

2013-07-22 Thread Jeremiah Peschka
reply off-list so we don't spam riak-users. Thanks! --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop ___ riak-users mailing list riak-users@lists.basho.com http://lists.ba

Re: Comparing Riak MapReduce and Hadoop MapReduce

2013-07-22 Thread Jeremiah Peschka
Oh, I almost forgot, you can also supply the do_prereduce argument to your reduce phase - this performs a pre-reduce phase on the mapper. This can, depending on the workload, significantly decrease the network overhead between the mappers and the reducer. --- Jeremiah Peschka - Founder, Brent

Re: Comparing Riak MapReduce and Hadoop MapReduce

2013-07-22 Thread Jeremiah Peschka
For JavaScript the number of reducers is configured in the app.config file on each node with the reduce_js_vm_count property. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Mon, Jul 22, 2013 at 8:07 AM, Xiaoming

Re: Comparing Riak MapReduce and Hadoop MapReduce

2013-07-21 Thread Jeremiah Peschka
Ah, yeah, I'm mistaken about search partitioning. The docs are correct. I have no idea how the scheduling works. If I had to guess, I would guess that it is a streaming operation. -- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Develope

Re: Comparing Riak MapReduce and Hadoop MapReduce

2013-07-21 Thread Jeremiah Peschka
Responses inline. Hopefully they shed some light on the subject. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Fri, Jul 19, 2013 at 5:07 PM, Xiaoming Gao wrote: > Hi everyone, > > I am trying to le

CorrugatedIron v1.4.0 released

2013-07-19 Thread Jeremiah Peschka
Get it while it's hot. Features include Riak 1.4 support and... stuff. Release notes - https://github.com/DistributedNonsense/CorrugatedIron/blob/v1.4.0/RELEASE_NOTES.md Blog post - http://www.brentozar.com/archive/2013/07/corrugatediron-1-4/ --- Jeremiah Peschka - Founder, Brent Ozar Unli

Re: Riak Search and Sorting

2013-07-18 Thread Jeremiah Peschka
AH HA! And you have now saved me from going crazy trying to track down strange collection related behavior. Ryan Zezeski, you're my hero. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Thu, Jul 18, 2013 at

Re: Riak Search and Sorting

2013-07-18 Thread Jeremiah Peschka
I just confirmed that today I'm getting the correct sorting in the browser but not in CorrugatedIron. I'm about to start in on a day of working with a client. Will verify this afternoon. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified

Riak Search and Sorting

2013-07-17 Thread Jeremiah Peschka
this behavior expected? Is there something that I've missed in my query? --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop ___ riak-users mailing list riak-users@lists.bas

Re: Lots of sparse columns. Efficient like Cassandra? Some measures of my dataset

2013-07-17 Thread Jeremiah Peschka
-- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Jul 17, 2013, at 4:38 AM, gbrits wrote: > Somewhere (can't find it now) I've read that Riak, like Cassandra could be > classified as a column

Re: Does Riak support Range Queries over binary safe strings?

2013-07-16 Thread Jeremiah Peschka
. Especially if you're just pulling out a slice of bytes. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Tue, Jul 16, 2013 at 4:13 PM, gbrits wrote: > Wow, high speed on this list! > > I want

Re: Does Riak support Range Queries over binary safe strings?

2013-07-16 Thread Jeremiah Peschka
JavaScript, you should be able to query over it in MapReduce. This is typically regarded as a Bad Idea™ since an MR query will need to scan all keys in a bucket (which effectively means scanning the entire cluster) and is best done as an infrequent activity to transform data. --- Jeremiah Peschka

Re: Quickly deleting + recreating item in Riak deletes new item

2013-07-15 Thread Jeremiah Peschka
This is what we do for CorrugatedIron integration testing. Test buckets typically have a test name + UUID. Which makes it interesting when I try to verify data via curl while I'm debugging. But it also keeps me from polluting my testing buckets with the output of other failed tests. --- Jer

Re: Yokozuna kv write timeouts on 1.4 (yz-merge-1.4.0)

2013-07-15 Thread Jeremiah Peschka
I've tested this using the PBC interface and a build from the source branch as well. The timeout occurs waiting for any response from protobufs waiting for message size. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for A

Re: Connection timeout issues

2013-07-15 Thread Jeremiah Peschka
Riak 1.4 introduced a time out setting that defaults to 60,000 seconds. If you need to read for longer periods of time, you'll need to increase the timeout. You can set it on the messages that are being sent in to Riak. Is this, perhaps, what you're running into? --- Jeremiah Peschka

Re: Unique secondary index

2013-07-14 Thread Jeremiah Peschka
ds If-None-Match when you attempt to write new records. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sun, Jul 14, 2013 at 4:17 AM, Sandy Herman wrote: > Hello, > > Is it possible to create a unique seconda

Re: CorrugatedIron & Idle Connection

2013-07-11 Thread Jeremiah Peschka
getting rid of/fully enumerating collections before moving on? How are you handling errors processing individual items in the IEnumerable? (This is primarily a concern if you're using the streaming interfaces) --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Clouder

Re: CorrugatedIron & Idle Connection

2013-07-11 Thread Jeremiah Peschka
ease, but we can put that in a point release. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Thu, Jul 11, 2013 at 6:59 AM, Amir Halatzi wrote: > Hi all, > > Trying to optimize our application connectio

Re: New Counters - client support

2013-07-10 Thread Jeremiah Peschka
at command completes, the counter has a value of 1; assuming no other increments are made. To initialize to a different value, simply increment/decrement the counter to the chosen value and begin from there. -- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Clouder

Re: riak_kv_memory_backend replication

2013-07-10 Thread Jeremiah Peschka
Correct. Unless you've specific an n value of 1 for the bucket. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Wed, Jul 10, 2013 at 12:57 PM, kpandey wrote: > In a multi node cluster with a b

CorrugatedIron v1.4.0-RC1

2013-07-07 Thread Jeremiah Peschka
We've released CorrugatedIron (the .NET Riak client) v1.4.0-RC1 into the wild. Release notes are available at https://github.com/DistributedNonsense/CorrugatedIron/blob/1.4.0-rc1/RELEASE_NOTES.mdor pasted below. We do need testers for this - this release has been developed against Riak 1.4.0 HEAD,

CorrugatedIron v.1.3.3

2013-07-02 Thread Jeremiah Peschka
CorrugatedIron v1.3.3 has been released. This is a minor fix that adds Get(bucket, key, RiakGetOptions) to the IRiakClient API. You can get it from nuget.org. Release notes available at https://github.com/DistributedNonsense/CorrugatedIron/blob/master/RELEASE_NOTES.md --- Jeremiah Peschka

Re: [ANN] Yokozuna 0.7.0

2013-07-01 Thread Jeremiah Peschka
x27;ll hit you up directly should I run into any weirdness. --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Mon, Jul 1, 2013 at 9:50 AM, Ryan Zezeski wrote: > Yokozuna supports protobuff already. It uses the sam

Re: [ANN] Yokozuna 0.7.0

2013-07-01 Thread Jeremiah Peschka
What level of PBC integration can we expect from Yokozuna? Is that developed but not documented or is that a TBA feature? --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Mon, Jul 1, 2013 at 8:46 AM, Ryan Zezeski

Re: Corrugated Iron Samples for riak

2013-06-01 Thread Jeremiah Peschka
would like to have. [1]: http://basho.com/riaks-config-behaviors-part-2/ --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sat, Jun 1, 2013 at 8:56 PM, Erik F. Andersen wrote: > Hi! > > I for one have been lo

Re: Corrugated Iron Samples for riak

2013-05-31 Thread Jeremiah Peschka
You can find up to date samples at https://github.com/DistributedNonsense/CorrugatedIron.Samples --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Fri, May 31, 2013 at 9:26 AM, Pandiarajan Subramani < psub

Re: Receiving ulimit warning despite setting it

2013-05-15 Thread Jeremiah Peschka
If you check ulimit through Erlang [1], are you seeing the appropriate ulimit values? [1]: http://riak.markmail.org/search/?q=ulimit#query:ulimit+page:2+mid:bqjbmn3yyh5hdvcb+state:results --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified

Re: On siblings

2013-05-15 Thread Jeremiah Peschka
a sibling should be created. As I understand it, the only way to write into Riak and not get siblings is to set allow_mult to false - even leaving out vector clocks will lead to siblings if allow_mult is true. Or so John Daily's chart claims. [1]: http://basho.com/riaks-config-behaviors-part-2

Re: Using Riak to perform aggregate queries

2013-04-14 Thread Jeremiah Peschka
Hi Chris, Responses are inline --- Jeremiah Peschka - Founder, Brent Ozar Unlimited MCITP: SQL Server 2008, MVP Cloudera Certified Developer for Apache Hadoop On Sun, Apr 14, 2013 at 4:47 PM, Chris Corbyn wrote: > All, > > Just copying this from my stackoverflow post, as the riak ta

Re: Measuring Riak disk usage

2013-04-10 Thread Jeremiah Peschka
ff, []}}, > > This will disable Riak's automatic detection and correction of data loss / > corruption. The feature requires an added 1 to 2% data on disk. > > > Matthew > > > > On Apr 10, 2013, at 9:01 AM, Tom Santero wrote: > > Hi Ben, > > First, al

Re: The suitability of MapReduce

2013-04-09 Thread Jeremiah Peschka
r's head so they can evaluate as they're going through > the article. > > Cheers > Matt > > > On 9 April 2013 02:02, Jeremiah Peschka > 'jeremiah.pesc...@gmail.com');> > > wrote: > >> I want to follow up on the recent "Map phase

  1   2   3   4   >