To the extent you limit your sliced data via an m/r you will reap those savings 
on the wire when transferring back to the client

You can feed an m/r from riak search, 2i or enumerated keys thereby skipping a 
costly bucket scan. 


@siculars
http://siculars.posthaven.com

Sent from my iRotaryPhone

On Jul 16, 2013, at 19:46, Jeremiah Peschka <jeremiah.pesc...@gmail.com> wrote:

> Not a problem.
> 
> MapReduce across an entire keyspace is slow.
> 
> MapReduce when provided with a few bucket/key pairs is the same as a 
> multi-get + processing.
> 
> You can combine 2i + MR to get quick processing of data. Although, at that 
> point, you might as well just process your data on the client side. 
> Especially if you're just pulling out a slice of bytes.
> 
> ---
> Jeremiah Peschka - Founder, Brent Ozar Unlimited
> MCITP: SQL Server 2008, MVP
> Cloudera Certified Developer for Apache Hadoop
> 
> 
> On Tue, Jul 16, 2013 at 4:13 PM, gbrits <gbr...@gmail.com> wrote:
>> Wow, high speed on this list!
>> 
>> I wanted it for near realtime anyway so Map/reduce is out of the question. 
>> Thought somehow it could be done through Riak Search or directly on 
>> secondary indices instead of map/reduce. 
>> Guess not. Oh well, can't have it all. 
>> 
>> Thanks
>> 
>> 
>> 2013/7/17 Jeremiah Peschka [via Riak Users] <[hidden email]>
>>> Following up on Alex's comments -
>>> 
>>> If you know which bytes you need to slice, you can store this in a 
>>> secondary index. You can perform range queries across secondary indices (as 
>>> well as keys).
>>> 
>>> As long as you're storing your data in a way that allows it to be read by 
>>> either Erlang or JavaScript, you should be able to query over it in 
>>> MapReduce. This is typically regarded as a Bad Idea™ since an MR query will 
>>> need to scan all keys in a bucket (which effectively means scanning the 
>>> entire cluster) and is best done as an infrequent activity to transform 
>>> data.
>>> 
>>> ---
>>> Jeremiah Peschka - Founder, Brent Ozar Unlimited
>>> MCITP: SQL Server 2008, MVP
>>> Cloudera Certified Developer for Apache Hadoop
>>> 
>>> 
>>> On Tue, Jul 16, 2013 at 3:45 PM, Alexander Sicular <[hidden email]> wrote:
>>>> I would say no. Riak is generally oblivious as to the content of your 
>>>> data. Any ranges or other method you would use to query needs to be 
>>>> explicitly indexed via riak search or secondary indexes. Once you have 
>>>> found your data you could operate over that data in a map reduce, but I 
>>>> can't speak to "binary safe" blob operations in either erlang or 
>>>> JavaScript although I'm inclined to say yes, you would be able to operate 
>>>> over it in m/r.
>>>> 
>>>> So searching for keys with certain data in the binblob is probably not 
>>>> gonna happen but once you have a key to feed an m/r you could get a slice 
>>>> of that value.
>>>> 
>>>> Make sense?
>>>> -Alexander
>>>> 
>>>> @siculars
>>>> http://siculars.posthaven.com
>>>> 
>>>> Sent from my iRotaryPhone
>>>> 
>>>> On Jul 16, 2013, at 18:17, gbrits <[hidden email]> wrote:
>>>> 
>>>> > First, hello all!
>>>> >
>>>> > Coming from Redis, I love that you can just put any binary blob in Redis
>>>> > which is just treated as a string. This is possible because Redis strings
>>>> > are what they call 'binary safe'. This makes it possible to return 
>>>> > slices of
>>>> > string-encoded binary data, which is super useful for bitset-operations,
>>>> > etc.
>>>> >
>>>> > I'm investigating Riak and I like it a lot so far. Riak seems to have 
>>>> > range
>>>> > queries (on values, as it seems I must make that distinction with
>>>> > column-stores), but I'm not sure if strings in Riak are "Binary safe" in 
>>>> > the
>>>> > above sense. If not, is there another way to store binary data in Riak 
>>>> > and
>>>> > still do range queries over them quickly?
>>>> >
>>>> > To be exact: I want to do multi-key lookups in Riak, where each returned
>>>> > result should be of format: <key,slice(featureX,start,end)>
>>>> >
>>>> > Thanks,
>>>> > Geert-Jan
>>>> >
>>>> >
>>>> >
>>>> > --
>>>> > View this message in context: 
>>>> > http://riak-users.197444.n3.nabble.com/Does-Riak-support-Range-Queries-over-binary-safe-strings-tp4028356.html
>>>> > Sent from the Riak Users mailing list archive at Nabble.com.
>>>> >
>>>> > _______________________________________________
>>>> > riak-users mailing list
>>>> > [hidden email]
>>>> 
>>>> > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
>>>> 
>>>> _______________________________________________
>>>> riak-users mailing list
>>>> [hidden email]
>>>> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
>>> 
>>> 
>>> _______________________________________________ 
>>> riak-users mailing list 
>>> [hidden email] 
>>> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
>>> 
>>> 
>>> If you reply to this email, your message will be added to the discussion 
>>> below:
>>> http://riak-users.197444.n3.nabble.com/Does-Riak-support-Range-Queries-over-binary-safe-strings-tp4028356p4028359.html
>>> To unsubscribe from Does Riak support Range Queries over binary safe 
>>> strings?, click here.
>>> NAML
>> 
>> 
>> View this message in context: Re: Does Riak support Range Queries over 
>> binary safe strings?
>> 
>> Sent from the Riak Users mailing list archive at Nabble.com.
>> 
>> _______________________________________________
>> riak-users mailing list
>> riak-users@lists.basho.com
>> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
> 
> _______________________________________________
> riak-users mailing list
> riak-users@lists.basho.com
> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to