Hi All,

I posted this question to Stack Overflow a few days back but not much luck.
 Hoping someone here has some thoughts.

I have a use case for an aggregate query across the entire db and all
buckets, I'm wondering the best query method to use, leaning towards
multiple secondary index calls. This won't be a frequently used feature,
possibly invoked once a week or so via scheduled job or something.

Some records have a value in their meta attribute that I'd like to
match/target for the selection.  After the selection I'll need to update
those records.

>From what I've read, secondary index looks great but it is limited to a
single bucket? I also saw "list buckets", which has warnings about
production use, though not sure if that's applicable to such infrequently
used functionality. Thought maybe i could list buckets then perform the
secondary index query on each.

Is there a better way? MapReduce seems heavy, having to load every KV off
the file system. Search seem possible too but index setup/maintenance seems
overkill if there's an easier way.

UPDATE:  i went ahead with a Search index but am now second guessing that.
 This lookup will be part of an encryption key rotation, where we'll be
finding certain values from Riak that are encrypted with a given key then
re-encrypting with a new key.  So, if there are discrepancies or failed
operations between the actual encrypted values and the search index, there
is a potential for data loss, as we'll be discarding keys once rotated.


Sorry for the long winded description.  Any help would be greatly
appreciated.
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to