Hi,

I need to reindex a bucket with a ~4 million items. If I do a
streaming list keys using the Erlang client and then read/write the
items as they keys come in it puts too much load on the cluster and
other mapred queries that get run timeout. I already have a date based
index on the items and was thinking getting items based on hourly
chunks and update them in batches that way as the times should be
relatively evenly distributed in time. I can then better control the
flow and load of the reindexing operations.

Anyone have any better ideas or use any other strategies when having to reindex?

--
Jeremy

_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to