Hi,
I recently needed to alter my Riak Search schema for a bucket type that
contains ~30 millions rows. As a result, my index was wiped since we are
waiting for a Riak Search 2.2 feature that will sync Riak storage with
Solr index on such an occasion.
I adapted a since script suggested by Evren Esat Özkan there
(https://github.com/basho/yokozuna/issues/130#issuecomment-196189344).
It is a simple python script that will stream keys and trigger a store
action for any items. Unfortunately it failed past 178k items due to
time out on the key stream. I calculated that this kind of reindexation
mechanism would take up to 5 days without a crash to succeed.
I was wondering if there would be a pure Erlang mean to achieve a
complete forced rewrite of every single element in my bucket type rather
that an error prone and very long python process.
How would you guys reindex a 30 million item bucket type in a fast and
reliable way ?
Thanks, Guillaume
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com