On 2022-03-31 9:29 AM, Tealdi Paolo wrote:
Hi Eric

Many thanks for the answer.
I noticed that reindexcollection seems to be SLOWER than DIH import.

(Warning: there be python there)

This is trimmed down from a working script: https://gist.github.com/dmaziuk/57b9c1926578bc10f12c0999c4b7ab53

It is slower than DIH. It commits every document, that's likely part of it. I think in your case, if both cores reside on the same server, you will have contention and extra slow-down from that -- compared to pulling from one server and pushing to another. So I wouldn't expect it to be blazing fast.

The part where it pulls IDs from Solr is trivially modified to pull whole records from your source index, so if you can write python, you can adjust it for your use and see how it goes.

Dima

Reply via email to