Hi Rahul,

This sounds very interesting!

I enjoyed the discussion at CoC and would be very
interested to hear more about the technical details.

I am also curious to know more what you mean by "in-place"
and what the expectation is around downtime.

Either way I am sure this would be a great addition to
the tool belt for getting people to finally move off
ancient versions of Solr.

Look forward to discussing this more on the JIRA!

Luke

From: users@solr.apache.org At: 03/28/25 01:05:57 UTC-4:00To:  
users@solr.apache.org
Subject: Automatic upgrade of Solr indexes over multiple versions

Today upgrading from Solr version X to X+2 requires complete reingestion of
data from source. This comes from Lucene's constraint which only guarantees
index compatibility between the version the index was created in and the
immediate next version.


This reindexing usually comes with added downtime and/or cost. Especially
in case of deployments which are in customer environments and not
completely in control of the vendor, this proposition of having to
completely reindex the data can become a hard sell.


I have developed a way which achieves this reindexing in-place on the same
index. Also, the process automatically keeps "upgrading" the indexes over
multiple subsequent Solr upgrades without needing manual intervention.


It does come with a limitation that all *source* fields need to be either
stored=true or docValues=true. Any copyField destination fields can be
stored=false of course, but as long as the source field (or in general, the
fields you care about preserving) is either stored or docValues true , the
tool can reindex in-place and legitimately "upgrade" the index. For indexes
where this limitation is not a problem (it wasn't for us!), this tool can
remove a lot of operational headaches, especially in environments with
hundreds/thousands of very large indexes.


I had a conversation about this with some of you during "Apache Community
over Code 2024" in Denver, and I could sense some interest. If this feature
sounds appealing, I would like to contribute it to Solr on behalf of my
employer, Commvault. Please let me know if I should create a JIRA and get
the discussion rolling!


Thanks,
Rahul Goswami


Reply via email to