Make sure you use fully qualified domain names everywhere, and that they are routable everywhere. If you start Solr with -h my-short-name then it will be persisted in zk state and will not be routable from all other solr hosts.
Jan Høydahl > 25. apr. 2024 kl. 13:34 skrev Ritesh Kumar (Accenture International Limite) > <v-kur...@microsoft.com.invalid>: > > Hello, > > One additional observation, I do not face the issue if I use IP addresses for > the solr nodes but I observe this issue as soon as I modify solr service to > use hostname instead of the IP address. > > start -f -h XX.XX.XX.XX -p 8983 -cloud -Dsolr.ssl.checkPeerName=false -m 8g > -z > "XX.X.X.XX:2181,YY.Y.Y.YY:2181,ZZ.Z.Z.ZZ:2181,AA.A.A.AA:2181,BB.B.B.BB:2181" > > start -f -h HOSTNAME -p 8983 -cloud -Dsolr.ssl.checkPeerName=false -m 8g -z > "XX.X.X.XX:2181,YY.Y.Y.YY:2181,ZZ.Z.Z.ZZ:2181,AA.A.A.AA:2181,BB.B.B.BB:2181" > > > Thanks > > Ritesh > > ________________________________ > From: Jan H?ydahl <jan....@cominvent.com> > Sent: Wednesday, April 24, 2024 02:21 PM > To: users@solr.apache.org <users@solr.apache.org> > Subject: Re: [EXTERNAL] SOLR Shard nodes are unhealthy > > Hi, > > Indeed, Solr 8.11 used zk client v3.6.2, but I believe it will also work with > newer 3.x versions, you can test this locally on your laptop first to be > sure. > > Note also that network resolution should work between all nodes on the FQDN. > So if your ZK1 node is called ZK1.region3.example.com and is on ip 10.0.1.1 > then it should be routable on that FQDN address from all solr and zk nodes, > and a reverse lookup on IP should also return the same host name. > > Jan > >> 24. apr. 2024 kl. 10:38 skrev Ritesh Kumar (Accenture International Limite) >> <v-kur...@microsoft.com.INVALID>: >> >> Hi Jan, >> >> Your assumptions are correct as those are setup correctly. I will provide >> the logs. >> >> Meanwhile I can give a try with SOLR 8.11.3 and see how it behaves. Moreover >> just wanted to check one thing: >> >> SOLR 8.11 document says "Solr currently uses Apache ZooKeeper v3.6.2." >> Setting Up an External ZooKeeper Ensemble | Apache Solr Reference Guide >> 8.11<https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fsolr.apache.org%2Fguide%2F8_11%2Fsetting-up-an-external-zookeeper-ensemble.html&data=05%7C02%7Cv-kurite%40microsoft.com%7Ca075331631884b3cfb1808dc643bc41a%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C638495455093288541%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=2bN%2B4oq9MJcmNv315lnR4Op9OvPG4yP0cF%2FQDye9%2Ffs%3D&reserved=0<https://solr.apache.org/guide/8_11/setting-up-an-external-zookeeper-ensemble.html>> >> >> However looking at Apache >> ZooKeeper<https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fzookeeper.apache.org%2Freleases.html&data=05%7C02%7Cv-kurite%40microsoft.com%7Ca075331631884b3cfb1808dc643bc41a%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C638495455093299320%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=78iVlW3%2BRilLuGrFr%2FzqdnYYRdDvM5H891djm4ZFI%2B0%3D&reserved=0<https://zookeeper.apache.org/releases.html>>, >> it says "Apache ZooKeeper 3.9.2 is our current release, and 3.8.4 our >> latest stable release." >> >> Hence i wanted to check if i can install zookeeper 3.8.4 along with solr >> 8.11.3 or i have to use ZooKeeper v3.6.2 only. >> >> >> Thanks >> >> Ritesh >> >> ________________________________ >> From: Jan H?ydahl <jan....@cominvent.com> >> Sent: Wednesday, April 24, 2024 01:41 PM >> To: users@solr.apache.org <users@solr.apache.org> >> Subject: [EXTERNAL] Re: SOLR Shard nodes are unhealthy >> >> Hi, >> >> First, you should use the latest 8.11.3 which has a lot of fixes. >> >> Your setup could work if the regions are connected with really low latency >> and high bandwidth. >> >> Your description suggests that your Solr-ZK connection is not as it should. >> >> A few assumptions for this to work: >> * You ave installed ZK correctly as one five-node ensemble, and validated it >> works flawlessly >> * All your Solr nodes lists all five ZK nodes in ZK_HOST, e.g. >> ZK_HOST=zk1:2181,zk2:2182,zk3:2181,zk4:2181,zk5:2181 >> >> You'll have to provide more information regarding your exact configuration >> and/or error log messages to get further on this. >> >> Jan >> >>>> 24. apr. 2024 kl. 04:32 skrev Ritesh Kumar (Accenture International >>>> Limite) <v-kur...@microsoft.com.INVALID>: >>> >>> Hello All, >>> >>> I have setup SOLR-ZK ensemble (Windows) with 6 SOLR VMs across two regions >>> and 5 ZK VMs across three regions. >>> >>> SOLR Version - 8.11.2 >>> ZK Version - 3.6.2 >>> >>> Region1: >>> SOLR1 >>> SOLR2 >>> SOLR3 >>> ZK2 >>> ZK3 >>> >>> Region2: >>> SOLR4 >>> SOLR5 >>> SOLR6 >>> ZK4 >>> ZK5 >>> >>> Region3: >>> ZK1 >>> >>> I am facing issue after restarting ZK and SOLR VMs . The SOLR UI is showing >>> all the 6 SOLR nodes and 5 ZK nodes are active and with leader. >>> >>> All the 3 Solr nodes (1,2,3) of each shard is in recovery and later down >>> and remaining 3 solr nodes (4,5,6) are healthy. Now if i restart solr nodes >>> 4,5,6 they go to recovery/down and nodes 1,2,3 become healthy - but all 6 >>> nodes are never becoming healthy together. I have tried restarting Solr and >>> ZK multiple times. >>> >>> Please let me know if anyone has faced this issue or has suggestions for a >>> fix. >>> >>> >>> >>> >>> Note: I cannot upgrade to latest SOLR version as this is recommened by >>> Sitecore. >>> >>> Thanks >>> Ritesh >> >