Re: REINDEXCOLLECTION gradually slows

2021-03-18 Thread Karl Stoney
A bit more data on this to help... really welcome any ideas as the slow down over time just means we're never really able to finish a reindex via REINDEXCOLLECTION We have disabled auto commits on the destination collection during the reindex, however we're still seeing commits happen as the re

Problem with Backup - Standalone Mode

2021-03-18 Thread adamw
I have a problem while performing a backup. I have a Solr instance in Standalone Mode. I use the following command to do backup: http://solr-content:8983/solr/core_example/replication?command=backup&location=\\resource\backup\core_example&name=core_example In the first time I will see the files

Re: REINDEXCOLLECTION gradually slows

2021-03-18 Thread Karl Stoney
And a bit more... it's almost certainly to do with the commits happening after every batch. When i set rows rather high (for example; 25k) I hold a sustained throughput for a longer duration and on the smaller index, things finish faster. Downside is obviously more memory being used (but we have

Re: Disable commits during a REINDEXCOLLECTION

2021-03-18 Thread Joel Bernstein
Commits need to happen or you'll get a memory leak. There are parts of the transaction log which are held in memory for real-time get. Commits release this memory as indexing occurs. I'm not aware of way to disable this behavior. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Mar 17, 2

Re: Cannot find Solr 7.4.1 release

2021-03-18 Thread Olivier Tavard
Hi, Thanks for your answer. It was just to be sure. I created the binary package meantime. Best regards, Olivier Le jeu. 18 févr. 2021 à 14:30, Shawn Heisey a écrit : > On 2/18/2021 1:05 AM, Olivier Tavard wrote: > > I wanted to download Solr 7.4.1, but I cannot find the 7.4.1 release into >

How to set maxExpansions parameter for fuzzy search

2021-03-18 Thread Olivier Tavard
Hi, I have a question regarding the fuzzy search. By reading previous questions on the ML in the past I saw that the parameter max expansions is set to 50 in the code. I have the same behavior that for other users meaning that if I have a Solrcloud cluster with many shards, I obtain more results f

Solr config and schema linting and validation

2021-03-18 Thread Karl Stoney
Hi all, I'm trying to add some tests to the pipeline which deploys our solr config, and right now i'm working with bash/xmlint/xmlstarlet, and it's a bit crude. Does anyone have any tips/pointers on how best to validate solrconfig.xml and schema.xml. Is there published schemas for specific solr

Re: Solr config and schema linting and validation

2021-03-18 Thread Alexandre Rafalovitch
I am not aware of any, and I looked. In fact I have two pages of notes of what a service like that would look like if I chose to create one. My main thought was that solrconfig and schema are effectively a graph database and that can be used for validation both against rules and for unused/wrong re