Re: Limiting Backup IO

2023-06-21 Thread Pierre Salagnac
Thanks for starting this thread David. I've been internally working on this, since we have issues (query failures) during backups of big collections because of IO saturation. I see two different approaches to solve this: 1. Throttle at the IO level, like David mentioned. 2. Limit the number of co

Re: Limiting Backup IO

2023-06-29 Thread Pierre Salagnac
Jason, I haven't done much scalability testing, so it's hard to give accurate numbers on when we start having issues. For the environment I looked in detail we run a 16 nodes cluster, and the collection I wasn't able to backup has about 1500 shards, ~1.5 GB each. Core backups/restores are expensiv

Re: Limiting Backup IO

2023-07-06 Thread Pierre Salagnac
know what you think Thanks Le jeu. 29 juin 2023 à 15:37, Pierre Salagnac a écrit : > Jason, I haven't done much scalability testing, so it's hard to give > accurate numbers on when we start having issues. > For the environment I looked in detail we run a 16 nodes cluster, and t

Re: Edismax parsing when using pf parameter

2023-09-06 Thread Pierre Salagnac
I opened a pull request[1] that fixes the case reported. The issue was subqueries with grouped fields like "field:(term1 term2 term3), only the first term was skipped when generating the boost query with fields specified in pf parameter. Unfortunately, this pre-parsing (method splitIntoClauses())

Re: Limit IO while running solr backup

2024-07-22 Thread Pierre Salagnac
Hi Saksham, What Solr version do you run? With SOLR-16879 in Solr 9.4, a new throttling was added to limit the number of concurrent backups per node. If I recall well, the default is 5 per node. Before this fix, all the replica snapshots were started concurrently. As far as I know, there is no me

Re: Significant Backup/Restore Performance Degradation for Large Collections

2024-08-06 Thread Pierre Salagnac
Hi Hakan, > I was also wondering if the max thread size, which is currently 5 in 9.4+, could be configurable Yes, this makes sense to have this configuration. See EnvUtils class, which is now the recommended way to get system properties or environment variables in Solr code. The behavior before

Re: Slow system info

2025-03-26 Thread Pierre Salagnac
What's the running Solr version? Significant performance improvements were done recently with SOLR-17340 and SOLR-17441, respectively in versions 9.7 and 9.8. Le mer. 26 mars 2025 à 14:56, Scott Q. a écrit : > I run a 5 node Solr Cloud setup and for some reason the busier > the cluster gets, the