Mazen Raafat created SOLR-15353:
-----------------------------------

             Summary: Querying multiple collection performance issue
                 Key: SOLR-15353
                 URL: https://issues.apache.org/jira/browse/SOLR-15353
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
          Components: search
    Affects Versions: 8.8.1
         Environment: SolrCloud setup: 3 ZooKeeper servers and 2 SolrCloud 
nodes

Each Solr node is hosted on an AWS m5.xlarge EC2 instance with 8 GB of RAM 
dedicated to the Solr JVM heap.
In this environment each collection has one shard and 2 replicas.

For benchmarking I used JMeter, with the thread group set to 50 and the loop 
count set to 500.

 
            Reporter: Mazen Raafat
         Attachments: querying alias points to collection 1 and collection 
2.png, querying collection 1.png, querying collection 2.png, thread group.png

Performance degrades when querying multiple collections, either through an 
alias that points to multiple collections or by calling the search handler 
directly with the collection query parameter, as follows:

{{http://localhost:8983/solr/collection1/select?collection=collection1,collection2,collection3}}
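
For illustration, a request URL of this shape could be assembled as in the sketch below. The helper name build_select_url and the q=*:* query are assumptions made for the example (the report does not state the actual query); only the host, path, and collection names come from the URL above.

```python
from urllib.parse import urlencode


def build_select_url(base_url, handler_collection, collections, query="*:*"):
    """Build a /select URL that searches several collections at once.

    build_select_url and the default query are hypothetical; the report
    only shows the path and the comma-separated collection parameter.
    """
    params = {
        "q": query,
        # Comma-separated list of collections, as in the URL from the report.
        "collection": ",".join(collections),
    }
    # safe=",:*" keeps commas, colons and asterisks unescaped for readability.
    query_string = urlencode(params, safe=",:*")
    return f"{base_url}/solr/{handler_collection}/select?{query_string}"


url = build_select_url(
    "http://localhost:8983",
    "collection1",
    ["collection1", "collection2", "collection3"],
)
print(url)
```

The collection the request is addressed to (collection1 here) only selects the handler; the collection parameter lists what is actually searched.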

 

In the first test I queried a collection with about 40k docs; the 
throughput was ~3k req/sec.

In the second test I queried another collection with about 4k docs; the 
throughput was ~3.5k req/sec.

In the third test I queried an alias that points to both collections, and 
voilà! The throughput dropped to ~200 req/sec!

I also tried skipping the alias and using 
solrurl/solr/collection1/select?collection=collection2, and got the same result.
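
To put the three figures side by side, the rough arithmetic can be sketched as follows. It assumes the JMeter setting means 50 threads, each running 500 loops with one request per loop, and it uses the approximate throughputs quoted above, so the results are estimates rather than measured values.

```python
# Assumed JMeter plan: 50 threads x 500 loops, one request per loop iteration.
THREADS = 50
LOOPS = 500
total_requests = THREADS * LOOPS

# Approximate throughputs quoted in the report, in requests per second.
reported = {
    "collection 1": 3000,
    "collection 2": 3500,
    "alias (both collections)": 200,
}


def run_seconds(throughput):
    """Estimated wall-clock time to issue all requests at a given rate."""
    return total_requests / throughput


alias_rate = reported["alias (both collections)"]
for name, rate in reported.items():
    slowdown = rate / alias_rate
    print(f"{name}: ~{run_seconds(rate):.1f} s per run, "
          f"{slowdown:.1f}x the alias throughput")
```

Under these assumptions the alias run is roughly 15 to 17.5 times slower than either single-collection run.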

 

Notes:
 # Both collections have the same schema.
 # The query and filter query are the same in all tests.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
