Run the same knn queries at a slow throughput for 30-60 minutes, this should warm up disk caches with hnsw index files, and then you should see a significant drop in the query time. Also make use of "fq" and reduce the document space as much as you can.
On Thu, Mar 28, 2024 at 12:50 PM Iram Tariq <iram.ta...@northbaysolutions.net.invalid> wrote: > Hi Alessandro, > > Thank you for the feedback. Kindly see my comments below, > > *Ale*: > https://www.elastic.co/blog/accelerating-vector-search-simd-instructions, > I > suggest to experiment with simD vector improvements (unless you are > already doing it) > > * We will try this soon. * > > *Ale*: What about the machine memory? > > Following is the system specification: Linux ( CPU:64, RAM:488 GB, > OS:Ubuntu 20.04.6 ) > > *Ale*: you can fine-tune the hyper-parameter to compromise a bit on recall > in favour of performance (hnswBeamWidth, hnswMaxConnections) > > I am trying this as a first step. But I am sure it will impact recall. > > Regards, > > > Iram Tariq | Software Architect > > NorthBay > > Direct: +1 (902) 329-7329 > > iram.ta...@northbaysolutions.net > > www.northbaysolutions.com > > > > > On Thu, Mar 28, 2024 at 5:42 AM Alessandro Benedetti <a.benede...@sease.io > > > wrote: > > > That's interesting. > > I think it's vital to get back some performance tests from the community. > > Since my contribution to support Vector-search in Apache Solr was merged, > > we got little or null feedback to understand its performance, in > real-world > > use cases. > > Blogs, open benchmarks or even just this sort of mail message are > welcome. > > Let me reply in line: > > -------------------------- > > *Alessandro Benedetti* > > Director @ Sease Ltd. > > *Apache Lucene/Solr Committer* > > *Apache Solr PMC Member* > > > > e-mail: a.benede...@sease.io > > > > > > *Sease* - Information Retrieval Applied > > Consulting | Training | Open Source > > > > Website: Sease.io <http://sease.io/> > > LinkedIn <https://linkedin.com/company/sease-ltd> | Twitter > > <https://twitter.com/seaseltd> | Youtube > > <https://www.youtube.com/channel/UCDx86ZKLYNpI3gzMercM7BQ> | Github > > <https://github.com/seaseltd> > > > > > > On Wed, 27 Mar 2024 at 21:06, Kent Fitch <kent.fi...@gmail.com> wrote: > > > > > Hi Iram, > > > > > > Is the machine doing lots of IO? If the hnsw graphs are not entirely in > > > memory, performance will be poor. What JVM? You may get some benefit > from > > > simd support in java 21. Can you use the latest quantisation changes in > > > Lucene to reduce memory footprint of the hnsw graphs? That's a large > > topk, > > > but I guess you need it? > > > > > > Best regards > > > > > > Kent Fitch > > > > > > On Thu, 28 Mar 2024, 5:12 am Iram Tariq, > > > <iram.ta...@northbaysolutions.net.invalid> wrote: > > > > > > > Hi All, > > > > > > > > I am using Dense vectors in SOLR and facing slowness in it. Each > search > > > is > > > > taking 10-25 seconds. I want to reduce the time to 5 seconds (or less > > > > ideally). > > > > > > > > Following configurations are being used. > > > > > > > > > > > > 1. *SOLR Version:* 9.3.0 > > > > 2. *Lucene Version:* 9.7.0 > > > > > *Ale*: > > https://www.elastic.co/blog/accelerating-vector-search-simd-instructions > , > > I > > suggest to experiment with simD vector improvements (unless you are > > already doing it) > > > > > > 3. *Vector Dimensions*: 384 > > > > 4. *Total Shards:* 5 > > > > 5. *Number of Vectors (Per shard*): 43209158 > > > > 6. *JVM for each Instance:* 35GB > > > > > *Ale*: What about the machine memory? > > > > > > 7. *TopK: *1000 (Getting 1000 from each shard) > > > > 8. *Rows: *1000 > > > > 9. *Vector Field Schema: *<fieldType name="knn_vector_384" > > > > class="solr.DenseVectorField" hnswMaxConnections="20" > > > > knnAlgorithm="hnsw" > > > > vectorDimension="384" similarityFunction="cosine" > > hnswBeamWidth="40"/> > > > > > *Ale*: you can fine-tune the hyper-parameter to compromise a bit on > recall > > in favour of performance (hnswBeamWidth, hnswMaxConnections) > > > > > > 10. *Stored*: False > > > > 11. *WebServer:* Apache Tomcat > > > > 12. *System Specs*: Linux ( CPU:64, RAM:488 GB, OS:Ubuntu > 20.04.6 ) > > > > > > > > Any sort of help/clue will be appreciated. > > > > > > > > > > > > > > > > Regards, > > > > > > > > > > > > Iram Tariq | Software Architect > > > > > > > > NorthBay > > > > > > > > Direct: +1 (902) 329-7329 > > > > > > > > iram.ta...@northbaysolutions.net > > > > > > > > www.northbaysolutions.com > > > > > > > > > >