Run the same knn queries at a slow throughput  for 30-60 minutes, this
should warm up disk caches with hnsw index files, and then you should see a
significant drop in the query time. Also make use of "fq" and reduce the
document space as much as you can.

On Thu, Mar 28, 2024 at 12:50 PM Iram Tariq
<iram.ta...@northbaysolutions.net.invalid> wrote:

> Hi  Alessandro,
>
> Thank you for the feedback. Kindly see my comments below,
>
> *Ale*:
> https://www.elastic.co/blog/accelerating-vector-search-simd-instructions,
> I
> suggest to experiment with simD vector improvements  (unless you are
> already doing it)
>
> * We will try this soon. *
>
> *Ale*: What about the machine memory?
>
> Following is the system specification:  Linux ( CPU:64, RAM:488 GB,
> OS:Ubuntu 20.04.6 )
>
> *Ale*: you can fine-tune the hyper-parameter to compromise a bit on recall
> in favour of performance  (hnswBeamWidth, hnswMaxConnections)
>
> I am trying this as a first step. But I am sure it will impact recall.
>
> Regards,
>
>
> Iram Tariq | Software Architect
>
> NorthBay
>
> Direct:  +1 (902) 329-7329
>
> iram.ta...@northbaysolutions.net
>
> www.northbaysolutions.com
>
>
>
>
> On Thu, Mar 28, 2024 at 5:42 AM Alessandro Benedetti <a.benede...@sease.io
> >
> wrote:
>
> > That's interesting.
> > I think it's vital to get back some performance tests from the community.
> > Since my contribution to support Vector-search in Apache Solr was merged,
> > we got little or null feedback to understand its performance, in
> real-world
> > use cases.
> > Blogs, open benchmarks or even just this sort of mail message are
> welcome.
> > Let me reply in line:
> > --------------------------
> > *Alessandro Benedetti*
> > Director @ Sease Ltd.
> > *Apache Lucene/Solr Committer*
> > *Apache Solr PMC Member*
> >
> > e-mail: a.benede...@sease.io
> >
> >
> > *Sease* - Information Retrieval Applied
> > Consulting | Training | Open Source
> >
> > Website: Sease.io <http://sease.io/>
> > LinkedIn <https://linkedin.com/company/sease-ltd> | Twitter
> > <https://twitter.com/seaseltd> | Youtube
> > <https://www.youtube.com/channel/UCDx86ZKLYNpI3gzMercM7BQ> | Github
> > <https://github.com/seaseltd>
> >
> >
> > On Wed, 27 Mar 2024 at 21:06, Kent Fitch <kent.fi...@gmail.com> wrote:
> >
> > > Hi Iram,
> > >
> > > Is the machine doing lots of IO? If the hnsw graphs are not entirely in
> > > memory, performance will be poor. What JVM? You may get some benefit
> from
> > > simd support in java 21. Can you use the latest quantisation changes in
> > > Lucene to reduce memory footprint of the hnsw graphs? That's a large
> > topk,
> > > but I guess you need it?
> > >
> > > Best regards
> > >
> > > Kent Fitch
> > >
> > > On Thu, 28 Mar 2024, 5:12 am Iram Tariq,
> > > <iram.ta...@northbaysolutions.net.invalid> wrote:
> > >
> > > > Hi All,
> > > >
> > > > I am using Dense vectors in SOLR and facing slowness in it. Each
> search
> > > is
> > > > taking 10-25 seconds. I want to reduce the time to 5 seconds (or less
> > > > ideally).
> > > >
> > > > Following configurations are being used.
> > > >
> > > >
> > > >    1. *SOLR Version:* 9.3.0
> > > >    2. *Lucene Version:* 9.7.0
> > >
> > *Ale*:
> > https://www.elastic.co/blog/accelerating-vector-search-simd-instructions
> ,
> > I
> > suggest to experiment with simD vector improvements  (unless you are
> > already doing it)
> >
> > > >    3. *Vector Dimensions*: 384
> > > >    4. *Total Shards:* 5
> > > >    5. *Number of Vectors (Per shard*): 43209158
> > > >    6. *JVM for each Instance:* 35GB
> > >
> > *Ale*: What about the machine memory?
> >
> > > >    7. *TopK: *1000  (Getting 1000 from each shard)
> > > >    8. *Rows: *1000
> > > >    9. *Vector Field Schema:  *<fieldType name="knn_vector_384"
> > > >    class="solr.DenseVectorField" hnswMaxConnections="20"
> > > > knnAlgorithm="hnsw"
> > > >    vectorDimension="384" similarityFunction="cosine"
> > hnswBeamWidth="40"/>
> > >
> > *Ale*: you can fine-tune the hyper-parameter to compromise a bit on
> recall
> > in favour of performance  (hnswBeamWidth, hnswMaxConnections)
> >
> > > >    10. *Stored*: False
> > > >    11. *WebServer:* Apache Tomcat
> > > >    12. *System Specs*:  Linux ( CPU:64, RAM:488 GB, OS:Ubuntu
> 20.04.6 )
> > > >
> > > > Any sort of help/clue will be appreciated.
> > > >
> > > >
> > > >
> > > > Regards,
> > > >
> > > >
> > > > Iram Tariq | Software Architect
> > > >
> > > > NorthBay
> > > >
> > > > Direct:  +1 (902) 329-7329
> > > >
> > > > iram.ta...@northbaysolutions.net
> > > >
> > > > www.northbaysolutions.com
> > > >
> > >
> >
>

Reply via email to