Re: Performance Suggestion for Dense Vectors

rajani m Thu, 28 Mar 2024 13:56:16 -0700

btw this article[1]  by Tom Burgmans is relevant. It is posted on linkedin
though and never came into my feed. I found it on the relevance-search
slack channel.
[1]
https://www.linkedin.com/pulse/testing-dense-vector-search-scale-part-1-ann-tom-burgmans-0tale/



On Thu, Mar 28, 2024 at 3:47 PM rajani m <rajinima...@gmail.com> wrote:

> @Alessandro,
> Is there a solr blog site where we can submit work/articles or are you
> suggesting to post on my own site and share a link here? I prefer the
> former if there is one because there were times when I had my own,
> it hardly had any views and on top of that google blogging made me migrate
> from blogs to sites and sites got deprecated. Is there or can we have a
> solr specific wiki/blog site  where solr users can submit common features
> configs/modules configs/examples/performance metrics and so on....and maybe
> have a voting/likes to confirm it works. We will have one common place to
> submit and look for.
>
>
>
> On Thu, Mar 28, 2024 at 3:33 PM rajani m <rajinima...@gmail.com> wrote:
>
>> Run the same knn queries at a slow throughput  for 30-60 minutes, this
>> should warm up disk caches with hnsw index files, and then you should see a
>> significant drop in the query time. Also make use of "fq" and reduce the
>> document space as much as you can.
>>
>> On Thu, Mar 28, 2024 at 12:50 PM Iram Tariq
>> <iram.ta...@northbaysolutions.net.invalid> wrote:
>>
>>> Hi  Alessandro,
>>>
>>> Thank you for the feedback. Kindly see my comments below,
>>>
>>> *Ale*:
>>> https://www.elastic.co/blog/accelerating-vector-search-simd-instructions,
>>> I
>>> suggest to experiment with simD vector improvements  (unless you are
>>> already doing it)
>>>
>>> * We will try this soon. *
>>>
>>> *Ale*: What about the machine memory?
>>>
>>> Following is the system specification:  Linux ( CPU:64, RAM:488 GB,
>>> OS:Ubuntu 20.04.6 )
>>>
>>> *Ale*: you can fine-tune the hyper-parameter to compromise a bit on
>>> recall
>>> in favour of performance  (hnswBeamWidth, hnswMaxConnections)
>>>
>>> I am trying this as a first step. But I am sure it will impact recall.
>>>
>>> Regards,
>>>
>>>
>>> Iram Tariq | Software Architect
>>>
>>> NorthBay
>>>
>>> Direct:  +1 (902) 329-7329
>>>
>>> iram.ta...@northbaysolutions.net
>>>
>>> www.northbaysolutions.com
>>>
>>>
>>>
>>>
>>> On Thu, Mar 28, 2024 at 5:42 AM Alessandro Benedetti <
>>> a.benede...@sease.io>
>>> wrote:
>>>
>>> > That's interesting.
>>> > I think it's vital to get back some performance tests from the
>>> community.
>>> > Since my contribution to support Vector-search in Apache Solr was
>>> merged,
>>> > we got little or null feedback to understand its performance, in
>>> real-world
>>> > use cases.
>>> > Blogs, open benchmarks or even just this sort of mail message are
>>> welcome.
>>> > Let me reply in line:
>>> > --------------------------
>>> > *Alessandro Benedetti*
>>> > Director @ Sease Ltd.
>>> > *Apache Lucene/Solr Committer*
>>> > *Apache Solr PMC Member*
>>> >
>>> > e-mail: a.benede...@sease.io
>>> >
>>> >
>>> > *Sease* - Information Retrieval Applied
>>> > Consulting | Training | Open Source
>>> >
>>> > Website: Sease.io <http://sease.io/>
>>> > LinkedIn <https://linkedin.com/company/sease-ltd> | Twitter
>>> > <https://twitter.com/seaseltd> | Youtube
>>> > <https://www.youtube.com/channel/UCDx86ZKLYNpI3gzMercM7BQ> | Github
>>> > <https://github.com/seaseltd>
>>> >
>>> >
>>> > On Wed, 27 Mar 2024 at 21:06, Kent Fitch <kent.fi...@gmail.com> wrote:
>>> >
>>> > > Hi Iram,
>>> > >
>>> > > Is the machine doing lots of IO? If the hnsw graphs are not entirely
>>> in
>>> > > memory, performance will be poor. What JVM? You may get some benefit
>>> from
>>> > > simd support in java 21. Can you use the latest quantisation changes
>>> in
>>> > > Lucene to reduce memory footprint of the hnsw graphs? That's a large
>>> > topk,
>>> > > but I guess you need it?
>>> > >
>>> > > Best regards
>>> > >
>>> > > Kent Fitch
>>> > >
>>> > > On Thu, 28 Mar 2024, 5:12 am Iram Tariq,
>>> > > <iram.ta...@northbaysolutions.net.invalid> wrote:
>>> > >
>>> > > > Hi All,
>>> > > >
>>> > > > I am using Dense vectors in SOLR and facing slowness in it. Each
>>> search
>>> > > is
>>> > > > taking 10-25 seconds. I want to reduce the time to 5 seconds (or
>>> less
>>> > > > ideally).
>>> > > >
>>> > > > Following configurations are being used.
>>> > > >
>>> > > >
>>> > > >    1. *SOLR Version:* 9.3.0
>>> > > >    2. *Lucene Version:* 9.7.0
>>> > >
>>> > *Ale*:
>>> >
>>> https://www.elastic.co/blog/accelerating-vector-search-simd-instructions
>>> ,
>>> > I
>>> > suggest to experiment with simD vector improvements  (unless you are
>>> > already doing it)
>>> >
>>> > > >    3. *Vector Dimensions*: 384
>>> > > >    4. *Total Shards:* 5
>>> > > >    5. *Number of Vectors (Per shard*): 43209158
>>> > > >    6. *JVM for each Instance:* 35GB
>>> > >
>>> > *Ale*: What about the machine memory?
>>> >
>>> > > >    7. *TopK: *1000  (Getting 1000 from each shard)
>>> > > >    8. *Rows: *1000
>>> > > >    9. *Vector Field Schema:  *<fieldType name="knn_vector_384"
>>> > > >    class="solr.DenseVectorField" hnswMaxConnections="20"
>>> > > > knnAlgorithm="hnsw"
>>> > > >    vectorDimension="384" similarityFunction="cosine"
>>> > hnswBeamWidth="40"/>
>>> > >
>>> > *Ale*: you can fine-tune the hyper-parameter to compromise a bit on
>>> recall
>>> > in favour of performance  (hnswBeamWidth, hnswMaxConnections)
>>> >
>>> > > >    10. *Stored*: False
>>> > > >    11. *WebServer:* Apache Tomcat
>>> > > >    12. *System Specs*:  Linux ( CPU:64, RAM:488 GB, OS:Ubuntu
>>> 20.04.6 )
>>> > > >
>>> > > > Any sort of help/clue will be appreciated.
>>> > > >
>>> > > >
>>> > > >
>>> > > > Regards,
>>> > > >
>>> > > >
>>> > > > Iram Tariq | Software Architect
>>> > > >
>>> > > > NorthBay
>>> > > >
>>> > > > Direct:  +1 (902) 329-7329
>>> > > >
>>> > > > iram.ta...@northbaysolutions.net
>>> > > >
>>> > > > www.northbaysolutions.com
>>> > > >
>>> > >
>>> >
>>>
>>

Re: Performance Suggestion for Dense Vectors

Reply via email to