mikemccand commented on issue #13519:
URL: https://github.com/apache/lucene/issues/13519#issuecomment-2194673340
OK, I managed to run `knnPerfTest.py` from `luceneutil` using `mpnet`
vectors (768 dims), and I think I am also seeing horrific recall (0.002) for
`int8`, while `int4` and `int7` look OK:
```
quantizedBits  recall  latency  nDoc    fanout  maxConn  beamWidth  visited  index ms
32             0.983   2.55     250000  20      64       250        9294     66542     1.00  post-filter
4              0.645   2.13     250000  20      64       250        13010    79605     1.00  post-filter
7              0.943   1.87     250000  20      64       250        11775    78806     1.00  post-filter
8              0.002   2.81     250000  20      64       250        23879    79177     1.00  post-filter
```
NOTE: this is my first time successfully running `knnPerfTest.py` so it's
entirely possible I messed something up! But given that I'm seeing decent
recall for unquantized (32 bit) and 7 bit, I think the 8 bit result is
believable and horrible.
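
For anyone reading the recall column, here is a minimal sketch of how such a number is commonly computed, assuming recall means the fraction of the exact top-k neighbors that the approximate search also returned, averaged over queries. This is only an illustration and not the code `knnPerfTest.py` actually runs; `recall_at_k` and the toy ID lists are hypothetical.

```python
def recall_at_k(approx_ids, exact_ids):
    """Fraction of exact top-k neighbors also present in the approximate results.

    Both arguments are lists with one list of doc IDs per query.
    (Hypothetical helper for illustration, not part of luceneutil.)
    """
    hits = 0
    total = 0
    for approx, exact in zip(approx_ids, exact_ids):
        hits += len(set(approx) & set(exact))  # true neighbors that were found
        total += len(exact)                    # true neighbors that exist
    return hits / total


# Toy data: two queries, k = 4 (made-up doc IDs)
exact = [[1, 2, 3, 4], [5, 6, 7, 8]]
good = [[1, 2, 3, 9], [5, 6, 7, 8]]           # misses one true neighbor
broken = [[10, 11, 12, 13], [14, 15, 16, 5]]  # finds only one true neighbor

print(recall_at_k(good, exact))    # 0.875
print(recall_at_k(broken, exact))  # 0.125
```

Under that assumption, a recall of 0.002 would mean only about 2 in every 1000 true neighbors come back, which is close to returning unrelated documents.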
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]