jpountz commented on code in PR #14963:
URL: https://github.com/apache/lucene/pull/14963#discussion_r2248864216
##########
lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsFormat.java:
##########
@@ -137,9 +144,16 @@ public final class Lucene99HnswVectorsFormat extends KnnVectorsFormat {
private final int numMergeWorkers;
private final TaskExecutor mergeExec;
+ /**
+ * Whether to bypass HNSW graph building for tiny segments (below {@link #HNSW_GRAPH_THRESHOLD}).
+ * When enabled, segments with fewer than the threshold number of vectors will store only flat
+ * vectors, significantly improving indexing performance for workloads with frequent flushes.
+ */
+ private final boolean bypassTinySegments;
Review Comment:
It may be good to be able to pass random values in RandomCodec as well to
help with the test coverage of the approximate case (we have very few tests
that index more than 10k vectors).
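
A rough sketch of the idea, not the PR's actual API: the class and method names and the threshold value of 10,000 below are assumptions for illustration only. If the flag is picked at random per test run, roughly half of the runs would still build a graph for small segments, so the approximate search path keeps getting exercised even when a test indexes only a handful of vectors.

```java
import java.util.Random;

/** Hypothetical stand-in for the decision described in the Javadoc above. */
public class TinySegmentSketch {
  // Assumed value; the real HNSW_GRAPH_THRESHOLD is defined in the PR.
  static final int HNSW_GRAPH_THRESHOLD = 10_000;

  /** Returns true when an HNSW graph would be built for a segment of the given size. */
  static boolean shouldBuildGraph(boolean bypassTinySegments, int numVectors) {
    // With the bypass enabled, segments below the threshold keep only flat vectors.
    return !(bypassTinySegments && numVectors < HNSW_GRAPH_THRESHOLD);
  }

  /** Picks the flag at random, the way a randomized test codec might. */
  static boolean randomBypass(Random random) {
    return random.nextBoolean();
  }

  public static void main(String[] args) {
    Random random = new Random(42L); // fixed seed only to keep the demo reproducible
    boolean bypass = randomBypass(random);
    System.out.println("bypassTinySegments=" + bypass
        + ", graph built for 100 vectors: " + shouldBuildGraph(bypass, 100));
  }
}
```

With the flag disabled, even a 100-vector test segment goes through graph construction, which is the coverage this comment is after.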
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.