[
https://issues.apache.org/jira/browse/SOLR-17892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18027887#comment-18027887
]
Chris M. Hostetter commented on SOLR-17892:
-------------------------------------------
{quote}Here's the fix we're trying out
[https://github.com/rapidsai/cuvs-lucene/pull/29#issuecomment-3362393989]
{quote}
Doesn't this just kick the problem down the road?
The changes you've linked to eliminate the problem with upgrading to 10.3, by
not having compile time deps (in {{{}cuvs-lucene{}}}) on any codec related
classes that have moved (from {{*.codec.*}} to {{*.backward_codec.*}} ) between
10.2 and 10.3 .... but there _*ARE*_ still compile time deps (in
{{{}cuvs-lucene{}}}) on codec classes that might/will move in some other future
lucene version.
If/when (and it's only a matter of time) classes like
{{Lucene99FlatVectorsFormat}} , {{{}Lucene99HnswVectorsReader{}}}, and/or
{{Lucene99HnswVectorsWriter}} get replaced by newer/better versions, and move
to the {{backward_codec}} namespace in Lucene-10.Z, Solr will be unable to
upgrade to Lucene-10.Z unless/until {{cuvs-lucene}} comes out with a new "10.Z
compatible" version.
In the future we (Solr) may get stuck between a rock and a hard place when
attempting a Lucene upgrades that trigger {{NoClassDefFoundError}} in
{{modules/cuvs}} tests – we will be forced to choose:
* Let the Lucene upgrade be blocked indefinitely until a third-party project (
{{cuvs-lucene}} ) gets around to a compatible release.
* Stop supporting {{modules/cuvs}} in what may be a _MINOR_ Solr release,
potentially screwing our users who have built indexes using that module
I really don't like the prospect of either option.
> Add support for cuVS-Lucene as a pluggable codec in Solr
> --------------------------------------------------------
>
> Key: SOLR-17892
> URL: https://issues.apache.org/jira/browse/SOLR-17892
> Project: Solr
> Issue Type: New Feature
> Components: vector-search
> Reporter: Vivek Narang
> Assignee: Ishan Chattopadhyaya
> Priority: Major
> Labels: pull-request-available
> Fix For: main (10.0)
>
> Time Spent: 3h
> Remaining Estimate: 0h
>
> This issue proposes adding *cuVS-Lucene* as a pluggable codec in Solr to
> enable GPU-accelerated vector indexing.
> *Background*
> * [cuVS-Lucene|https://github.com/rapidsai/cuvs-lucene] is a new NVIDIA
> project that integrates GPU acceleration into Lucene for vector search.
> * It supports building HNSW graphs on GPUs via the state-of-the-art *CAGRA*
> algorithm.
> * The first official release of cuVS-Lucene is planned for {*}early
> October{*}.
> * At present, artifacts are not yet published on Maven Central. For early
> adoption, development, and testing, SearchScale is publishing temporary
> artifacts to its Maven repository. Once released, official artifacts will be
> available on Maven Central, and the PR will be updated to use the released
> artifact accordingly.
>
> *Usage in Solr*
> This change introduces the ability to configure Solr to use the
> {{*Lucene101AcceleratedHNSWVectorsFormat*}} provided by the cuVS-Lucene.
> Users can opt to use the GPU-accelerated indexing by selecting the
> {{CuVSCodec}} codec, while retaining compatibility with existing CPU-based
> codecs. Documentation will include steps on how to enable and configure this
> format within Solr. Prerequisites includes a
>
> *Testing Strategy*
> * NVIDIA is dedicating GPU resources for testing cuVS-Lucene directly.
> * In Solr’s test framework:
> ** Tests for cuVS-Lucene will be skipped on non-GPU machines.
> ** On GPU-enabled machines, the tests will run fully, validating integration.
> * We are also exploring the possibility of contributing GPU resources to
> Apache Solr’s CI infrastructure for continuous GPU test coverage (to be
> discussed further with the community).
>
> *Benchmarks*
> We will publish benchmark results shortly to demonstrate the performance
> improvements of GPU-accelerated HNSW graph construction compared to CPU-only
> implementations.
> *Motivation for Solr 10*
> Including cuVS-Lucene support in *Solr 10* would highlight Solr’s adoption of
> first-class GPU acceleration, delivering significant performance improvements
> for vector search and positioning Solr as a leader in large-scale AI/ML
> search workloads.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]