Re: Re: (Gen)AI-Support for searching with Solr/Lucene

solr Tue, 30 Apr 2024 14:14:03 -0700

Thanks to Koji & Alessandro & Matthias & Charlie for your quickreactions to my relatively broad question.

I have listed your answers below so that you and I have a betteroverview.


I agree to Koji that we are discussing RAG.

Thanks to Matthias and Charlie for opening ways of learning anddiscussion.And special thanks to Alessandro for clearly describing real-worldproblems with implementing RAG.

Obviously (as we say in German) very thick boards have to be drilledhere.Currently I'm working on the topic (as a freelancer) out of my owninterest and discussing it with potential customers. Lets see how itgoes on.


Walter

-------------------------------------------------------------------------

By chance I found this paper, which perhaps might be interesting foryou:


https://arxiv.org/abs/2308.14963
Vector Search with OpenAI Embeddings: Lucene Is All You Need
Jimmy Lin, Ronak Pradeep, Tommaso Teofili, Jasper Xian

We provide a reproducible, end-to-end demonstration of vector searchwith OpenAI embeddings using Lucene on the popular MS MARCO passageranking test collection. The main goal of our work is to challenge theprevailing narrative that a dedicated vector store is necessary to takeadvantage of recent advances in deep neural networks as applied tosearch. Quite the contrary, we show that hierarchical navigablesmall-world network (HNSW) indexes in Lucene are adequate to providevector search capabilities in a standard bi-encoder architecture. Thissuggests that, from a simple cost-benefit analysis, there does notappear to be a compelling reason to introduce a dedicated vector storeinto a modern "AI stack" for search, since such applications havealready received substantial investments in existing, widely deployedinfrastructure.


-------------------------------------------------------------------------------
Koji Sekiguchi 25.04.2024 03:45

Hi Walter,
Isn't it an application commonly known as RAG
https://en.wikipedia.org/wiki/Prompt_engineering#Retrieval-augmented_generation
?
--
Koji
-------------------------------------------------------------------------------
Matthias Krüger 25.04.2024 09:35

Hallo Walter,

es gibt viele dieser Startups zur Zeit, wenn Du konkrete Fragen oderIdeen zu Retrieval-Augmented-Generation-Ansätzen und Architekturen mitSolr hast, können wir gern mal sprechen.

Viele Grüße
Matthias Krüger
OpenSource Connections
-------------------------------------------------------------------------------
Alessandro Benedetti 25.04.2024 14:55

Hi Walter,

We've been doing many AI integrations with Solr and we drafted a roadmapto get some funding to implement it directly in Solr:

https://sease.io/2023/10/apache-lucene-solr-ai-roadmap-do-you-want-to-make-it-happen.html

We made little progress so far but hopefully will attract some moreattention.In terms of what you saw, as Koji correctly mentioned, it's very likelyit was Retrieval Augmented Generation:Doing it is extra easy for a quick prototype, to just showcase somecherry picked magic.

But bringing it to production offers many challenges:

- the retrieval phase is still an open problem, you can do lexical(traditional keyword search), you can do vector or hybrid, but it's farfrom being easy.

You may need to chunk if you go with most embedding models.

- the LLM choice is quite challenging as well, you can go quickly withthe latest GPT-X but more likely you need some days to assess the bestcommercial/open solution for your use case and domain- the way you prompt the LLM is also challenging, depending if you justwant a generated answer, citations etc

We presented already some talks and tutorials around, we should havesomething recorded in our training section.

We'll also speak soon on these topics at upcoming conferences:
https://eu.communityovercode.org/schedule/
https://2024.berlinbuzzwords.de/sessions?id=7VSFFK
https://2024.berlinbuzzwords.de/sessions?id=NCPYUH

Hope it helps!
-------------------------------------------------------------------------------
Charlie Hull 25.04.2024 18:00

We've just had the Haystack conference here in Charlottesville with manytalks on RAG and AI on Lucene based engines (can't remember a particularone on Solr but a lot is transferrable). Check out www.haystackconf.com- the videos & slides of all the talks will be published over the nextfew weeks. You should also join Relevance Slack where there's a lot ofdiscussions on these subjects among the 5000 (!) members.

www.opensourceconnections.com/slack
Charlie

On Wed, 24 Apr 2024, 22:56 , <s...@cid.is> wrote:

Hi all,

is anybody already using AI to support searching with Solr/Lucene?
I just had an interesting demo from a german start-up.
I gave them plain text data, which I usually feed into Solr, and they
did some AI magic with these data, so that we could ask
human-language-questions and got human-language-answers.
This can take information retrieval to the next level.
But I'd prefer to have a common production line for this.
And I'm looking for software stacks which would allow standard
production procedures.

Thank you for *every* hint on this.
Walter Claassen

Re: Re: (Gen)AI-Support for searching with Solr/Lucene

Reply via email to