Thanks to Koji & Alessandro & Matthias & Charlie for your quick reactions to my relatively broad question.

I have listed your answers below so that you and I have a better overview.

I agree to Koji that we are discussing RAG.
Thanks to Matthias and Charlie for opening ways of learning and discussion. And special thanks to Alessandro for clearly describing real-world problems with implementing RAG.

Obviously (as we say in German) very thick boards have to be drilled here. Currently I'm working on the topic (as a freelancer) out of my own interest and discussing it with potential customers. Lets see how it goes on.

Walter

-------------------------------------------------------------------------
By chance I found this paper, which perhaps might be interesting for you:

https://arxiv.org/abs/2308.14963
Vector Search with OpenAI Embeddings: Lucene Is All You Need
Jimmy Lin, Ronak Pradeep, Tommaso Teofili, Jasper Xian
We provide a reproducible, end-to-end demonstration of vector search with OpenAI embeddings using Lucene on the popular MS MARCO passage ranking test collection. The main goal of our work is to challenge the prevailing narrative that a dedicated vector store is necessary to take advantage of recent advances in deep neural networks as applied to search. Quite the contrary, we show that hierarchical navigable small-world network (HNSW) indexes in Lucene are adequate to provide vector search capabilities in a standard bi-encoder architecture. This suggests that, from a simple cost-benefit analysis, there does not appear to be a compelling reason to introduce a dedicated vector store into a modern "AI stack" for search, since such applications have already received substantial investments in existing, widely deployed infrastructure.

-------------------------------------------------------------------------------
Koji Sekiguchi 25.04.2024 03:45

Hi Walter,
Isn't it an application commonly known as RAG
https://en.wikipedia.org/wiki/Prompt_engineering#Retrieval-augmented_generation
?
--
Koji
-------------------------------------------------------------------------------
Matthias Krüger 25.04.2024 09:35

Hallo Walter,
es gibt viele dieser Startups zur Zeit, wenn Du konkrete Fragen oder Ideen zu Retrieval-Augmented-Generation-Ansätzen und Architekturen mit Solr hast, können wir gern mal sprechen.
Viele Grüße
Matthias Krüger
OpenSource Connections
-------------------------------------------------------------------------------
Alessandro Benedetti 25.04.2024 14:55

Hi Walter,
We've been doing many AI integrations with Solr and we drafted a roadmap to get some funding to implement it directly in Solr:
https://sease.io/2023/10/apache-lucene-solr-ai-roadmap-do-you-want-to-make-it-happen.html
We made little progress so far but hopefully will attract some more attention. In terms of what you saw, as Koji correctly mentioned, it's very likely it was Retrieval Augmented Generation: Doing it is extra easy for a quick prototype, to just showcase some cherry picked magic.
But bringing it to production offers many challenges:
- the retrieval phase is still an open problem, you can do lexical (traditional keyword search), you can do vector or hybrid, but it's far from being easy.
You may need to chunk if you go with most embedding models.
- the LLM choice is quite challenging as well, you can go quickly with the latest GPT-X but more likely you need some days to assess the best commercial/open solution for your use case and domain - the way you prompt the LLM is also challenging, depending if you just want a generated answer, citations etc

We presented already some talks and tutorials around, we should have something recorded in our training section.
We'll also speak soon on these topics at upcoming conferences:
https://eu.communityovercode.org/schedule/
https://2024.berlinbuzzwords.de/sessions?id=7VSFFK
https://2024.berlinbuzzwords.de/sessions?id=NCPYUH

Hope it helps!
-------------------------------------------------------------------------------
Charlie Hull 25.04.2024 18:00

We've just had the Haystack conference here in Charlottesville with many talks on RAG and AI on Lucene based engines (can't remember a particular one on Solr but a lot is transferrable). Check out www.haystackconf.com - the videos & slides of all the talks will be published over the next few weeks. You should also join Relevance Slack where there's a lot of discussions on these subjects among the 5000 (!) members.
www.opensourceconnections.com/slack
Charlie

On Wed, 24 Apr 2024, 22:56 , <s...@cid.is> wrote:

Hi all,

is anybody already using AI to support searching with Solr/Lucene?
I just had an interesting demo from a german start-up.
I gave them plain text data, which I usually feed into Solr, and they
did some AI magic with these data, so that we could ask
human-language-questions and got human-language-answers.
This can take information retrieval to the next level.
But I'd prefer to have a common production line for this.
And I'm looking for software stacks which would allow standard
production procedures.

Thank you for *every* hint on this.
Walter Claassen

Reply via email to