alessandrobenedetti commented on PR #2809:
URL: https://github.com/apache/solr/pull/2809#issuecomment-2461910357

   As I was progressing on the route of the FileStore, a thought came to my 
mind:
   With the current implementation the embedding model store guarantees a 
singleton life for each model uploaded, and what's  available is the object: 
org.apache.solr.llm.embedding.SolrEmbeddingModel.
   an object able to encode text to vector calling behind the scenes a REST 
API. 
   It's currently a lightweight object, but nothing prevents a future 
contributor to contribute the in-process model for example 
(https://docs.langchain4j.dev/integrations/embedding-models/in-process).
   
   The embeddingModelStore currently handles the instantiation part, so when 
you access a model from a query parser or an update request processor (next on 
my to-do list), you get the object, ready to be used from the store, with no 
need to instantiate the object again (that could be expensive).
   
   From what I'm seeing If I understood correctly, the FileStore will only 
store the configuration file for the model, so can easily access it but if we 
want the singleton mechanism for the model object we need to implement it 
somewhere, if not every time we use the query parser we need to instantiate the 
model, from the Json stored in the FileStore.
   
   So... I'm not convinced anymore that I should spend effort in that direction 
for this specific use case, using the fileStore and deleting the embedding 
model store was seducing, but if I need to implement an additional mechanism to 
handle models to be singleton on top of it, I don't see much benefit especially 
from time perspective.
   
   Please correct me if I missed something or if you believe the current 
implementation won't work in certain scenarios.
   I'm super happy to spend the effort to make a cleaner contribution, but if 
it's a lot of effort only for a "nice to have", I don't think I have that 
luxury right now.
   
   Don't take it as provocative, it's just a genuine perspective of someone 
with limited time to dedicate to the project, if I was paid full-time on this I 
wouldn't have any problem in pursuing nice to haves.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org
For additional commands, e-mail: issues-h...@solr.apache.org

Reply via email to