Hi Jack,

Surely an interesting feature for portals and the like. While there exists 
separate federation frameworks which also work with Solr, I think a more 
light-weight approach through Solr's existing API is attractive to many, 
avoiding yet another layer with new query language etc.

I think a good way to work with the Solr community on this is to create some 
JIRA issues, one mother issue and sub-issues for each component, then 
communicate frequently in those and here on the list as you get started, to get 
early feedback - as opposed to dumping a final solution in one go.

--
Jan Høydahl, search solution architect
Cominvent AS - www.facebook.com/Cominvent
Solr Training - www.solrtraining.com

On 10. mai 2012, at 18:31, Jacek Plebanek wrote:

> Hello,
> 
> I'm starting to work on federated search algorithms for my PhD study.
> I'll use Solr to implement them (Since I have two years experience with
> Solr at my work).
> 
> I thought that at least part of my work could be useful for Solr Project
> and I could contribute some code. I mean specifically the
> components/modifications to add federated search support to Solr.
> 
> By "Federated Search" I mean searching across heterogeneous data sources
> (something different than existing Distributed Search implemented in
> Solr) - to allow Solr to merge results not only from SolrServer
> instances, but also to include results from external sources (eg. search
> engines using different API). The use case would look like this:
> - user sends the request to Solr (eg. SearchRequest)
> - Solr handles the request internally and/or sends it to other Solr
> instances (current Distributed Search) AND sends it to specified
> external data sources using dedicated adapters.
> - Solr merges the results from Solr instances with results from external
> collections and returns the combined results to user.
> 
> To perform this scenario the four common federated search parts should
> be supported:
> - collection representation (external collections probably won't provide
> the same informations as Solr, like tf-idf)
> - collection selection (predict which collections may return relevant
> results and transfer the search request only to them)
> - result merging (merge results based on more limited informations than
> Solr provides)
> - external sources connection (common API to write custom collections
> adapters)
> 
> I thought I would write some federated search components - schema to
> allow developers to implement custom algorithms/plugins for each part of
> federated search scenario.
> 
> 
> What do You think about that?
> 
> 
> Sorry for my English :)
> 
> Jacek Plebanek
> 
> Interdisciplinary Centre for Mathematical and Computational Modelling
> University of Warsaw, Poland
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: dev-h...@lucene.apache.org
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to