Learning to Rank algorithms in Lucene

2015-08-18 Thread Ajinkya Kale
Are there any existing packages/examples or prior experience on using Learning to Rank (or Machine Learned Ranking) algorithms as custom Scorer/Ranker for lucene or solr ? How do people deploy Learning to Rank models with Lucene backends ? --ajinkya

Question related to reranking and RankQuery

2015-09-17 Thread Ajinkya Kale
Hi all, I am new to Solr. I have a QParser plugin which uses an implementation of CustomScoreQuery to provide custom score for each document. Is there a way I can use the same plugin to provide score for top N documents after an initial query/sort ? I looked at the ReRankQParserPlugin but it looks

Re: Question related to reranking and RankQuery

2015-09-18 Thread Ajinkya Kale
w you > to directly control the ReRankScore for the top N documents. > > Writing your own RankQuery should be the last resort if you can't make a > Function Query do what you need to do. > > > > Joel Bernstein > http://joelsolr.blogspot.com/ > > On Thu, Sep

Re: Question related to reranking and RankQuery

2015-09-18 Thread Ajinkya Kale
ankQuery=$rqq > reRankDocs=100}&rqq={!func}myFunc() > > I'm not sure if there is a test case demonstrating this but it should work. > > Joel Bernstein > http://joelsolr.blogspot.com/ > > On Fri, Sep 18, 2015 at 2:42 PM, Ajinkya Kale > wrote: > > > Is the

Re: Question related to reranking and RankQuery

2015-09-18 Thread Ajinkya Kale
ankDocs=100}&rqq={!func}myFunc() > > I'm not sure if there is a test case demonstrating this but it should work. > > Joel Bernstein > http://joelsolr.blogspot.com/ > > On Fri, Sep 18, 2015 at 2:42 PM, Ajinkya Kale > wrote: > > > Is there a way I can issue

Solr java.lang.OutOfMemoryError: Java heap space

2015-09-28 Thread Ajinkya Kale
Hi, I am trying to retrieve all the documents from a solr index in a batched manner. I have 100M documents. I am retrieving them using the method proposed here https://nowontap.wordpress.com/2014/04/04/solr-exporting-an-index-to-an-external-file/ I am dumping 10M document splits in each file. I ge