Identifying the most relevant document

2014-04-29 Thread Vishnu
I am trying to solve the following search problem. Say we have 10 different documents d1..d10 Each document contains a type of data say, d1 -> list of movie names, d2 -> list of actor names, d3 -> list of addresses etc. Each document contains list of entities and scores. So d1 contains movie names

Extract Query terms not by reader, but by using documents

2013-11-26 Thread Vishnu Mishra
Hi all, Is this possible to extract Query Terms for particular Document. The existing method extractTerms(Set term) expand the query search over all the documents (IndexReader) which sometime takes long time to execute, even I am Interested only in one Document. This case is much critical wh

Preserve Original Option In Stemming (EnglishMinimalStemFilterFactory).

2015-08-25 Thread Vishnu Mishra
Hi, I was working with Lucene 5.2 and trying to index some document. I am using EnglishMinimalStemFilterFactory and I found that there is no option for keeping the original text as wel as analyzed term into lucene index. WordDelimiterFilterFactory provides preserveOriginal option to do this. Can

SolrIndexSearcher throws Misleading Error Message When timeAllowed is Specified.

2016-01-08 Thread Vishnu Mishra
I am using Solr 5.3.1 and we are facing OutOfMemory exception while doing some complex wildcard and proximity query (even for simple wildcard query). We are doing distributed solr search using shard across 20 cores. The problem description is given below. For example simple query like *q=Tile:(