cloning a NumericTermAttributeImpl

2015-01-08 Thread Shaun Senecal
It appears that NumericTermAttributeImpl doesn't support cloning. Is this simply an oversight, or is there a reason why this isnt possible? I'm trying to clone a State containing a NumericTermAttribute, and I end up with all of the BytesRefs being equal to the last instance, since the Numeric

Re: Lucene search/count performance abrupt degradation (MMapDirectory)

2015-01-08 Thread Erick Erickson
Thanks for closing this off. On Thu, Jan 8, 2015 at 7:21 AM, Piotr Idzikowski wrote: > We have detected the problem: the excessive(!) amount of memory allocated > to Java heap. This articles helped us find the issue: > > http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html > >

Re: Lucene search/count performance abrupt degradation (MMapDirectory)

2015-01-08 Thread Piotr Idzikowski
We have detected the problem: the excessive(!) amount of memory allocated to Java heap. This articles helped us find the issue: http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html Although the article suggests the minimum-possible memory approach as a mere recommendation, too

Re: Faceted Search Hierarchy

2015-01-08 Thread Jigar Shah
Yes, Agree, Updates in hierarchy is very common use-case. And seeing path components in different hierarchy is not accepted by customers. In some cases relating based on path components makes sense. Just out of interest, can it be ever done with Lucene, by some customization, or some attempts ma

Re: Faceted Search Hierarchy

2015-01-08 Thread Shai Erera
Not automatically. There's no reason to assume that 'India' is the same in 'India/Gujarat' and 'Asia/India'. Furthermore, if you first add a document with India/Gujarat and later add a document Asia/India, we cannot go back to the other document and update the hierarchy. On Thu, Jan 8, 2015 at 3:2

Re: Faceted Search Hierarchy

2015-01-08 Thread Jigar Shah
Is there some way to achieve this at Lucene level. so i can get facet like below ? Doc1: Asia + Asia/India Doc2: India + Asia/India/Gujarat Which can result into this: Asia/India/Gujarat (2) Can Lucene internally index like above, as 'India' value already exist as path of some other document ?

Filtering MoreLikeThis results

2015-01-08 Thread chrisbamford
Hi, I was wondering if Lucene supports applying a filter to an MLT search? I believe that Solr can do it, but I'm not sure if Lucene can .. A possible use case is "find me the 10 most similar documents to X created in the last month". Thanks - Chris --

Re: lucene scalability query

2015-01-08 Thread Toke Eskildsen
On Thu, 2015-01-08 at 12:03 +0100, sreedevi s wrote: >I am doing a scalability analysis for lucene search in my application.I > was running my junits with different sets of data which are like > 1K,10K,100K and 1000K. [...] Your table copy-paste did not work. I tried extracting the key data:

Re: Faceted Search Hierarchy

2015-01-08 Thread Shai Erera
Lucene does not understand the word "India", therefore the facets that are actually indexed are: Doc1: Asia + Asia/India Doc2: India + India/Gujarat When you ask for top children, you will get Asia + India, both with a count of 1. Shai On Thu, Jan 8, 2015 at 1:48 PM, Jigar Shah wrote: > Very

Faceted Search Hierarchy

2015-01-08 Thread Jigar Shah
Very simple question, on facet Index has 2 documents as follows: Doc1 Indexed facet path: Asia/India Doc2 Indexed facet path: India/Gujarat Now while faceted search facets.getTopChildren() Will it return 1(Asia) result or 2(Asia, India) ? So basically will it join values and return hierarchy

lucene scalability query

2015-01-08 Thread sreedevi s
Hi, I am doing a scalability analysis for lucene search in my application.I was running my junits with different sets of data which are like 1K,10K,100K and 1000K. I kept my indexes in memory and then tried accessing the same test multiple times.ie, I built my index when my test ran first time a