Re: Accumulating facets over a MultiReader

2013-07-01 Thread Shai Erera
Hi, I assume that you use a single TaxonomyReader instance? It must be the same for both indexes, that is, both indexes must share the same taxonomy index, or otherwise their ordinals would not match as well as you may hit such exceptions since one index may have bigger ordinals than what the taxo

Accumulating facets over a MultiReader

2013-07-01 Thread Peng Gao
How do I accumulate counts over a MultiReader (2 IndexReader)? The following code causes an IOException: ArrayList facetRequests = new ArrayList(); for (String groupField : groupFields) facetRequests.add(new CountFacetRequest(new CategoryPath(groupField, '/'), 1)); Face

Re: In memory index (current status in Lucene)

2013-07-01 Thread Steven Schlansker
On Jul 1, 2013, at 2:41 PM, Lance Norskog wrote: > My current open source project is a Directory that is just like RAMDirectory, > but everything is memory-mapped. The idea is it creates a disk file, opens > it, and immediately deletes the file. The file still exists until the > IndexReader/W

Re: In memory index (current status in Lucene)

2013-07-01 Thread Lance Norskog
My current open source project is a Directory that is just like RAMDirectory, but everything is memory-mapped. The idea is it creates a disk file, opens it, and immediately deletes the file. The file still exists until the IndexReader/Writer/Searcher closes it. But, it cannot be found from the

RE: Questions about doing a full text search with numeric values

2013-07-01 Thread Todd Hunt
Erick, Sorry to be dense, but I'm not following this statement: " Pesky users. But it's a different problem, albiet one that we all struggle with. Just processing a single stream and asking "does this letter sequence occur" may not even tokenize the input, just find the character sequence." Ar

RE: How to Perform a Full Text Search on a Number with Leading Zeros or Decimals?

2013-07-01 Thread Todd Hunt
Uwe, Thank you. PatternReplaceFilter is not available in 3.0.3, which is the version of Lucene we are running now. Once I get our environment upgraded to 4.x, then I will give that a try. Thank you for your help. -Original Message- From: Uwe Schindler [mailto:u...@thetaphi.de] Sent:

Re: In memory index (current status in Lucene)

2013-07-01 Thread Emmanuel Espina
Hi Erick! Nice to hear from you again! From time to time my interest in these "Lucene things" returns and I do some experiments :p Just to add to this conversation, I found an interesting link to Mike's blog about memory resident indexes (using another virtual machine) http://blog.mikemccandless.

Re: Relevance ranking calculation based on filtered document count

2013-07-01 Thread Nigel V Thomas
Jack, To demonstrate this, I had set up a simple test, measuring the impact of search quality (Mean Average Precision) against % of an index filtered. The test was setup by taking the TREC 2004 collection, breaking that up into several smaller chunks of increasing size, then measured the MAP of th

Re: highlighting component to searchComponent

2013-07-01 Thread Jack Krupansky
Try asking your question on the “Solr user” email list – this is the Lucene user list! -- Jack Krupansky From: Adrien RUFFIE Sent: Monday, July 01, 2013 4:36 AM To: java-user@lucene.apache.org Subject: highlighting component to searchComponent Hello all I had the following configuration in my

Re: Relevance ranking calculation based on filtered document count

2013-07-01 Thread Jack Krupansky
The very definition of a "filter" in Lucene is that it doesn't influence relevance/scoring in any way, so your question is a contradiction in terms. If you are finding that the use of a filter is affecting the scores of documents, then that is clearly a bug. -- Jack Krupansky -Original M

Relevance ranking calculation based on filtered document count

2013-07-01 Thread Nigel V Thomas
Hi, I would like to know if it is possible to calculate the relevance ranks of documents based on filtered document count? The current filter implementations as far as I know, seems to be applied after the query is processed and ranked against the full set of documents. Since system wide IDF value

Re: In memory index (current status in Lucene)

2013-07-01 Thread Erick Erickson
Hey Emma! It's been a while Building on what Steven said, here's Uwe's blog on MMapDirectory and Lucene: http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html I've always considered RAMDirectory for rather restricted use-cases. I.e. if I know without doubt that the index is

Re: Questions about doing a full text search with numeric values

2013-07-01 Thread Erick Erickson
WordDelimiterFilter(Factory if you're experimenting with Solr as Jack suggests) will fix a number of your cases since it splits on case change and numeric/alpha changes. There are a bunch of ways to recombine things so be aware that it'll take some fiddling with the parameters. As Jack suggests, us

highlighting component to searchComponent

2013-07-01 Thread Adrien RUFFIE
Hello all I had the following configuration in my solrconfig.xml : But when I start my webapp the following message appears: WARNING: Deprecated syntax found. should move to So I have tried to convert my highlighting to searchComponent with following configuration: