Setting Similarity classes in Benchmark .alg scripts

2013-02-04 Thread Michael O'Leary
I'd like to compare the relevance scores that are returned when using the Similarity classes that are available in Lucene 4.x, and it seems like using the Benchmark component would be a good way to do that. It looks like there isn't currently a way to specify a Similarity class to use in the config s

Re: Lucene vs Glimpse

2013-02-04 Thread Jack Krupansky
Generally, all of your example queries should work fine with Lucene, provided that you carefully choose your analyzer, or even use the StandardAnalyzer. The special characters like underscore and dot generally get treated as spaces and the resulting sequence of terms would match as a phrase. It

Lucene vs Glimpse

2013-02-04 Thread Mathias Dahl
Hi, I have hacked together a small web front end to the Glimpse text indexing engine (see http://webglimpse.net/ for information). I am very happy with how Glimpse indexes and searches data. If I understand it correctly it uses a combination of an index and searching directly in the files themselv

Re: Jar packaging issue

2013-02-04 Thread Nick Burch
On Mon, 4 Feb 2013, karl.wri...@nokia.com wrote: We recently ran into something people might not be fully aware of. Specifically, because codec jars require META-INF/services files in order to be discovered, and each codec has the same files, it's not a straightforward operation to glom all the

Jar packaging issue

2013-02-04 Thread karl.wright
Hello anyone, We recently ran into something people might not be fully aware of. Specifically, because codec jars require META-INF/services files in order to be discovered, and each codec has the same files, it's not a straightforward operation to glom all the Lucene jars of interest into one
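
To illustrate the packaging problem described above: when several jars each carry a META-INF/services file with the same name, copying them into a single jar keeps only one copy, so the other providers are no longer discoverable. A combined jar needs the union of the provider lines instead. The following is only a minimal sketch of that merge step using plain Java. The class name ServiceFileMerger and the org.example.* provider names are hypothetical and are not taken from the thread or from Lucene. Build tools offer comparable behaviour (for example the Maven Shade plugin's ServicesResourceTransformer), and whether that fits a given build is an assumption to verify.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper, for illustration only.
public class ServiceFileMerger {

    /**
     * Merges the lines of several same-named META-INF/services files into
     * one list of provider class names, dropping comments, blank lines and
     * duplicates while preserving first-seen order.
     */
    public static List<String> merge(List<List<String>> serviceFiles) {
        Set<String> providers = new LinkedHashSet<>();
        for (List<String> lines : serviceFiles) {
            for (String line : lines) {
                // In the service-provider file format, '#' starts a comment.
                int hash = line.indexOf('#');
                String name = (hash >= 0 ? line.substring(0, hash) : line).trim();
                if (!name.isEmpty()) {
                    providers.add(name);
                }
            }
        }
        return new ArrayList<>(providers);
    }

    public static void main(String[] args) {
        // Contents of the same service file as found in two different jars
        // (provider names are made up).
        List<String> fromFirstJar = Arrays.asList(
                "# providers shipped in the first jar",
                "org.example.codecs.CodecA");
        List<String> fromSecondJar = Arrays.asList(
                "org.example.codecs.CodecB",
                "org.example.codecs.CodecA");

        // Print the merged provider list, one class name per line.
        for (String provider : merge(Arrays.asList(fromFirstJar, fromSecondJar))) {
            System.out.println(provider);
        }
    }
}
```

Writing the merged lines back under the original service file name in the combined jar is what lets ServiceLoader-style discovery see every provider again.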