Re: German*Filter, Analyzer "cutting" off letters from (french) words...

2011-04-14 Thread Simon Willnauer
On Fri, Apr 15, 2011 at 8:48 AM, Clemens Wyss wrote:
> Does the StandardAnalyzer lowercase its terms?
yes!
simon
>
>> -Original Message-
>> From: Clemens Wyss [mailto:clemens...@mysign.ch]
>> Sent: Wednesday, 13 April 2011 13:34
>> To: java-user@lucene.apache.org
>> Subject: AW:

AW: German*Filter, Analyzer "cutting" off letters from (french) words...

2011-04-14 Thread Clemens Wyss
Does the StandardAnalyzer lowercase its terms?
> -Original Message-
> From: Clemens Wyss [mailto:clemens...@mysign.ch]
> Sent: Wednesday, 13 April 2011 13:34
> To: java-user@lucene.apache.org
> Subject: AW: German*Filter, Analyzer "cutting" off letters from (french) words...
>

best practice for reusing documents with multi-valued fields

2011-04-14 Thread Christopher Condit
I know that it's best practice to reuse the Document object when indexing, but I'm curious how multi-valued fields affect this. I tried this before indexing each document:
    doc.removeFields(myMultiValuedField);
    for (String fieldName : fieldNames) {
        Field field = doc.getField(fieldName);
        if (null != f

Re: Applying a sample data set to lucene

2011-04-14 Thread Erick Erickson
I would not go there first. There are examples out there to, for instance, index Wikipedia but that is, IMO, too complex for just starting to get your feet wet. I think you'd be better off looking at the Lucene demo code and trying to understand/modify that as a starting point, see: http://lucene.

Applying a sample data set to lucene

2011-04-14 Thread hari narayanan
Hi, I am new to Lucene and would like to see how it works by applying it to a sample data set. Can someone tell me where I can download a data set to test with Lucene? Thanks, Hari

Re: speed of CheckIndex

2011-04-14 Thread jm
Mostly the status of the indexes: whether there is some corruption or all is OK.
On Thu, Apr 14, 2011 at 9:20 PM, Simon Willnauer <simon.willna...@googlemail.com> wrote:
> what kind of diagnostics are you looking for?
>
> simon
>
> On Thu, Apr 14, 2011 at 9:14 PM, jm wrote:
> > Thanks Erick, but I

Re: speed of CheckIndex

2011-04-14 Thread Simon Willnauer
what kind of diagnostics are you looking for?
simon
On Thu, Apr 14, 2011 at 9:14 PM, jm wrote:
> Thanks Erick, but I guess what you refer to lives in Solr right? I am using
> plain Lucene.
>
> On Thu, Apr 14, 2011 at 7:33 PM, Erick Erickson wrote:
>
>> What information do you need? Could you

Re: speed of CheckIndex

2011-04-14 Thread jm
Thanks Erick, but I guess what you refer to lives in Solr, right? I am using plain Lucene.
On Thu, Apr 14, 2011 at 7:33 PM, Erick Erickson wrote:
> What information do you need? Could you just ping the stats component
> and parse the results (basically the info on the admin/stats page).
>
> Best
>

Re: speed of CheckIndex

2011-04-14 Thread Erick Erickson
What information do you need? Could you just ping the stats component and parse the results (basically the info on the admin/stats page)?
Best
Erick
On Thu, Apr 14, 2011 at 11:56 AM, jm wrote:
> Hi,
>
> I need to collect some diagnostic info from customer sites, so I would like
> to get info on

speed of CheckIndex

2011-04-14 Thread jm
Hi, I need to collect some diagnostic info from customer sites, so I would like to get info on the status of Lucene indexes, but I don't want the collection process to take very long. So I am considering CheckIndex. I tested it on a small index (60k docs) and it took 12 seconds. A site usually h

switching between Query parsers

2011-04-14 Thread shrinath.m
Consider this case: a Lucene index contains documents with these fields: title, author, publisher. I have coded my app to use MultiFieldQueryParser so that it queries all fields. Now if a user types something like "author:tom" in the search box, how do I make it search only that field?
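One way to picture the question is as a routing decision: if the user's input starts with the name of a known field followed by a colon, search only that field; otherwise search all fields. The sketch below is plain Java with no Lucene dependency, and the names FieldPrefixRouter and explicitField are hypothetical, made up for illustration. It only shows the detection step, not the query construction. As far as I recall, MultiFieldQueryParser already honors an explicit field prefix such as author:tom in the query syntax, so manual routing like this may well be unnecessary; that is worth checking against the Lucene version in use.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Hypothetical helper, not part of Lucene: decides whether user input
// names one of the indexed fields explicitly ("author:tom").
public class FieldPrefixRouter {
    private final Set<String> knownFields;

    public FieldPrefixRouter(String... fields) {
        this.knownFields = new HashSet<>(Arrays.asList(fields));
    }

    // Returns the field name if the input starts with "<knownField>:",
    // or null if the input should be searched across all fields.
    public String explicitField(String userInput) {
        int colon = userInput.indexOf(':');
        if (colon <= 0) {
            return null; // no colon, or nothing before it
        }
        String candidate = userInput.substring(0, colon).trim();
        return knownFields.contains(candidate) ? candidate : null;
    }

    public static void main(String[] args) {
        FieldPrefixRouter router = new FieldPrefixRouter("title", "author", "publisher");
        for (String input : new String[] {"author:tom", "tom", "foo:bar"}) {
            String field = router.explicitField(input);
            System.out.println(input + " -> " + (field == null ? "all fields" : "only " + field));
        }
    }
}
```

Unknown prefixes such as foo:bar fall back to the all-fields path here; whether that is the right behavior, or whether the colon should be escaped or rejected, depends on the application.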

NullPointerException in FieldSortedHitQueue

2011-04-14 Thread Antony Bowesman
Upgrading from 2.3.2 to 2.9.4 I get an NPE as below:
Caused by: java.lang.NullPointerException
    at org.apache.lucene.search.FieldSortedHitQueue$1.createValue(FieldSortedHitQueue.java:224)
    at org.apache.lucene.search.FieldCacheImpl$Cache.get(FieldCacheImpl.java:224)
    at org.apache.lucene.s

Index time boost question

2011-04-14 Thread Antony Bowesman
I have a test case written for 2.3.2 that tested an index time boost on a field of 0.0F and then did a search using Hits and got 0 results. I'm now in the process of upgrading to 2.9.4 and am removing all use of Hits in my test cases and using a Collector instead. Now the test case fails as it