Re: java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-13 Thread Paul Libbrecht
Allow me to recommend a little trick to track the origin of a class which works often: org.apache.lucene.analysis.WhitespaceAnalyzer.class.getResource("WhitespaceAnalyzer.class") will give you a URL that should be the URL of the jar, followed by an exclamation mark, followed by the jar-inter

Re: java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-13 Thread Lahiru Samarakoon
Dear Uwe, I remove additional jar files and it is working. Thanks a lot for the guidance. Regards, Lahiru

RE: java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-13 Thread Uwe Schindler
Then it seems that you have a classpath mismatch. Can it be that you are mixing different Lucene versions (like the unreleased 3.x-Branch). StandardAnalyzer in 3.0.2 definitely does not use CharacterUtils (and also not in later versions). WhitespaceTokenizer in 3.x uses it. So you should review yo

Re: java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-13 Thread Lahiru Samarakoon
org.apache.lucene.analysis.standard.StandardAnalyzer On Tue, Dec 14, 2010 at 12:18 PM, Uwe Schindler wrote: > This class is Lucene 3.1+ only, currently not released. What package wants > to link with it? > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de >

Re: Forcing specific index file names

2010-12-13 Thread Chris Hostetter
: It is possible to always have Lucene end up with the : same set of index filenames for each index generation : process? this smells like an XY problem why do you car what the file names are? that's an implementtaion detail of lucene -- the directory as a whole is the index -- what are yo

RE: java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-13 Thread Uwe Schindler
This class is Lucene 3.1+ only, currently not released. What package wants to link with it? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Lahiru Samarakoon [mailto:lahir...@gmail.com] > Sent: Tuesday, D

java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-13 Thread Lahiru Samarakoon
Hi All, I am using *lucene-3.0.2* with *spring framework-3.0.5*. Spring context initialization failed due to a *java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils*. But that class can not be found in the jars shipped with *lucene-3.0.2*. I am stumped here. Please advice. Thank

Forcing specific index file names

2010-12-13 Thread Earl Hood
It is possible to always have Lucene end up with the same set of index filenames for each index generation process? I have an application that creates an index for a set of files, and generally, the index files created are the following: _0.cfs segments_2 segments.gen However, it appears somet

Re: The logic of QueryParser

2010-12-13 Thread Robert Muir
On Mon, Dec 13, 2010 at 3:16 PM, Yonik Seeley wrote: > On Mon, Dec 13, 2010 at 3:07 PM, Robert Muir wrote: >> On Mon, Dec 13, 2010 at 3:04 PM, Yonik Seeley >> wrote: >>> I think of the Lucene QueryParser like SQL. SQL is text based and also >>> meant for human entered text - but for either very

Re: The logic of QueryParser

2010-12-13 Thread Yonik Seeley
On Mon, Dec 13, 2010 at 3:07 PM, Robert Muir wrote: > On Mon, Dec 13, 2010 at 3:04 PM, Yonik Seeley > wrote: >> I think of the Lucene QueryParser like SQL. SQL is text based and also >> meant for human entered text - but for either very expert users, or >> programmatically created queries.  You n

Re: The logic of QueryParser

2010-12-13 Thread Robert Muir
On Mon, Dec 13, 2010 at 3:04 PM, Yonik Seeley wrote: > I think of the Lucene QueryParser like SQL. SQL is text based and also > meant for human entered text - but for either very expert users, or > programmatically created queries.  You normally don't want to pass > text from a search box directly

Re: The logic of QueryParser

2010-12-13 Thread Yonik Seeley
On Mon, Dec 13, 2010 at 2:51 PM, Robert Muir wrote: > On Mon, Dec 13, 2010 at 2:43 PM, Yonik Seeley > wrote: >> On Mon, Dec 13, 2010 at 2:10 PM, Brian Hurt wrote: >>>  I was just wondering what the logic was for defaulting to or instead of >>> and. >> >> Largely historical.  I think the origina

Re: The logic of QueryParser

2010-12-13 Thread Brian Hurt
On Mon, Dec 13, 2010 at 2:35 PM, Ahmet Arslan wrote: > > I have googled the mailing list archives and didn't find > > anything. But if > > this has been discussed to death, please just point me to > > the threads in the > > archive. rather than stirring up some old flame war. > > Or just tell me

Re: The logic of QueryParser

2010-12-13 Thread Robert Muir
On Mon, Dec 13, 2010 at 2:43 PM, Yonik Seeley wrote: > On Mon, Dec 13, 2010 at 2:10 PM, Brian Hurt wrote: >>  I was just wondering what the logic was for defaulting to or instead of and. > > Largely historical.  I think the original rational was that it > probably fit better with the traditional

Re: The logic of QueryParser

2010-12-13 Thread Yonik Seeley
On Mon, Dec 13, 2010 at 2:10 PM, Brian Hurt wrote: >  I was just wondering what the logic was for defaulting to or instead of and. Largely historical. I think the original rational was that it probably fit better with the traditional vector space model. There's also not a good reason to change t

Re: The logic of QueryParser

2010-12-13 Thread Robert Muir
On Mon, Dec 13, 2010 at 2:10 PM, Brian Hurt wrote: > I just encountered an unexpected behavior in query parser.  So, if you pass > in a query that is multiple terms, like "cat hat", the query that is > returned uses an or between the two term searches, instead of an and.  That > is, the query will

Re: The logic of QueryParser

2010-12-13 Thread Ahmet Arslan
> I have googled the mailing list archives and didn't find > anything.  But if > this has been discussed to death, please just point me to > the threads in the > archive. rather than stirring up some old flame war.  > Or just tell me what > to google for (the terms I've tried haven't yielded > anyt

The logic of QueryParser

2010-12-13 Thread Brian Hurt
I just encountered an unexpected behavior in query parser. So, if you pass in a query that is multiple terms, like "cat hat", the query that is returned uses an or between the two term searches, instead of an and. That is, the query will return all documents with the given field containing either

Re: Scale up design

2010-12-13 Thread Erick Erickson
Here's a great intro to the garbage collection options: http://www.lucidimagination.com/blog/2009/09/19/java-garbage-collection-boot-camp-draft/ @Ganesh: The issue with 64 bit isn't really performance, it's that you can't allocate much of your memory to the JVM. So by definition your performance w

Re: Scale up design

2010-12-13 Thread William Newport
I've used 30-35gb heaps and it is painful. Sent from my iPhone On Dec 13, 2010, at 2:16 AM, "Danil ŢORIN" wrote: > GC times on large heaps are pretty painfull right now (haven't tried > G1 collector, knowledgeable people : please advise) > > Also it's very dependent on your index and query patt

Re: Scale up design

2010-12-13 Thread Danil ŢORIN
GC times on large heaps are pretty painfull right now (haven't tried G1 collector, knowledgeable people : please advise) Also it's very dependent on your index and query pattern, so you could improve it by using some -XX magic. My recommendation is to scale horizontally (spit index into shards),