Re: Lucene sorting case-sensitive by default?

2008-01-15 Thread Antony Bowesman
Erick Erickson wrote: doc.add( new Field( "f", "This is Some Mixed, case Junk($*%& With Ugly SYmbols", Field.Store.YES, Field.Index.TOKENIZED)); pr

RE: Lucene sorting case-sensitive by default?

2008-01-15 Thread Toke Eskildsen
On Mon, 2008-01-14 at 10:58 -0500, Alex Wang wrote: > Toke, you mentioned "Using a Collator works but does take a fair amount > of memory", can you please elaborate a little more on that. Thanks. We have an index with 10 million records that takes up 37GB. Practically all records have a title, whi

RE: Lucene sorting case-sensitive by default?

2008-01-14 Thread Alex Wang
No problem Erick. Thanks for clarifying it. Alex -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Monday, January 14, 2008 12:35 PM To: java-user@lucene.apache.org Subject: Re: Lucene sorting case-sensitive by default? Sorry, I was confused about this for the

Re: Lucene sorting case-sensitive by default?

2008-01-14 Thread Erick Erickson
se, or take any action based on > this message or any information herein. If you have received this > message in error, please advise the sender immediately by reply e-mail > and delete this message. Thank you for your cooperation. > > > -----Original Message- > From: Erick E

RE: Lucene sorting case-sensitive by default?

2008-01-14 Thread Alex Wang
you for your cooperation. -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Monday, January 14, 2008 11:24 AM To: java-user@lucene.apache.org Subject: Re: Lucene sorting case-sensitive by default? Several things: 1> do you need to display all the fields? Wo

Re: Lucene sorting case-sensitive by default?

2008-01-14 Thread Erick Erickson
ate a little more on that. Thanks. > > Alex > > -Original Message- > From: Toke Eskildsen [mailto:[EMAIL PROTECTED] > Sent: Monday, January 14, 2008 3:13 AM > To: java-user@lucene.apache.org > Subject: Re: Lucene sorting case-sensitive by default? > > On Fri, 2008

RE: Lucene sorting case-sensitive by default?

2008-01-14 Thread Alex Wang
ache.org Subject: Re: Lucene sorting case-sensitive by default? On Fri, 2008-01-11 at 11:40 -0500, Alex Wang wrote: > Looks like Lucene is separating upper case and lower case while sorting. As Tom points out, default sorting uses natural order. It's worth noting that this implies that defa

Re: Lucene sorting case-sensitive by default?

2008-01-14 Thread Toke Eskildsen
On Fri, 2008-01-11 at 11:40 -0500, Alex Wang wrote: > Looks like Lucene is separating upper case and lower case while sorting. As Tom points out, default sorting uses natural order. It's worth noting that this implies that default sorting does not produce usable results as soon as you use non-ASCI

Re: Lucene sorting case-sensitive by default?

2008-01-11 Thread Erick Erickson
I've often stored a special sort field that's lower-cased. On Jan 11, 2008 11:40 AM, Alex Wang <[EMAIL PROTECTED]> wrote: > Hi All, > > > > I was searching my index with sorting on a field called "Label" which is > not tokenized, here is what came back: > > > > Extended Sites Catalog Asset Store

Re: Lucene sorting case-sensitive by default?

2008-01-11 Thread Tom Emerson
String fields are sorted using natural (lexicographic) order. For characters in ASCII range this means uppercase letters will sort before lowercase letters (e.g., 'A' U+0041 sorts before 'a' U+0061). This behaviour is documented on in the JavaDocs for org.apache.lucene.search.Sort. -tree On

Re: lucene sorting

2006-04-05 Thread Chris Hostetter
I don't know if there is anyway for a Custom Sort to access the lucene score -- but another approach that works very well is to use the FunctionQuery classes from Solr... http://incubator.apache.org/solr/docs/api/org/apache/solr/search/function/package-summary.html ...you can make a FunctionQuer