Re: dash-words

2006-07-23 Thread Yonik Seeley
On 7/23/06, karl wettin <[EMAIL PROTECTED]> wrote: I want to filter words with a dash in them. ["x-men"] ["xmen"] ["x", "men"] All of the above should be synonyms. The problem is ["x", "men"] requiring a distance between the terms and thus also matching "x-men men". WordDelimiterFilter from Sol

Re: Span Query NLE

2006-07-23 Thread karl wettin
On Sun, 2006-07-23 at 21:31 -0500, Charlie wrote: > Would anyone give me a hint regarding the natural language expression > of the following span query? > spanNear([spanOr([spanNear([field:six, field:hundred], 0, true), > spanNear([field:seven, field:hundred], 0, true)]), > spanOr([field:seven, fi

RE: Special characher & ; : % index/search question

2006-07-23 Thread karl wettin
On Sun, 2006-07-23 at 21:24 -0500, Herbert Wu wrote: > WhitespaceAnalyzer looks brutal. Is it possible to keep > StandardAnalyzer and at the same time tell the parser to keep a > list of chars during indexing? Add something like: | < #MYCHARACTERS: ("&" | ":" | "%" | ";") > to t

Span Query NLE

2006-07-23 Thread Charlie
Would anyone give me a hint regarding the natural language expression of the following span query? if creating queries programmatically (it is in Lucene src) SpanTermQuery t1 = new SpanTermQuery(new Term("field","six")); SpanTermQuery t2 = new SpanTermQuery(new Term("field","h

RE: Special characher & ; : % index/search question

2006-07-23 Thread Herbert Wu
WhitespaceAnalyzer looks brutal. Is it possible to keep StandardAnalyzer and at the same time tell the parser to keep a list of chars during indexing? -Herbert -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Sunday, July 23, 2006 10:56 AM To: java-user@luc

Re: How reliable is lucene indexing !!

2006-07-23 Thread karl wettin
On Sun, 2006-07-23 at 14:44 -0700, vasu shah wrote: > I have a few doubts > The index size will approximately increase by 4000 records per > day. Is Lucene good for the application? Sure. > Is it suitable for frequent inserts/updates? Sure, but I don't consider 4000 new documents per day to be

dash-words

2006-07-23 Thread karl wettin
I want to filter words with a dash in them. ["x-men"] ["xmen"] ["x", "men"] All of the above should be synonyms. The problem is ["x", "men"] requiring a distance between the terms and thus also matching "x-men men". Or? How about storing ["x", "men"] as the first term and the use set a negative pos
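
For readers skimming the thread, the three token forms listed above can be sketched in plain Java. This is only an illustration of which forms the post treats as synonyms. The class and method names (DashWordVariants, variants) are hypothetical, no Lucene API is used, and the sketch does not address the term-position problem the post raises.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not part of Lucene: enumerates the token forms
// that the post wants treated as synonyms for a dashed word.
class DashWordVariants {

    /**
     * Returns the original word, the joined form, and the split parts.
     * A word without a dash yields only itself.
     */
    static List<List<String>> variants(String word) {
        List<List<String>> forms = new ArrayList<>();
        forms.add(Collections.singletonList(word));
        if (!word.contains("-")) {
            return forms;
        }
        // Split on the dash and drop empty pieces (e.g. a leading dash).
        List<String> parts = new ArrayList<>();
        for (String piece : word.split("-")) {
            if (!piece.isEmpty()) {
                parts.add(piece);
            }
        }
        forms.add(Collections.singletonList(String.join("", parts)));
        forms.add(parts);
        return forms;
    }

    public static void main(String[] args) {
        // prints [[x-men], [xmen], [x, men]]
        System.out.println(variants("x-men"));
    }
}
```

How these forms are placed at index positions, which is the actual question in the post, is left open here.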

How reliable is lucene indexing !!

2006-07-23 Thread vasu shah
Hello Everyone, We have an application and the current search is taking a lot of time to return the results. We are doing a search against 8-9 database tables and 1.5 million records. I want to increase the search speed and am thinking of implementing Lucene search. I went through the documentation

How to use a different score model?

2006-07-23 Thread Marius Heinzmann
Hi everybody, I'm currently trying to figure out how to implement another score model in the Lucene framework. But after reading a lot of source code and a little debugging it seems like I have to write a lot of score classes, one for each query type. And finally I need to get Lucene to use my

Re: Special characher & ; : % index/search question

2006-07-23 Thread Erick Erickson
The WhitespaceAnalyzer breaks up streams on whitespace, and will give you these characters as tokens. Be careful to use it for indexing AND searching. Also, make sure that's the analyzer in Luke if you submit queries that way (it's a drop-down on the search page, upper right as I remember). On 7/
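
The whitespace-splitting behaviour described above can be approximated in plain Java. This is a minimal sketch under the assumption that tokens are simply maximal runs of non-whitespace characters. The class name WhitespaceSplitSketch is hypothetical and this is not Lucene's implementation. It shows why characters such as & ; : % survive as tokens when they stand alone between spaces.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration, not Lucene code: splits text on runs of
// whitespace and leaves every other character untouched.
class WhitespaceSplitSketch {

    static List<String> tokenize(String text) {
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return new ArrayList<>();
        }
        // "\\s+" matches one or more whitespace characters.
        return new ArrayList<>(Arrays.asList(trimmed.split("\\s+")));
    }

    public static void main(String[] args) {
        // Punctuation surrounded by spaces comes back as its own token.
        System.out.println(tokenize("AT&T & Co ; 50 %"));
    }
}
```

The same splitting has to be applied to the query text, which matches the advice above to use one analyzer for indexing and searching.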

Re: data gathering

2006-07-23 Thread Erick Erickson
Both of these topics have been discussed at length in this forum. I think you would be well served by searching the archive for both of these topics, and you'd get your answer faster. NFS mounts have problems that have been discussed recently. There is nothing built into Lucene that indexes