Re: JVM Crash in Lucene

2006-02-07 Thread Daniel Pfeifer
Got the same problem. Running 1.5.0_05 on Solaris 10. I've seen that this issue has been reported on Sun's forum, but there is no answer yet. Another interesting thing I noticed: we previously used the RAMDirectory and never got JVM crashes with it. However, once we started using FS

Re: Reindexing

2006-02-07 Thread Raul Raja Martinez
Hi Eric, I'm in the same situation. I wouldn't normally ask something related to Hibernate here, but I posted something similar in the Hibernate forums on Jan 16th and still haven't got any response. http://forum.hibernate.org/viewtopic.php?t=954137&highlight=lucene It is really obvious that if

Re: How can you simulate inOrder in boolean queries

2006-02-07 Thread Erik Hatcher
The source code to Lucene in Action (www.lucenebook.com) has an example of extending the built-in QueryParser to use SpanNearQuery for in-order phrase searching instead of the default unordered PhraseQuery behavior. That example will show how to use SpanNearQuery made up of SpanTermQuery's

Re: Setting Performance Thresholds

2006-02-07 Thread Erik Hatcher
This should be possible using a HitCollector with the IndexSearcher.search() method, and throwing an exception from the HitCollector when the time limit has expired. Erik On Feb 7, 2006, at 5:55 PM, RONALD MANTAY wrote: Hi, I am working on a search application using lucene
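
The approach suggested above can be sketched roughly as follows. This is a minimal, library-free illustration rather than actual Lucene code: the `Collector` interface and `fakeSearch` method are hypothetical stand-ins for Lucene's HitCollector and IndexSearcher.search(), and `LimitExceededException`, `LimitedCollector`, and `searchWithLimits` are names invented for this example. It assumes the exception must be unchecked so that it can escape a callback that declares no checked exceptions.

```java
import java.util.ArrayList;
import java.util.List;

class TimeLimitedCollectSketch {

    /** Hypothetical stand-in for a per-hit callback such as Lucene's HitCollector. */
    interface Collector {
        void collect(int doc, float score);
    }

    /** Unchecked exception used to abort the search from inside the callback. */
    static class LimitExceededException extends RuntimeException {
        LimitExceededException(String message) {
            super(message);
        }
    }

    /** Keeps doc ids until either the deadline passes or maxDocs have been stored. */
    static class LimitedCollector implements Collector {
        final List<Integer> docs = new ArrayList<>();
        private final long deadlineNanos;
        private final int maxDocs;

        LimitedCollector(long timeoutMillis, int maxDocs) {
            this.deadlineNanos = System.nanoTime() + timeoutMillis * 1_000_000L;
            this.maxDocs = maxDocs;
        }

        @Override
        public void collect(int doc, float score) {
            // Threshold (a): processing time.
            if (System.nanoTime() - deadlineNanos > 0) {
                throw new LimitExceededException("time limit reached");
            }
            // Threshold (b): number of documents found.
            if (docs.size() >= maxDocs) {
                throw new LimitExceededException("document limit reached");
            }
            docs.add(doc);
        }
    }

    /** Hypothetical stand-in for the search call: reports totalHits fake hits. */
    static void fakeSearch(int totalHits, Collector collector) {
        for (int doc = 0; doc < totalHits; doc++) {
            collector.collect(doc, 1.0f);
        }
    }

    /** Runs the fake search and returns whatever was collected before a limit hit. */
    static List<Integer> searchWithLimits(int totalHits, long timeoutMillis, int maxDocs) {
        LimitedCollector collector = new LimitedCollector(timeoutMillis, maxDocs);
        try {
            fakeSearch(totalHits, collector);
        } catch (LimitExceededException e) {
            // A real caller would probably report e.getMessage() alongside the partial hits.
        }
        return collector.docs;
    }

    public static void main(String[] args) {
        List<Integer> partial = searchWithLimits(1000, 60_000, 10);
        System.out.println("collected " + partial.size() + " hits before stopping");
    }
}
```

The hits gathered so far live in the collector itself, so they survive the exception and can be returned together with the reason the search stopped.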

Re: How to get mapping of query terms to number of their occurrences in a doc?

2006-02-07 Thread Chris Hostetter
A cursory reading of your code looks ok ... stemming shouldn't be an issue as long as your measure of success is comparing docs that match your original query with the counts you get out. What I mean by that is that any stemming should have already been taken care of when your query object was c

Re: Reducing Inflated Similarity Scores

2006-02-07 Thread Chris Hostetter
: Ok, I'm quite new to lucene so I don't really know how the Default : Similarity works but from what I gather it is a variation of the : cos-similarity. And the cos-measure penalizes extraneous terms : therefore, how can the score be 1.0? If you are using the Hits API then the score you are seein

Re: Queries not derived from the text index

2006-02-07 Thread Daniel Noll
Erik Hatcher wrote: On Feb 7, 2006, at 1:09 AM, Daniel Noll wrote: I've got an unusual (if not crazy) question about implementing custom queries. Basically we have a UI where a user can enter a query and then select a bunch of filters to be applied to the query. These filters are currently

Setting Performance Thresholds

2006-02-07 Thread RONALD MANTAY
Hi, I am working on a search application using Lucene where I would like the application to stop searching and return the hits collected to date and an appropriate exception, when reaching either a threshold of a) xx seconds of processing time, or b) on finding xxx documents

Re: How can you simulate inOrder in boolean queries

2006-02-07 Thread Paul Elschot
On Tuesday 07 February 2006 22:06, RONALD MANTAY wrote: > Hi, can anyone help, I am new to lucene. > > Due to requiring a specific Query syntax, I have created a bespoke query parser to build a boolean query which at any time may comprise a mixture of: > > a) Fuzzy terms with

How can you simulate inOrder in boolean queries

2006-02-07 Thread RONALD MANTAY
Hi, can anyone help, I am new to lucene. Due to requiring a specific Query syntax, I have created a bespoke query parser to build a boolean query which at any time may comprise a mixture of: a) Fuzzy terms with similarity level set according to length of term.text b) Wild C

RE: Does anybody here do some efforts about RSS/Blog search?

2006-02-07 Thread Byron Miller
So is blogdigger and a few others :) --- "Mordo, Aviran (EXP N-NANNATEK)" <[EMAIL PROTECTED]> wrote: > Technorati is based on lucene > > -Original Message- > From: 盖世豪侠 [mailto:[EMAIL PROTECTED] > Sent: Tuesday, February 07, 2006 11:40 AM > To: java-user@lucene.apache.org > Sub

RE: Does anybody here do some efforts about RSS/Blog search?

2006-02-07 Thread Mordo, Aviran (EXP N-NANNATEK)
Technorati is based on lucene -Original Message- From: 盖世豪侠 [mailto:[EMAIL PROTECTED] Sent: Tuesday, February 07, 2006 11:40 AM To: java-user@lucene.apache.org Subject: Does anybody here do some efforts about RSS/Blog search? I'm interested in this topic. See if we can exchange some ide

Statistically Improbable Phrases

2006-02-07 Thread Lawrence Ogrodnek
Hey all, I was looking around last night and found a thread from last July on this list concerning SIPs. I was wondering if any code has been made available, or if anyone else has had success (and their solutions) using Lucene for generating these. thanks, l

Does anybody here do some efforts about RSS/Blog search?

2006-02-07 Thread 盖世豪侠
I'm interested in this topic. See if we can exchange some ideas.

Re: Search on Keyword rather than Text?

2006-02-07 Thread Erik Hatcher
On Feb 7, 2006, at 5:47 AM, Simon Porritt wrote: Field.Text("index", "New York") ...which of course means that the string is tokenized before being indexed. Running this search: new yo which translates to index:new index:yo returns New York amongst the results, as well as any cities s

Re: Queries not derived from the text index

2006-02-07 Thread Erik Hatcher
On Feb 7, 2006, at 1:09 AM, Daniel Noll wrote: I've got an unusual (if not crazy) question about implementing custom queries. Basically we have a UI where a user can enter a query and then select a bunch of filters to be applied to the query. These filters are currently implemented using

Re: Reindexing

2006-02-07 Thread Amol Bhutada
Hi Revati, If I am not wrong, you must be using a Hibernate interceptor to get the events on value objects. You will get all lifecycle events in the class, so you should handle Lucene index addition/update/deletion there. But I had faced a locking problem with Hibernate as multiple writers get opened

Re: Queries not derived from the text index

2006-02-07 Thread Michael D. Curtin
Daniel Noll wrote: Is it possible to customise the QueryParser so that it returns Query instances that have no relationship to the text index whatsoever? The syntax that Lucene's QueryParser supports isn't very complicated. I'm sure you could write your own parser from scratch, perhaps with s

Re: Reindexing

2006-02-07 Thread Erik Hatcher
You will likely get a better response by posting to the Hibernate list. Erik On Feb 7, 2006, at 7:58 AM, revati joshi wrote: Hello lucene members, I'm a silent member of this group. Last week I sent a query regarding reindexing, but I didn't

Reindexing

2006-02-07 Thread revati joshi
Hello lucene members, I'm a silent member of this group. Last week I sent a query regarding reindexing, but I didn't receive a reply from anyone. I'm still stuck with the same reindexing problem. I have completed the reindexing code using h

RE: Search on Keyword rather than Text?

2006-02-07 Thread Colin Young
I had the same problem (almost exactly, even down to searching city names) and the way I solved it was to provide my own tokenizer that doesn't tokenize -- I'm passing in stuff to the analyzer already tokenized, so in your example I'd pass in "new york" as a single entity to analyze, and the tokeni

Search on Keyword rather than Text?

2006-02-07 Thread Simon Porritt
Hello, We have a document type for which we wish to create an index that may contain spaces, and we don't want the values to be tokenized. The data is a list of cities. We are currently creating the index using Field.Text: Field.Text("index", "New York") ...which of course means that the s

addIndexes causing IndexOutOfBoundsException

2006-02-07 Thread Chris Fraschetti
I have an index of roughly 2 million docs making up almost 200GB and I can't seem to merge any additional indexes into it. Here is the error I continuously get, always with the Index: 85, Size: 13. I couldn't find much in the previous mailing list posts nor on ol' faithful Google. Help/ideas? jav