RE: Newbie questions re: scoring

2006-05-04 Thread Chris Hostetter
: That link appears to be referring to normalized scores (everything is < : 1.0). Is it also not safe to use a threshold for raw scores? Nope. The basic flaw in comparing scores between two queries still holds ... early messages in the threads linked to go into more detail, but as i recall, the

RE: Newbie questions re: scoring

2006-05-04 Thread John Hamilton
> 2) independent of the scores being different, it is not safe to try and > pick a score threshold, this is mentioned in the FAQ... > > http://wiki.apache.org/jakarta-lucene/LuceneFAQ#head-912c1f237bb00259185353182948e5935f0c2f03 That link appears to be referring to normalized scores (everythin

Re: Newbie questions re: scoring

2006-05-04 Thread Chris Hostetter
: 1) I create an index with one document with a searchable field of "All : dogs are brown." If I search on that index with a query of "All dogs : are brown." I do not get a hit with score 1.0, but something low like : 0.38. I tried looking at the scoring algorithm and can't make heads or : tail

Re: using search instead of explain

2006-05-04 Thread Martin Kobele
awesome, thank you very much! Regards, Martin On Thursday 04 May 2006 17:15, Chris Hostetter wrote: > : so the score is always correct, just the Explain can return values > : greater than 0 although the query did not match the document due to a > : bug in Explain? Are all version affected of that

Re: Problem with lucene 1.9.1

2006-05-04 Thread Daniel Naber
On Mittwoch 03 Mai 2006 21:59, Rodrigo Marcuschi wrote: > I am having problems using lucene 1.9.1. I was using 1.4.3 successfully, Mhh, IndexModifier isn't part of Lucene 1.4, did you add it manually? > documents I add, all I get is '1 docs in index'. If I DON'T close the > modifier, It must be

Re: using search instead of explain

2006-05-04 Thread Chris Hostetter
: so the score is always correct, just the Explain can return values : greater than 0 although the query did not match the document due to a : bug in Explain? Are all version affected of that? I use lucene 1.9.1. 1) the score returned by TopDocs or passed to a HitCollector is by definition the co

Re: Obtain terms for only particular field(s)

2006-05-04 Thread Chris Hostetter
: As I understand - IndexReader.terms() returns a termEnum of all the : terms in the index, and I would have to iterate through all of them to : pick out the ones from the fields that I want. Is there a better way : to do this? the TermEnum is garunteed to be in order. you can seek to a particul

Re: Wildcard with Words Ending in 'e'

2006-05-04 Thread Chris Hostetter
: now very useful when combined. I'm going to investigate other means of : taking into account the wildcard queries and handling them in a : different way. Rest assured that if I find a solution I'll post it to : the user group. the most straight forward solution is to have two indexed files -- o

Re: ArrayIndexOutOfBoundsException w/ ImageSearcher

2006-05-04 Thread Daniel Naber
On Mittwoch 03 Mai 2006 01:46, Michael Dodson wrote: > Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: -1 This is what you get when you try to open a Lucene 1.9 index with Lucene 1.4. Regards Daniel -- http://www.danielnaber.de --

AW: internal Searching behavior or how to get a hit?

2006-05-04 Thread Mathias Keilbach
Thanks a lot for your quick support! I found another site which describes the index structure and files. http://lucene.apache.org/java/docs/fileformats.html -- AFAIK Lucene will have an inverted index which maps tokens (term

Index Stored on Database instead of FileSystem

2006-05-04 Thread Nikhil Goel
Hi, By default, Lucene create a directory for index on FileSystem. Is there a plugin or support provided by Lucene to store the index in Databse instead of FileSystem, where:- 1) Directory can be equal to table in DB . 2) Each file in the index directory represents one row in that table. 3) Data

Re: using search instead of explain

2006-05-04 Thread Martin Kobele
so the score is always correct, just the Explain can return values greater than 0 although the query did not match the document due to a bug in Explain? Are all version affected of that? I use lucene 1.9.1. Thanks! Regards, Martin On Thursday 04 May 2006 09:27, Yonik Seeley wrote: > On 5/4/06,

Re: Updating index if there is a database changes

2006-05-04 Thread Yonik Seeley
On 5/3/06, Kiran Joisher <[EMAIL PROTECTED]> wrote: I m working on a project where I will use lucene to make a search engine on a database. I am new to lucene. I wrote a test program which indexes a table and searches the same.. but now I m stuck on how to update the index in case a database chan

Re: How to realize Sql LIKE?

2006-05-04 Thread Otis Gospodnetic
There is also the following, that some guy named Erik contributed recently: ./contrib/regex/src/java/org/apache/regexp/RegexpTunnel.java ./contrib/regex/src/java/org/apache/lucene/search/regex/SpanRegexQuery.java ./contrib/regex/src/java/org/apache/lucene/search/regex/RegexQuery.java ./contrib/reg

Re: using search instead of explain

2006-05-04 Thread Yonik Seeley
On 5/4/06, Martin Kobele <[EMAIL PROTECTED]> wrote: thank you very much. Let's hope the developers can get this bug fixed soon :) While it would be nice to have a 100% accurate explain(), it should only be used for debugging purposes (and it's normally going to be *much* slower than using a Hit

Newbie questions re: scoring

2006-05-04 Thread Lee, Andrew J \(CA - Toronto\)
Hi, I am new to Lucene and this mailing list, so my apologies if these questions have already been answered. 1) I create an index with one document with a searchable field of "All dogs are brown." If I search on that index with a query of "All dogs are brown." I do not get a hit with score 1.0,

Re: using search instead of explain

2006-05-04 Thread Martin Kobele
Hi Chris, thank you very much. Let's hope the developers can get this bug fixed soon :) Regards, Martin On Wednesday 03 May 2006 17:17, Chris Hostetter wrote: > : Now we want to use Search.search(Query, HitCollection). > : HitCollection adds the document to a BitSet as soon as the score is > : n

RE: Obtain terms for only particular field(s)

2006-05-04 Thread Ramana Jelda
FieldCache.getStringIndex(reader,fieldName) Good luck Jelda > -Original Message- > From: Chun Wei Ho [mailto:[EMAIL PROTECTED] > Sent: Thursday, May 04, 2006 12:36 PM > To: java-user@lucene.apache.org > Subject: Obtain terms for only particular field(s) > > Hi, > > I have a pretty lar

Obtain terms for only particular field(s)

2006-05-04 Thread Chun Wei Ho
Hi, I have a pretty large index and I would like to obtain all the Terms for only one or two particular fields. As I understand - IndexReader.terms() returns a termEnum of all the terms in the index, and I would have to iterate through all of them to pick out the ones from the fields that I want

Controlling the number of segments

2006-05-04 Thread johnhara
Hi. When I add new documents to my index, I sometimes end up with a larger number of segments than i theoretically should be able to. I use all the default settings. If I have understood this right I should never experience more than nine segments in my index when the mergeFactor is 10. I would

Re: How to realize Sql LIKE?

2006-05-04 Thread Erik Hatcher
On May 3, 2006, at 9:39 PM, JustineHMS wrote: I m new to Lucene, Just wondering if there is a way to realize sql 'Like' function in Lucene as following: Select * from user_table where username like '%abc%' I have checked with wildCardQuery, but it can not start with a '%'. What about Regex

Re: Wildcard with Words Ending in 'e'

2006-05-04 Thread Gordon Rogers
Chris Your shot in the dark was correct, I am using Porter Stemming. Thanks to your excellent response I understand now why wildcards and stemming are now very useful when combined. I'm going to investigate other means of taking into account the wildcard queries and handling them in a different