implementation of lucene into opencms

2007-03-30 Thread mohamed hadj taieb
Hi i have implemented lucene with tomcat the application demo gives an interface to write the word to search and when the search is launched it returns the path of the pages which contain that word like that : Document Summary C:\Program Files\Apache Software Foundation\Tomcat 5.5\webapps\jsp-exa

Re: normalized scores

2007-03-30 Thread Donna L Gresh
I'm well aware that some queries will return no results due to my filtering by 0.3. That's the point. I expect that some of my input queries will not be a good match to *any* of the documents in my second index. I'm really doing something much like the "Books Like This" example in Chapter 5 of

Re: normalized scores

2007-03-30 Thread Erik Hatcher
On Mar 30, 2007, at 8:48 AM, Donna L Gresh wrote: It is unfortunate that some scores are being normalized and some may not be. Is there a way to obtain the unnormalized score? Any IndexSearcher.search method that does not return Hits keeps the raw scores. Try out the TopDocs returning one

setBoost on Field

2007-03-30 Thread DECAFFMEYER MATHIEU
Hi, I am parsing this file called Logistics.htm I have a field named "headlines" that contains word "clients" among others. When I don't put a boost on this field, I have as score 0.06 when searching for clients. Then when I put a boost of "10", I have a score of 0.21 Yet I was expecting a score

Retrieving the index format

2007-03-30 Thread Dan Climan
Is there a way to retrieve the tell which format an index is in? The file formats documentation http://lucene.apache.org/java/docs/fileformats.html#Segments%20File indicates that the segments file stores a Format value that can be used to determine the type. Format is -1 as of Lucene 1.4 and

Re: normalized scores

2007-03-30 Thread Donna L Gresh
Thanks Erik, that works great-- Donna >> It is unfortunate that some scores are being normalized and some >> may not >> be. Is there a >> way to obtain the unnormalized score? >Any IndexSearcher.search method that does not return Hits keeps the >raw scores. Try out the TopDocs returning ones

Re: normalized scores

2007-03-30 Thread Chris Hostetter
: I'm well aware that some queries will return no results due to my : filtering by 0.3. : That's the point. I expect that some of my input queries will not be a : good match : to *any* of the documents in my second index. what i'm trying ot make sure you understand is that picking 0.3 as an arbit

how to index a large database

2007-03-30 Thread Mohammad Norouzi
Hi all I am going to index our database. one approach is to join them and then index the fields. but the information are very large say more than 3 millions. so the Sql Server fails to select them. I want to know if anyone has such this experience to indexing huge information of database using lu

Re: how to search over another search

2007-03-30 Thread Mohammad Norouzi
Hi Erick, Why not combine the indexes? That would be the "lucene way"... I combined them by joining tables but it gets very very large and ResaltSet failed to retrieve the fields!!! On 3/26/07, Erick Erickson <[EMAIL PROTECTED]> wrote: The short form is no. Lucene is emphatically NOT a rela