Re: interpreting scores

2009-05-06 Thread Nate
Thanks Anshum. What happens if a search returns only one match, and that match is not very "good"? If scores are only comparable to the scores of other matches in the same search, then the score is effectively meaningless if there is only one match. It seems like a very common need to want to pro

Re: interpreting scores

2009-05-06 Thread Anshum
Hi Nate, The scores are only comparable within the same search and not over different searches as the scores are affected by query as well as docs. About the threshold, I guess you could have count cutoff to get 'x' best matches. Said so coz I'm not really able to recollect anything which could use

TermEnum with deleted dccuments

2009-05-06 Thread Antony Bowesman
I am merging Index A to Index B. First I read the terms for a particular field from index A and some of the documents in A get deleted. I then enumerate the terms on a different field also in index A, but the terms from the deleted document are still present. The termEnum.docFreq() also retu

Re: Boosting query - debuging

2009-05-06 Thread Grant Ingersoll
Hi Liat, Can you post the code you are using to generate the info below? -Grant On May 3, 2009, at 11:43 PM, liat oren wrote: I looked into the output again, and saw that the explain method, explains a different result then the document i thought it did. Within the loop of the results, I r

Re: Modifying score based on tf and slop

2009-05-06 Thread Simon Willnauer
Hey, On Thu, May 7, 2009 at 3:51 AM, Radha Sreedharan wrote: > Hi, > > I made tf return a 1.0f but the issue with that is that now the slop > factor is neglected. > > So even if the tow terms in the span near query or far off or nearby > the score returned is the same. > > I want the no of times o

Re: Modifying score based on tf and slop

2009-05-06 Thread Radha Sreedharan
Hi, I made tf return a 1.0f but the issue with that is that now the slop factor is neglected. So even if the tow terms in the span near query or far off or nearby the score returned is the same. I want the no of times of the term occurring to be neglected but not the slop. Radha On Thu, May 7

interpreting scores

2009-05-06 Thread Nate
Hi all, First, the problem I'm trying to solve: I have two folders, each containing files. I need to match files in one folder with files in the other. Eg: notes/Michael Jackson - Don't Stop 'till You Get Enough.notes songs/Michael Jackson Don't stop until you get enough.mp3 I provide the notes

Re: Modifying score based on tf and slop

2009-05-06 Thread Simon Willnauer
Hey, If I get you right you wanna make tf not affecting the score at all. if so why don't you just return 1.0f by overriding similarity? If you just wanna do that for the query you are using you could override Query#getSimilarity and return a delegate to the actual similarity. Hope that helps. si

Modifying score based on tf and slop

2009-05-06 Thread Radha Sreedharan
Hi all, All I have is a query running on a document with a single field which has some search value. This is all which will be present. No more documents / fields. I have the following specific requirements 1) Length of document should not affect score - Implemented as per lucene documentation u

RE: Lucene index on iPhone

2009-05-06 Thread Aditya
Hi, You can try Clucene @ http://sourceforge.net/projects/clucene/ based on older version of Java Lucene but should be okay. I was able to port it to symbian and Windows Mobile with some efforts. Best Regards, Aditya -Original Message- From: Grant Ingersoll [mailto:gsing...@apache.org]

Re: Exact match on entire field

2009-05-06 Thread Karl Wettin
You should probably tell us the reason to why you need this functionallity. Given you only load the stored comparative field for the first it doesn't really have to be that expensive. If you know that the first hit was not a perfect match then you know that any matching documents with a l

Re: Exact match on entire field

2009-05-06 Thread Erick Erickson
how much data are you talking about here? Could you use a KeywordAnalyzer (perhaps in a duplicated field) with appropriate filtering (to lowercase, remove punctuation, etc)? Best Erick On Wed, May 6, 2009 at 4:50 AM, Laura Hollink wrote: > Hi, > > I am trying to distinguish between a document t

Re: Lucene index on iPhone

2009-05-06 Thread Grant Ingersoll
http://www.lucidimagination.com/search/?q=Objective+C+port+of+Lucene suggests there is an Objective C port. Maybe that works? I haven't done any iPhone dev. On May 6, 2009, at 5:06 AM, Paul Libbrecht wrote: Shashi, the only java I know for iphone is with Cydia on jailbroken iphones. Is

Re: Lucene index on iPhone

2009-05-06 Thread Paul Libbrecht
Shashi, the only java I know for iphone is with Cydia on jailbroken iphones. Is this the type of things you're looking at? paul Le 06-mai-09 à 12:08, Shashi Kant a écrit : I am working on an iPhone application where the Lucene index needs to reside on-device (for multiple reasons). Has anyone

Re: Lucene index on iPhone

2009-05-06 Thread Danil ŢORIN
iPhone doesn't support java, so there is no way to run lucene on it. Creating a sqlite database and search inside it is compltetly different solution, which has nothing to do with Lucene. On Wed, May 6, 2009 at 13:08, Shashi Kant wrote: > Hi all, > > I am working on an iPhone application where t

Lucene index on iPhone

2009-05-06 Thread Shashi Kant
Hi all, I am working on an iPhone application where the Lucene index needs to reside on-device (for multiple reasons). Has anyone figured out a way to do that? As you might know the iPhone contains SQLite - could an index be embedded inside SQLite? or could it be resident separately as a file? Th

Re: Re: I got the score "0.3044460713863373 " for the cosine similarity of two document with the same text content !!

2009-05-06 Thread Kamal Najib
hi, thanks for the reply.see: http://lucene.apache.org/java/2_4_1/api/index.html you will find there the Similarity have created and run to get the similarity between the two Strings.I did the folow: I created a doc: doc.add(new Field("term","this expression of galectin-1 in blood vessel walls w

Exact match on entire field

2009-05-06 Thread Laura Hollink
Hi, I am trying to distinguish between a document that matches the query because the query *appears* in one of the fields, and a document that matches the query because the query equals the complete field. I do want to use an Analyzer for case- and punctuation normalization. For example: