Proximity and Percentage match search in Lucene

2009-04-19 Thread Radha Sreedharan
What I need is the following : If my document field is ( ab,bc,cd,ef) and Search tokens are (ab,bc,cd). Given the following : I should get a hit even if all of the search tokens aren't present If the tokens are found they should be found within a distance x of each other ( proximity search) >

Fwd: Proximity and Percentage match search in Lucene

2009-04-19 Thread Radha Sreedharan
What I need is the following : If my document field is ( ab,bc,cd,ef) and Search tokens are (ab,bc,cd). Given the following : I should get a hit even if all of the search tokens aren't present If the tokens are found they should be found within a distance x of each other ( proximity search)

Proximity and Percentage match search in Lucene

2009-04-25 Thread Radha Sreedharan
What I need is the following : > If my document field is ( ab,bc,cd,ef) and Search tokens are (ab,bc,cd). > > Given the following : > I should get a hit even if all of the search tokens aren't present > If the tokens are found they should be found within a distance x of > each other ( proximi

Re: How to het the score in percentage

2009-05-05 Thread Radha Sreedharan
Even i have a similar requirement. I need the percentage match. The way I am going about it is doing 2 searches eg if my search string is "pizza cheese " and my document has " pizza cheese ketchup" percentage match = ( score of searching "pizza cheese" in " pizza cheese ketchup") / ( score of

Modifying score based on tf and slop

2009-05-06 Thread Radha Sreedharan
Hi all, All I have is a query running on a document with a single field which has some search value. This is all which will be present. No more documents / fields. I have the following specific requirements 1) Length of document should not affect score - Implemented as per lucene documentation u

Re: Modifying score based on tf and slop

2009-05-06 Thread Radha Sreedharan
ilarity and return a delegate to the actual > similarity. > > Hope that helps. > > simon > > On Wed, May 6, 2009 at 7:44 PM, Radha Sreedharan wrote: >> Hi all, >> >> All I have is a query running on a document with a single field which >> has some sear

Highlighting in Lucene ( With SrndQuery & Query Parser )

2009-06-02 Thread Radha Sreedharan
Hi all, *Requirment * I have a query of type *SrndQuery *which is parsed by *QueryParser*.On the hit of this query I want to get the region of words around the match. *Approach followed* I used the highlighting logic code as per example usage in << http://lucene.apache.org/java/2_4_1/api/org/apac

Highlight - SimpleSpanFragment

2009-06-12 Thread Radha Sreedharan
Hi all, I am using *SimpleSpanFragment *to find the words a few positions around the match. I am specifying the number of characters, which is the second parameter of the *SimpleSpanFragment *constructor . However, it always gives the words to the right of the match and does not give me the word

Re: Highlight - SimpleSpanFragment

2009-06-13 Thread Radha Sreedharan
second list and append to my matching text However I really feel that I am doing something redundant and that Lucene should be returning me the matching text !!! Pls Do help me out in this . On Fri, Jun 12, 2009 at 11:25 PM, Radha Sreedharan wrote: > Hi all, > > I am using *SimpleSpanFra

Searching for a special character

2009-06-24 Thread Radha Sreedharan
Hi all, I am using a Standard analyzer on both my search field and my query. I use a SpanNearQuery to search on the search field. One of the query terms has special characters like ( - round open bracket and ) - round close bracket : How does Lucene handle this? Also, the search field has ( and

Re: Modifying score based on tf and slop

2009-07-05 Thread Radha Sreedharan
imilarity? > > If you just wanna do that for the query you are using you could > > override Query#getSimilarity and return a delegate to the actual > > similarity. > > > > Hope that helps. > > > > simon > > > > On Wed, May 6, 2009 at 7:44 PM, Radha

Re: Modifying score based on tf and slop

2009-07-06 Thread Radha Sreedharan
Thanks a lot Mark. Do Correct me if I am wrong. but what this means is tf does not really have the same meaning as it does in case of other queries. Also I think I understand better what hossman has told - in the sense that BC is there in two matching spans , which is why we get higher score - th