Re: get wordno, lineno, pageno for term/phrase

2010-08-07 Thread arun r
I tried indexing each page as a separate document, but if a phrase is spread across two pages, the span search does not capture it. Is there a workaround for this? On Sat, Aug 7, 2010 at 8:00 PM, Babak Farhang wrote: > How about making each line a separate document? You'd worry about > scaling it late

Query Match Count

2010-08-07 Thread Ryan McV
Hello All, Is there a way to count the number of times a query matched in a particular document? For example, say we created a document that had the string "cheese cheese cheese cheese" in the field that is being searched. I would like to know that the query "cheese" matched four times in that pa

Re: LUCENE-2456 (A Column-Oriented Cassandra-Based Lucene Directory)

2010-08-07 Thread Otis Gospodnetic
Utku, you should ask via comments on https://issues.apache.org/jira/browse/LUCENE-2453. What happened with Lucandra? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: Utku Can Topçu > To

Re: get wordno, lineno, pageno for term/phrase

2010-08-07 Thread Babak Farhang
How about making each line a separate document? You'd worry about scaling it later (e.g. the 32-bit limitation on the number of docs in an index). On Fri, Aug 6, 2010 at 11:37 AM, arun r wrote: > I am trying to create a custom analyzer that will check for pagebreak > and linebreak and add the pa

Re: Need help in understanding output of searcher.explain() function

2010-08-07 Thread Soby Thomas
Thanks, Jayendra... it was really helpful. On Sat, Aug 7, 2010 at 6:07 PM, jayendra patil wrote: > Trying to put up an explanation :- > > 0.022172567 = (MATCH) product of: > 0.07760398 = (MATCH) sum of: > 0.02287053 = (MATCH) weight(payload:ces in 550), product of: > 0.32539415 = queryWeight(

Re: Need help in understanding output of searcher.explain() function

2010-08-07 Thread jayendra patil
Trying to put up an explanation :-

0.022172567 = (MATCH) product of:
  0.07760398 = (MATCH) sum of:
    0.02287053 = (MATCH) weight(payload:ces in 550), product of:
      0.32539415 = queryWeight(payload:ces), product of:
        2.2491398 = *idf*(docFreq=157, maxDocs=551)
        0.14467494 = queryNor
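
The idf and queryWeight figures in the explanation above can be checked by hand. What follows is only a sketch, assuming the index uses Lucene's DefaultSimilarity (where idf = 1 + ln(maxDocs / (docFreq + 1))) and assuming the truncated last label is queryNorm. The variable names are illustrative and do not come from the thread.

```python
import math

# Values copied from the explain() output quoted above.
doc_freq = 157          # docFreq for the term payload:ces
max_docs = 551          # maxDocs in the index
query_norm = 0.14467494 # assumed to be the queryNorm factor (label is truncated)

# Assumption: DefaultSimilarity-style idf, 1 + ln(maxDocs / (docFreq + 1)).
idf = 1.0 + math.log(max_docs / (doc_freq + 1))

# queryWeight is reported as "product of" idf and the factor that follows it.
query_weight = idf * query_norm

print(f"idf          = {idf:.7f}")
print(f"query_weight = {query_weight:.8f}")
```

Both results should agree with the reported 2.2491398 and 0.32539415 up to float rounding, which suggests each "product of" line is simply the product of the lines nested beneath it.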

Re: understanding lucene

2010-08-07 Thread Yakob
On 7/29/10, Erick Erickson wrote: > that code has way too much stuff in it for your first application. > Hibernate > is in there and it looks, from the description, like it tries to search your > database. > I'd *strongly* recommend that you don't go there. > > Try looking at > http://wiki.apa

Need help in understanding output of searcher.explain() function

2010-08-07 Thread Soby Thomas
Hello Guys, I am trying to understand how the Lucene score is calculated, so I'm using the searcher.explain() function, but the output it gives is really confusing to me. Below are the details of the query that I gave and the output it gave me. Query: *It is definitely a CES deal that will be over in Sep or Oct