RE: Lucene 2.3-2.4 switch: Scoring change

2009-01-29 Thread Uwe Schindler
To get the normalized scores use: TopDocCollector collector = new TopDocCollector(99); searcher.search(query, collector); TopDocs td = collector.topDocs(); ScoreDoc[] hits = td.scoreDocs; int docId = hits[1].doc; Document document = searcher.doc(docId); float score = hits[1].score / td.getMax

Re: custom search

2009-01-29 Thread Nail Achmedzhanov
Thanks a lot. Ian Lea wrote: Hello To do this in one lucene query I think you'll need to flatten your data into something like Doc 1 title: lucene custom search start: 2009/01/28 end: 2009/01/28 Doc 2 title: lucene custom search start: 2009/01/29 end: 2009/01/30 Doc 3 title: some other text

Lucene OpenCms search - Xpath notation?

2009-01-29 Thread Kesarkar, Dipak
Hi, I am using OpenCms 7.0.5 with the Lucene search engine. I need to index XML content, for which I have the following field configuration in opencms-search.xml: DESC_FIELD_CONFIG country[1]/record[

Re: indexing binary files?

2009-01-29 Thread Ganesh
Use your parser to get the string out of the binary file and index them using Lucene. Store the string as it is, if it is small otherwise store the path and its offset position. The content could be later retrieved. Regards Ganesh - Original Message - From: "Paul Feuer" To: Sent
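A minimal sketch of the path-and-offset approach described above, assuming each record occupies a known byte range in the binary file. `RecordLocator` and its fields are hypothetical names, and decoding the bytes as UTF-8 only stands in for whatever parser the binary format really needs.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;

/** Hypothetical pointer to one record inside a large binary file. */
public final class RecordLocator {
    public final String path;   // file that holds the record
    public final long offset;   // byte position where the record starts
    public final int length;    // record size in bytes

    public RecordLocator(String path, long offset, int length) {
        this.path = path;
        this.offset = offset;
        this.length = length;
    }

    /**
     * Reads only the bytes of this record, without loading the whole file.
     * Assumption: the record is UTF-8 text; a real parser would go here.
     */
    public String readText() throws IOException {
        try (RandomAccessFile file = new RandomAccessFile(path, "r")) {
            file.seek(offset);
            byte[] buffer = new byte[length];
            file.readFully(buffer);
            return new String(buffer, StandardCharsets.UTF_8);
        }
    }
}
```

The path, offset, and length could be kept as stored fields next to the indexed text, so that a search hit can be traced back to the original record without storing the full content in the index.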

Re: indexing binary files?

2009-01-29 Thread Paul Feuer
we have parsers for these files. to index them, do the string representations need to be stored (aside from sitting in the index file)? or can the reader simply provide the string in order to record the location of the record in the binary file? if i need to convert the binary file into text fiel

Re: indexing binary files?

2009-01-29 Thread Anshum
Hi Paul, Lucene is a 'text only' search lib. i.e. as long as you feed in anything as a string, you'd be able to use lucene else I don't think there's a way. How do you even intend to search in those binary files? as in... what would be the keyword/phrase? asking out of curiosity! -- Anshum Gupta N

Re: indexing binary files?

2009-01-29 Thread Erik Hatcher
Are these some type of parsable-into-text binary files that you have a parser handy for? Erik On Jan 29, 2009, at 10:43 PM, Paul Feuer wrote: Hi - I've looked on the FAQ, the Java Docs, and searched a little in google, but haven't been able to figure out if Lucene can index binary f

Re: Crawler

2009-01-29 Thread Jay Malaluan
Hi, You can check out Nutch at http://lucene.apache.org/nutch/. Regards, Jay Joel Malaluan Haroldo Nascimento-2 wrote: > Hi, > There is any crawler that integrate with index lucene ? > Thanks > Haroldo

indexing binary files?

2009-01-29 Thread Paul Feuer
Hi - I've looked on the FAQ, the Javadocs, and searched a little in Google, but haven't been able to figure out if Lucene can index binary files. Our binary files can get up into the 20-30 gigabyte range. If it is possible, does anyone have any pointers to what interfaces I should look at? Thanks,

Crawler

2009-01-29 Thread Haroldo Nascimento
Hi, Is there any crawler that integrates with a Lucene index? Thanks Haroldo

Re: Lucene 2.3-2.4 switch: Scoring change

2009-01-29 Thread Erick Erickson
HitCollector scores are not normalized to the 0-1 range; they are "raw". You have to get the maximum score returned and normalize the raw scores yourself. See TopDocs.getMaxScore, as I remember. Best Erick On Thu, Jan 29, 2009 at 8:21 PM, AlexElba wrote: > > Hello, > I have project which I am trying to s
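A minimal sketch of the normalization described above: divide each raw score by the maximum score of the result set. It uses plain arrays so that it runs without Lucene on the classpath; `ScoreNormalizer` is a hypothetical helper, not part of Lucene. In Lucene 2.4 the raw values would presumably come from `ScoreDoc.score` and the maximum from `TopDocs.getMaxScore()`.

```java
/** Hypothetical helper that rescales raw scores relative to the best hit. */
public final class ScoreNormalizer {
    private ScoreNormalizer() {}

    /**
     * Returns rawScores[i] / maxScore for every hit.
     * If maxScore is zero, negative, or NaN, all results are 0
     * so that no division by zero takes place.
     */
    public static float[] normalize(float[] rawScores, float maxScore) {
        float[] normalized = new float[rawScores.length];
        if (!(maxScore > 0f)) {
            return normalized; // array is already filled with zeros
        }
        for (int i = 0; i < rawScores.length; i++) {
            normalized[i] = rawScores[i] / maxScore;
        }
        return normalized;
    }
}
```

With this scheme the top hit always normalizes to 1.0, so the value expresses a score relative to the best hit and not an absolute match percentage.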

Lucene 2.3-2.4 switch: Scoring change

2009-01-29 Thread AlexElba
Hello, I have a project that I am trying to switch from Lucene 2.3.2 to 2.4, and I am getting some strange scores. Before, my code was: Hits hits = searcher.search(query); Float score = hits.score(1); and the scores from Hits were in the 0-1 range, where 1 was a 100% match. I changed the code to use a hit collector: TopDocCollect

Re: custom search

2009-01-29 Thread Ian Lea
Hello To do this in one lucene query I think you'll need to flatten your data into something like Doc 1 title: lucene custom search start: 2009/01/28 end: 2009/01/28 Doc 2 title: lucene custom search start: 2009/01/29 end: 2009/01/30 Doc 3 title: some other text start: 2009/02/01 end: 2009/02/
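A minimal sketch of the flattening step described above, assuming one Task with a title and several date intervals. All names (`TaskFlattener`, `Interval`, `FlatDoc`) are hypothetical, and `matches` is only an in-memory stand-in for a query that combines a term on the title with the range condition start <= date <= end. No Lucene classes are used.

```java
import java.time.LocalDate;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper that turns one task with n intervals into n flat rows. */
public final class TaskFlattener {
    private TaskFlattener() {}

    /** A date interval with inclusive start and end. */
    public static final class Interval {
        public final LocalDate start;
        public final LocalDate end;

        public Interval(LocalDate start, LocalDate end) {
            this.start = start;
            this.end = end;
        }
    }

    /** One flat row: the task title repeated next to a single interval. */
    public static final class FlatDoc {
        public final String title;
        public final LocalDate start;
        public final LocalDate end;

        public FlatDoc(String title, LocalDate start, LocalDate end) {
            this.title = title;
            this.start = start;
            this.end = end;
        }

        /** True if the title contains the word and the date lies in [start, end]. */
        public boolean matches(String word, LocalDate date) {
            return title.contains(word)
                    && !date.isBefore(start)
                    && !date.isAfter(end);
        }
    }

    /** Emits one FlatDoc per interval, copying the title into each. */
    public static List<FlatDoc> flatten(String title, List<Interval> intervals) {
        List<FlatDoc> docs = new ArrayList<>();
        for (Interval interval : intervals) {
            docs.add(new FlatDoc(title, interval.start, interval.end));
        }
        return docs;
    }
}
```

Because the title is duplicated into every row, a single query can test the text and the date on the same row; the cost is that one task may come back as several hits and has to be de-duplicated by the caller.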

custom search

2009-01-29 Thread Nail Achmedzhanov
Hello! Could you help me? I have a task: There are two entities: 1) Task with text field "title" 2) TimeInterval with two fields "start", "end" of type DateTime. Task is linked with TimeInterval as 1:n (one to many). I have to look for a task by 1) text in title 2) date contained in some interva

Registration for ApacheCon Europe 2009 is now open!

2009-01-29 Thread Erik Hatcher
Cross-posting this announcement. There are several relevant Lucene/Solr talks including: Trainings - Lucene Boot Camp (Grant Ingersoll) - Solr Boot Camp (Erik Hatcher) Sessions - Introducing Apache Mahout (Grant) - Lucene Case Studies (Erik) - Advanced Indexing Techniques with Apach