To get the normalized scores use:
TopDocCollector collector = new TopDocCollector(99);
searcher.search(query, collector);
TopDocs td = collector.topDocs();
ScoreDoc[] hits = td.scoreDocs;
int docId = hits[1].doc;
Document document = searcher.doc(docId);
float score = hits[1].score / td.getMax
thanks a lot
Ian Lea writes:
Hello
To do this in one lucene query I think you'll need to flatten your
data into something like
Doc 1
title: lucene custom search
start: 2009/01/28
end: 2009/01/28
Doc 2
title: lucene custom search
start: 2009/01/29
end: 2009/01/30
Doc 3
title: some other text
Hi,
I am using OpenCms 7.0.5 with Lucene search engine.
I need to index XML content for which I have a following field
configuration in the opencms-search.xml
DESC_FIELD_CONFIG
country[1]/record[
Use your parser to get the strings out of the binary file and index them
using Lucene.
Store the string as it is if it is small; otherwise store the file path and
the record's offset position, so the content can be retrieved later.
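A rough sketch of that suggestion in plain Java, without any Lucene calls. The class name, the field names ("content", "path", "offset", "length") and the 256-character cut-off are all made up for illustration; in a real index these would become whatever stored fields you choose.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashMap;
import java.util.Map;

final class RecordStorage {
    // Hypothetical cut-off: strings up to this many characters are stored inline.
    static final int INLINE_LIMIT = 256;

    /** Decides what to store next to the indexed text: the text itself, or a pointer into the file. */
    static Map<String, String> fieldsToStore(String text, String path, long offset, int byteLength) {
        Map<String, String> fields = new LinkedHashMap<>();
        if (text.length() <= INLINE_LIMIT) {
            fields.put("content", text);
        } else {
            fields.put("path", path);
            fields.put("offset", Long.toString(offset));
            fields.put("length", Integer.toString(byteLength));
        }
        return fields;
    }

    /** Reads byteLength bytes starting at offset and decodes them as UTF-8 (an assumed encoding). */
    static String readRecord(Path file, long offset, int byteLength) {
        try (RandomAccessFile raf = new RandomAccessFile(file.toFile(), "r")) {
            byte[] buf = new byte[byteLength];
            raf.seek(offset);
            raf.readFully(buf);
            return new String(buf, StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) throws IOException {
        Path file = Files.createTempFile("records", ".bin");
        Files.write(file, "HEADERfirst recordsecond record".getBytes(StandardCharsets.UTF_8));
        // "first record" starts at byte 6 and is 12 bytes long.
        System.out.println(readRecord(file, 6, 12));
        Files.delete(file);
    }
}
```

With 20-30 gigabyte files the pointer variant keeps the index small, since only the path, offset and length are stored and the record is read back on demand.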
Regards
Ganesh
- Original Message -
From: "Paul Feuer"
To:
Sent
we have parsers for these files.
to index them, do the string representations need to be stored (aside
from sitting in the index file)? or can the reader simply provide the
string in order to record the location of the record in the binary
file?
if i need to convert the binary file into text fiel
Hi Paul,
Lucene is a 'text only' search lib, i.e. as long as you feed in anything as
a string, you'd be able to use Lucene; otherwise I don't think there's a way.
How do you even intend to search in those binary files? as in... what would
be the keyword/phrase? asking out of curiosity!
--
Anshum Gupta
N
Are these some type of parsable-into-text binary files that you have a
parser handy for?
Erik
On Jan 29, 2009, at 10:43 PM, Paul Feuer wrote:
Hi -
I've looked on the FAQ, the Java Docs, and searched a little in
google, but haven't been able to figure out if Lucene can index binary
f
Hi,
You can check out Nutch at http://lucene.apache.org/nutch/.
Regards,
Jay Joel Malaluan
Haroldo Nascimento-2 wrote:
>
>
> Hi,
>
> Is there any crawler that integrates with a Lucene index?
>
> Thanks
>
> Haroldo
>
Hi -
I've looked on the FAQ, the Java Docs, and searched a little in
google, but haven't been able to figure out if Lucene can index binary
files.
Our binary files can get up into the 20-30 gigabyte range.
If it is possible, anyone have any pointers to what interfaces I should look at?
Thanks,
Hi,
Is there any crawler that integrates with a Lucene index?
Thanks
Haroldo
HitCollector scores are not normalized to the 0-1 range; they
are "raw". You have to get the maximum score returned and
divide the raw scores by it. See TopDocs.getMaxScore,
as I remember.
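The arithmetic itself is just a division by the top score. A minimal sketch in plain Java, using float arrays as a stand-in for the collected hits so it runs without Lucene on the classpath; the class and method names here are made up, and in real code the maximum would come from the TopDocs object mentioned above.

```java
final class ScoreNormalizer {
    /** Returns the largest value in scores, or 0 if the array is empty. */
    static float max(float[] scores) {
        float best = 0f;
        for (float s : scores) {
            if (s > best) {
                best = s;
            }
        }
        return best;
    }

    /** Divides every raw score by maxScore; returns all zeros if maxScore is not positive. */
    static float[] normalize(float[] rawScores, float maxScore) {
        float[] out = new float[rawScores.length];
        if (maxScore <= 0f) {
            return out;
        }
        for (int i = 0; i < rawScores.length; i++) {
            out[i] = rawScores[i] / maxScore;
        }
        return out;
    }

    public static void main(String[] args) {
        float[] raw = {4.0f, 2.0f, 1.0f}; // made-up raw scores, best hit first
        for (float s : normalize(raw, max(raw))) {
            System.out.println(s);
        }
    }
}
```

The best hit always ends up at 1.0 this way, so the values are only comparable within one result set, not across queries.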
Best
Erick
On Thu, Jan 29, 2009 at 8:21 PM, AlexElba wrote:
>
> Hello,
> I have a project which I am trying to s
Hello,
I have a project which I am trying to switch from Lucene 2.3.2 to 2.4, and I am
getting some strange scores.
Before, my code was:
Hits hits = searcher.search(query);
float score = hits.score(1);
and the scores from Hits were in the range 0-1; 1 was a 100% match.
I changed the code to use a hit collector:
TopDocCollect
Hello
To do this in one lucene query I think you'll need to flatten your
data into something like
Doc 1
title: lucene custom search
start: 2009/01/28
end: 2009/01/28
Doc 2
title: lucene custom search
start: 2009/01/29
end: 2009/01/30
Doc 3
title: some other text
start: 2009/02/01
end: 2009/02/
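The flattening above, one document per (task, interval) pair, could be sketched like this in plain Java. Maps stand in for Lucene documents so the example runs on its own; the class and method names are made up, and the date check relies on the yyyy/MM/dd strings sorting in date order.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class TaskFlattener {
    /** Emits one flat document per interval, repeating the task title in each. */
    static List<Map<String, String>> flatten(String title, String[][] intervals) {
        List<Map<String, String>> docs = new ArrayList<>();
        for (String[] interval : intervals) {
            Map<String, String> doc = new LinkedHashMap<>();
            doc.put("title", title);
            doc.put("start", interval[0]);
            doc.put("end", interval[1]);
            docs.add(doc);
        }
        return docs;
    }

    /** True if start <= date <= end, comparing the yyyy/MM/dd strings lexicographically. */
    static boolean contains(Map<String, String> doc, String date) {
        return doc.get("start").compareTo(date) <= 0 && date.compareTo(doc.get("end")) <= 0;
    }

    public static void main(String[] args) {
        String[][] intervals = {
            {"2009/01/28", "2009/01/28"},
            {"2009/01/29", "2009/01/30"}
        };
        for (Map<String, String> doc : flatten("lucene custom search", intervals)) {
            System.out.println(doc + " contains 2009/01/30: " + contains(doc, "2009/01/30"));
        }
    }
}
```

A task with several intervals will then match more than once, so the results need to be de-duplicated by task afterwards.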
Hello!
Could you help me?
I have a task:
There are two entities:
1) Task with text field "title"
2) TimeInterval with two fields, "start" and "end", of type DateTime
Task is linked with TimeInterval as 1:n (one to many)
I have to search for tasks by
1) text in title
2) date contained in some interva
Cross-posting this announcement. There are several relevant Lucene/
Solr talks including:
Trainings
- Lucene Boot Camp (Grant Ingersoll)
- Solr Boot Camp (Erik Hatcher)
Sessions
- Introducing Apache Mahout (Grant)
- Lucene Case Studies (Erik)
- Advanced Indexing Techniques with Apach