Cross-posting this announcement. There are several relevant Lucene/
Solr talks including:
Trainings
- Lucene Boot Camp (Grant Ingersoll)
- Solr Boot Camp (Erik Hatcher)
Sessions
- Introducing Apache Mahout (Grant)
- Lucene Case Studies (Erik)
- Advanced Indexing Techniques with Apach
Hello!
Could you help me?
I have a task:
There are two entities:
1) Task with text field "title"
2) TimeInterval with two fields, "start" and "end", of type DateTime
Task is linked with TimeInterval as 1:n (one to many)
I have to search for tasks by
1) text in title
2) date contained in some interva
Hello
To do this in one lucene query I think you'll need to flatten your
data into something like
Doc 1
title: lucene custom search
start: 2009/01/28
end: 2009/01/28
Doc 2
title: lucene custom search
start: 2009/01/29
end: 2009/01/30
Doc 3
title: some other text
start: 2009/02/01
end: 2009/02/
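The flattening idea above can be sketched in plain Java. This is only an illustration of the data shape, not Lucene code: the class and method names (FlattenSketch, FlatDoc, flatten, search) are hypothetical, and a substring check stands in for real tokenized text matching. In an actual index, the same condition would presumably be a boolean query combining a title clause with range constraints on start and end.

```java
import java.time.LocalDate;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Illustrative only: plain Java, no Lucene calls. All names are hypothetical.
class FlattenSketch {

    // One flattened "document": the task title repeated for one of its intervals.
    static final class FlatDoc {
        final String title;
        final LocalDate start;
        final LocalDate end;

        FlatDoc(String title, LocalDate start, LocalDate end) {
            this.title = title;
            this.start = start;
            this.end = end;
        }
    }

    // Turn a task with n intervals into n documents (the 1:n link becomes n rows).
    static List<FlatDoc> flatten(String title, LocalDate[][] intervals) {
        List<FlatDoc> docs = new ArrayList<>();
        for (LocalDate[] interval : intervals) {
            docs.add(new FlatDoc(title, interval[0], interval[1]));
        }
        return docs;
    }

    // Distinct titles that contain the word and have an interval covering the date.
    // A set is used because one task may match through several of its documents.
    static Set<String> search(List<FlatDoc> docs, String word, LocalDate date) {
        Set<String> titles = new LinkedHashSet<>();
        for (FlatDoc d : docs) {
            boolean textMatch = d.title.toLowerCase().contains(word.toLowerCase());
            boolean dateMatch = !date.isBefore(d.start) && !date.isAfter(d.end);
            if (textMatch && dateMatch) {
                titles.add(d.title);
            }
        }
        return titles;
    }

    public static void main(String[] args) {
        List<FlatDoc> docs = flatten("lucene custom search", new LocalDate[][] {
            {LocalDate.of(2009, 1, 28), LocalDate.of(2009, 1, 28)},
            {LocalDate.of(2009, 1, 29), LocalDate.of(2009, 1, 30)},
        });
        System.out.println(search(docs, "lucene", LocalDate.of(2009, 1, 30)));
    }
}
```

Because each task appears once per interval, results need to be de-duplicated by task, which is why the sketch collects titles into a set.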
Hello,
I have a project which I am trying to switch from Lucene 2.3.2 to 2.4, and I am
getting some strange scores.
Before, my code was:
Hits hits = searcher.search(query);
Float score = hits.score(1);
and the scores from hits were in the 0-1 range; 1 was a 100% match.
I changed the code to use a hit collector
TopDocCollect
HitCollector scores are not normalized to the 0-1 range, they
are "raw". You have to get the maximum score returned and
normalize the raw scores. See TopDocs.getMaxScore
as I remember.
Best
Erick
On Thu, Jan 29, 2009 at 8:21 PM, AlexElba wrote:
>
> Hello,
> I have project which I am trying to s
Hi,
Is there any crawler that integrates with a Lucene index?
Thanks
Haroldo
Hi -
I've looked on the FAQ, the Java Docs, and searched a little in
google, but haven't been able to figure out if Lucene can index binary
files.
Our binary files can get up into the 20-30 gigabyte range.
If it is possible, anyone have any pointers to what interfaces I should look at?
Thanks,
Hi,
You can check out Nutch at http://lucene.apache.org/nutch/.
Regards,
Jay Joel Malaluan
Haroldo Nascimento-2 wrote:
>
>
> Hi,
>
> Is there any crawler that integrates with a Lucene index?
>
> Thanks
>
> Haroldo
>
Are these some type of parsable-into-text binary files that you have a
parser handy for?
Erik
On Jan 29, 2009, at 10:43 PM, Paul Feuer wrote:
Hi -
I've looked on the FAQ, the Java Docs, and searched a little in
google, but haven't been able to figure out if Lucene can index binary
f
Hi Paul,
Lucene is a 'text only' search library, i.e. as long as you can feed your content in as
a string, you'll be able to use Lucene; otherwise I don't think there's a way.
How do you intend to search in those binary files? That is, what would
the keyword or phrase be? Asking out of curiosity!
--
Anshum Gupta
N
We have parsers for these files.
To index them, do the string representations need to be stored (aside
from sitting in the index file)? Or can the reader simply provide the
string in order to record the location of the record in the binary
file?
if i need to convert the binary file into text fiel
Use your parser to get the strings out of the binary file and index them
using Lucene.
If a string is small, store it as it is; otherwise store the file path and the
record's offset. The content can then be retrieved later.
Regards
Ganesh
- Original Message -
From: "Paul Feuer"
To:
Sent
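A minimal sketch of the "store the path and offset" suggestion, in plain Java with no Lucene calls. The RecordLocator class, the readRecord method, and the explicit record length are assumptions made for illustration; the thread only mentions storing the path and the offset, so how the record's end is determined would depend on the binary format and its parser.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Illustrative only: fetch a record from a large binary file using a stored
// path and offset, instead of keeping the record text inside the index.
class RecordLocator {

    // Read `length` bytes starting at `offset` from the file at `path`.
    // The length parameter is an assumption; the thread only mentions the offset.
    static byte[] readRecord(String path, long offset, int length) throws IOException {
        try (RandomAccessFile file = new RandomAccessFile(path, "r")) {
            byte[] buffer = new byte[length];
            file.seek(offset);       // jump straight to the record, no full scan
            file.readFully(buffer);  // throws EOFException if the file is too short
            return buffer;
        }
    }

    public static void main(String[] args) throws IOException {
        // Build a tiny stand-in for a binary file: 6-byte header, 10-byte record, trailer.
        Path tmp = Files.createTempFile("records", ".bin");
        Files.write(tmp, "HEADERrecord-oneTRAILER".getBytes(StandardCharsets.US_ASCII));

        byte[] record = readRecord(tmp.toString(), 6, 10);
        System.out.println(new String(record, StandardCharsets.US_ASCII));

        Files.delete(tmp);
    }
}
```

With this layout, only the path and the offset (plus whatever marks the record's end) would need to be kept as stored fields, which matters when the source files reach tens of gigabytes.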
Hi,
I am using OpenCms 7.0.5 with the Lucene search engine.
I need to index XML content, for which I have the following field
configuration in opencms-search.xml:
DESC_FIELD_CONFIG
country[1]/record[
thanks a lot
Ian Lea wrote:
Hello
To do this in one lucene query I think you'll need to flatten your
data into something like
Doc 1
title: lucene custom search
start: 2009/01/28
end: 2009/01/28
Doc 2
title: lucene custom search
start: 2009/01/29
end: 2009/01/30
Doc 3
title: some other text
To get the normalized scores use:
TopDocCollector collector = new TopDocCollector(99);
searcher.search(query, collector);
TopDocs td = collector.topDocs();
ScoreDoc[] hits = td.scoreDocs;
int docId = hits[1].doc;
Document document = searcher.doc(docId);
float score = hits[1].score / td.getMax
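The normalization discussed in this thread (divide each raw score by the maximum score) can be sketched without any Lucene classes. The ScoreNormalizer class and its normalize method are hypothetical names used only for illustration; in real code the maximum would come from the search results rather than being recomputed.

```java
// Illustrative only: scale raw scores so that the best hit maps to 1.0.
class ScoreNormalizer {

    // Returns a new array where each score is divided by the maximum raw score.
    // If there is no positive score, all normalized scores are left at 0.
    static float[] normalize(float[] rawScores) {
        float max = 0f;
        for (float s : rawScores) {
            max = Math.max(max, s);
        }
        float[] normalized = new float[rawScores.length];
        if (max <= 0f) {
            return normalized; // avoid dividing by zero
        }
        for (int i = 0; i < rawScores.length; i++) {
            normalized[i] = rawScores[i] / max;
        }
        return normalized;
    }

    public static void main(String[] args) {
        float[] normalized = normalize(new float[] {2.0f, 1.0f, 0.5f});
        for (float s : normalized) {
            System.out.println(s);
        }
    }
}
```

Note that scores normalized this way are only comparable within a single result set, since the maximum changes from query to query.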