I'm wondering if your date field's precision may be a little too
much? What I mean is that you are going all the way down to
seconds. Whenever you do a range query you are essentially spawning
a BooleanQuery with a representation of that range. Do you really
need to be that precise? I u
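To make the precision point concrete, here is a rough sketch in plain Java (no Lucene on the classpath) that counts how many distinct terms a one-week range would cover at second versus day resolution. The class name, the method, and the one-message-per-minute workload are all made up for illustration. In Lucene itself, something like DateTools.dateToString(date, DateTools.Resolution.DAY) is one way to index at the coarser resolution.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.time.temporal.ChronoUnit;
import java.util.TreeSet;

public class DatePrecisionSketch {
    // Term formats at two resolutions (illustrative, not Lucene's own encoding).
    static final DateTimeFormatter SECOND = DateTimeFormatter.ofPattern("yyyyMMddHHmmss");
    static final DateTimeFormatter DAY = DateTimeFormatter.ofPattern("yyyyMMdd");

    // Counts the distinct terms produced by one timestamp per `step`
    // in the half-open interval [start, end), formatted with `fmt`.
    static int distinctTerms(LocalDateTime start, LocalDateTime end,
                             ChronoUnit step, DateTimeFormatter fmt) {
        TreeSet<String> terms = new TreeSet<>();
        for (LocalDateTime t = start; t.isBefore(end); t = t.plus(1, step)) {
            terms.add(t.format(fmt));
        }
        return terms.size();
    }

    public static void main(String[] args) {
        // Assumed workload: one message per minute for a week.
        LocalDateTime start = LocalDateTime.of(2008, 1, 1, 0, 0, 0);
        LocalDateTime end = start.plusDays(7);
        System.out.println(distinctTerms(start, end, ChronoUnit.MINUTES, SECOND)); // 10080
        System.out.println(distinctTerms(start, end, ChronoUnit.MINUTES, DAY));    // 7
    }
}
```

Each distinct term in the range becomes roughly one clause in the expanded query, so the coarser resolution keeps the query far smaller.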
I would say Lucene is capable of helping you do this. Remember
that it is a set of libraries and you have to build the functionality
you need with that. BUT with a little planning and elbow grease you
will be able to use it to create a great search engine.
Good luck.
On Jan 29, 2008,
Say I have a field of To addresses from an email archive. I do a search and I
get 10 To addresses for a single hit. Then I want to find similar email with
the To addresses containing roughly 75% of those email addresses as well. How
would I do this?
In other words:
I get a result with:
To:
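One possible way to think about the "roughly 75% of the To addresses" requirement is as a minimum-overlap check. The sketch below is plain Java with invented names and example.com addresses, and it is only an illustration of the arithmetic. Inside Lucene, a BooleanQuery of optional term clauses with setMinimumNumberShouldMatch(...) set to the computed threshold might be one way to express the same idea.

```java
import java.util.HashSet;
import java.util.Set;

public class RecipientOverlapSketch {
    // Minimum number of source addresses a candidate must share,
    // rounded up so that 75% of 10 addresses means at least 8.
    static int minimumShouldMatch(int sourceCount, double fraction) {
        return (int) Math.ceil(sourceCount * fraction);
    }

    // True when the candidate contains at least `fraction` of the source addresses.
    static boolean isSimilar(Set<String> source, Set<String> candidate, double fraction) {
        Set<String> shared = new HashSet<>(source);
        shared.retainAll(candidate); // keep only addresses present in both sets
        return shared.size() >= minimumShouldMatch(source.size(), fraction);
    }

    public static void main(String[] args) {
        // Hypothetical data: ten recipients on the original hit.
        Set<String> source = new HashSet<>();
        for (int i = 0; i < 10; i++) {
            source.add("user" + i + "@example.com");
        }
        // A candidate email that shares eight of the ten recipients.
        Set<String> candidate = new HashSet<>();
        for (int i = 0; i < 8; i++) {
            candidate.add("user" + i + "@example.com");
        }
        System.out.println(minimumShouldMatch(source.size(), 0.75)); // 8
        System.out.println(isSimilar(source, candidate, 0.75));      // true
    }
}
```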
I have highlighter working out of the box but now I need to highlight
wildcards (if they are there). I would imagine someone has done this
on this group :) Any suggestions?
Thanks!
Michael
-
To unsubscribe, e-mail: [EMAIL
When I run through and delete a few documents from my index, is it
wise to call .flush() afterwards? Or is it better to close the index?
Thanks!
Michael
rong field, that not only
does not exist yet but will never exist. To prevent hidden bugs in
cases like this, it is important to throw an exception.
Do you agree?
On Jan 8, 2008 11:21 PM, Michael Prichard
<[EMAIL PROTECTED]> wrote:
Meaning that it says "field is not indexed". Where is
the sortMissingLast attribute?
thanks.
On Jan 8, 2008, at 4:13 PM, Ryan McKinley wrote:
what do you mean by "fail"? -- there is the sortMissingLast attribute
Michael Prichard wrote:
OK... I should read the manual more.
Yet I run a sort on
BCC. I think it fails now... any way around that?
On Jan 8, 2008, at 1:22 PM, Michael Prichard wrote:
Is it possible to sort on a tokenized field? For example, I break
email address into pieces, i.e.
[EMAIL PROTECTED]
becomes
[EMAIL PROTECTED]
michael.prichard
michael
prichard
email.com
email
so when sorting on this field I get some strange results. Do I need
to create another field
out it.
-Grant
On Nov 8, 2007, at 8:37 AM, Michael Prichard wrote:
Hello,
I know this has gone around a bit but anyone had any success with
pulling text from Office 2007 files? Any recommendations?
Thanks,
Michael
So I should create a single IndexWriter but allow multiple threads to
use it?
On Oct 30, 2007, at 12:49 AM, Karl Wettin wrote:
30 okt 2007 kl. 05.08 skrev Michael Prichard:
If I multithread an indexing process can two or more processes
write to the same index?
Multiple threads can add
The subject pretty much describes the issue. If I multithread an
indexing process can two or more processes write to the same index?
My knowledge of Lucene says no but I am hoping that I am missing
something. Also, what is a good solution?
Thanks.
Michael
--
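For what the "one writer, many threads" pattern from this thread might look like, here is a small sketch in plain Java. StandInWriter is a hypothetical placeholder, not Lucene's IndexWriter. The assumption is that a single thread-safe writer instance is created once and shared by every worker thread, rather than each thread opening its own writer on the same index.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class SharedWriterSketch {
    // Hypothetical stand-in for a thread-safe index writer.
    static class StandInWriter {
        private final List<String> docs = Collections.synchronizedList(new ArrayList<>());
        void addDocument(String doc) { docs.add(doc); }
        int docCount() { return docs.size(); }
    }

    // Indexes docsPerThread documents from each of `threads` workers
    // through ONE shared writer and returns the final document count.
    static int indexWithThreads(int threads, int docsPerThread) {
        StandInWriter writer = new StandInWriter(); // single writer for the whole index
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        for (int t = 0; t < threads; t++) {
            final int id = t;
            pool.submit(() -> {
                for (int i = 0; i < docsPerThread; i++) {
                    writer.addDocument("doc-" + id + "-" + i);
                }
            });
        }
        pool.shutdown(); // no new tasks; let the queued ones finish
        try {
            if (!pool.awaitTermination(30, TimeUnit.SECONDS)) {
                throw new IllegalStateException("indexing timed out");
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while indexing", e);
        }
        return writer.docCount();
    }

    public static void main(String[] args) {
        System.out.println(indexWithThreads(4, 250)); // 1000
    }
}
```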
I know how to do English text with POI and PDFBox and so on. Now, I want to
start indexing non-English languages such as French and Spanish. Which
extraction libs are available for me?
I want to do:
Excel
Word
PowerPoint
PDF
HTML
RTF
Thanks!
Michael
--
://wiki.apache.org/jakarta-lucene/LuceneFAQ#head-48921635adf2c968f7936dc07d51dfb40d638b82
-Original Message-
From: Michael Prichard [mailto:[EMAIL PROTECTED]
Sent: Wednesday, February 14, 2007 5:02 PM
To: java-user@lucene.apache.org
Subject: Too many open files?!
I am getting this exception:
Exception in thread "main" java.io.FileNotFoundException: /index/_gna.f13 (Too
many open files)
This is happening on a SLES10 (64-bit) box when trying to index 18k items.
I can run it on a much lesser SLES9 box without any issues.
Any ideas?!
Thanks,
Michael
---
ARRRGH!!! That's it. Darn, I was half asleep last night when I was
experimenting. I totally feel like a dope.
It works. Thanks!
-Michael
On Thursday, July 20, 2006, at 00:36AM, Doron Cohen <[EMAIL PROTECTED]> wrote:
>> doc.add(new Field("to",
>> "[EMAIL PROTECTED]",
>> ...
>> PrefixQu
Tell me I am totally missing something here
I created an index w/ StandardAnalyzer with two fields as follows:
Document doc = new Document();
doc.add(new Field("to", "[EMAIL PROTECTED]", Field.Store.YES,
Field.Index.TOKENIZED));
doc.add(new Field("content", "blah3 blah3 blah3", Field.Store.Y
Not sure if anyone out there is doing this, has thought about doing this, or is
just plain curious. I want to figure out a way to build a search/rule GUI where
the user can build searches much like building rules in some mail clients such
as Outlook or Thunderbird. For example, the user could use dro
19 matches