Michael J. Prichard wrote:
We are actually grabbing emails by becoming part of the SMTP stream.
This part is figured out and we have archived over 600k emails into a
mysql database. The problem is that since we currently store the
blobs in the DB this databases are getting large and searching
net/projects/ol2mbox), if they are 97-2002 format.
-Original Message-
From: Mike Streeton [mailto:[EMAIL PROTECTED]
Sent: 19 June 2006 08:33
To: java-user@lucene.apache.org
Subject: RE: indexing emails
When you talk about indexing emails are you indexing Outlook mails? We have
only foun
Michael J. Prichard wrote:
I am working on indexing emails and want to have a "to" field. I am
currently putting all the emails on one line seperated w/
spaces...example:
[EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED]
Then i index that with a StandardAnalyzer as follows:
doc.add(new
: As far as indexing goes index each address in a separate un-tokenized
: field not space delimited in a single field. It is also useful to put
: the To; CC and BCC in a single field to enable you to search to email
INdexing email isn't something i've had to think about a lot in my life ..
but if
and pick up all
emails in the thread.
Mike
-Original Message-
From: Michael Wechner [mailto:[EMAIL PROTECTED]
Sent: 19 June 2006 08:21
To: java-user@lucene.apache.org
Subject: Re: indexing emails
Rob Staveley (Tom) wrote:
> Having spent a lot of time getting this wrong myself in an
emails in the thread.
Mike
-Original Message-
From: Michael Wechner [mailto:[EMAIL PROTECTED]
Sent: 19 June 2006 08:21
To: java-user@lucene.apache.org
Subject: Re: indexing emails
Rob Staveley (Tom) wrote:
> Having spent a lot of time getting this wrong myself in an e-mail
> indexer
-
From: karl wettin [mailto:[EMAIL PROTECTED]
Sent: 16 June 2006 21:13
To: java-user@lucene.apache.org
Subject: Re: indexing emails
On Fri, 2006-06-16 at 15:20 -0400, Michael J. Prichard wrote:
I am working on indexing emails and want to have a "to" field. I am
currently putti
So I have emails with multiple recipients (of course, this is very
common). I currently put them all on the same string seperated by space
and then tokenize them with Standard Analyzer. I was looking into
SynonymAnalyzers and see that you can drop multiple tokens with the same
position. Woul
From: karl wettin [mailto:[EMAIL PROTECTED]
Sent: 16 June 2006 21:13
To: java-user@lucene.apache.org
Subject: Re: indexing emails
On Fri, 2006-06-16 at 15:20 -0400, Michael J. Prichard wrote:
I am working on indexing emails and want to have a "to" field. I am
currently putting all the
d last name still need a PrefixQuery or WildcardQuery, if
you want to search for last names, but it does make some queries possible
which would otherwise blow up.
-Original Message-
From: karl wettin [mailto:[EMAIL PROTECTED]
Sent: 16 June 2006 21:13
To: java-user@lucene.apache.org
Subje
On Fri, 2006-06-16 at 15:20 -0400, Michael J. Prichard wrote:
> I am working on indexing emails and want to have a "to" field. I am
> currently putting all the emails on one line seperated w/ spaces...example:
>
> [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED]
>
> Then i index that with
Peter, see this:
http://www.tropo.com/techno/java/lucene/imap.html
http://guests.evectors.it/zoe/
Otis
--- Peter Lewis <[EMAIL PROTECTED]> wrote:
> Hi all
>
> Has anyone got an example of how to index emails from Microsoft
> Exchange using Lucene?
>
> TIA
>
> William
>
>
>
12 matches
Mail list logo