Re: indexing emails

2006-06-19 Thread John Haxby
Michael J. Prichard wrote: We are actually grabbing emails by becoming part of the SMTP stream. This part is figured out and we have archived over 600k emails into a MySQL database. The problem is that since we currently store the blobs in the DB these databases are getting large and searching

Re: indexing emails

2006-06-19 Thread Michael J. Prichard
net/projects/ol2mbox), if they are 97-2002 format. -Original Message- From: Mike Streeton [mailto:[EMAIL PROTECTED] Sent: 19 June 2006 08:33 To: java-user@lucene.apache.org Subject: RE: indexing emails When you talk about indexing emails are you indexing Outlook mails? We have only foun

Re: indexing emails

2006-06-19 Thread John Haxby
Michael J. Prichard wrote: I am working on indexing emails and want to have a "to" field. I am currently putting all the emails on one line separated with spaces, for example: [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] Then I index that with a StandardAnalyzer as follows: doc.add(new

RE: indexing emails

2006-06-19 Thread Chris Hostetter
: As far as indexing goes, index each address in a separate un-tokenized : field, not space delimited in a single field. It is also useful to put : the To, CC and BCC in a single field to enable you to search to email Indexing email isn't something I've had to think about a lot in my life .. but if
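
The quoted advice (one un-tokenized value per address, plus a combined field covering To, CC and BCC) can be sketched without any Lucene dependency. This is only an illustration under stated assumptions: the class `RecipientFields`, the method `build`, and the combined field name `recipient` are hypothetical and do not come from the thread. In Lucene of that period, each value in the resulting map would presumably be added to the document as its own `Field` with `Field.Index.UN_TOKENIZED`.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical helper, not part of Lucene: plans the field values for one email. */
public class RecipientFields {

    /**
     * Returns field name -> values, with one value per address.
     * Every address is also copied into a combined "recipient" field
     * (an assumed name) so a single query can cover To, CC and BCC.
     */
    public static Map<String, List<String>> build(
            List<String> to, List<String> cc, List<String> bcc) {
        Map<String, List<String>> fields = new LinkedHashMap<>();
        addAll(fields, "to", to);
        addAll(fields, "cc", cc);
        addAll(fields, "bcc", bcc);
        return fields;
    }

    private static void addAll(
            Map<String, List<String>> fields, String name, List<String> addresses) {
        for (String raw : addresses) {
            // Normalize once, because an un-tokenized field is matched as an exact term.
            String address = raw.trim().toLowerCase(Locale.ROOT);
            if (address.isEmpty()) {
                continue;
            }
            fields.computeIfAbsent(name, k -> new ArrayList<>()).add(address);
            fields.computeIfAbsent("recipient", k -> new ArrayList<>()).add(address);
        }
    }
}
```

Lowercasing at index time is a design choice of this sketch: since no analyzer touches an un-tokenized value, the same normalization would have to be applied to query terms.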

RE: indexing emails

2006-06-19 Thread Rob Staveley (Tom)
and pick up all emails in the thread. Mike -Original Message- From: Michael Wechner [mailto:[EMAIL PROTECTED] Sent: 19 June 2006 08:21 To: java-user@lucene.apache.org Subject: Re: indexing emails Rob Staveley (Tom) wrote: > Having spent a lot of time getting this wrong myself in an

RE: indexing emails

2006-06-19 Thread Mike Streeton
emails in the thread. Mike -Original Message- From: Michael Wechner [mailto:[EMAIL PROTECTED] Sent: 19 June 2006 08:21 To: java-user@lucene.apache.org Subject: Re: indexing emails Rob Staveley (Tom) wrote: > Having spent a lot of time getting this wrong myself in an e-mail > indexer

Re: indexing emails

2006-06-19 Thread Michael Wechner
- From: karl wettin [mailto:[EMAIL PROTECTED] Sent: 16 June 2006 21:13 To: java-user@lucene.apache.org Subject: Re: indexing emails On Fri, 2006-06-16 at 15:20 -0400, Michael J. Prichard wrote: I am working on indexing emails and want to have a "to" field. I am currently putti

Re: indexing emails --> multiple "to" emails, setting position same

2006-06-18 Thread Michael J. Prichard
So I have emails with multiple recipients (of course, this is very common). I currently put them all in the same string separated by spaces and then tokenize them with StandardAnalyzer. I was looking into SynonymAnalyzers and see that you can drop multiple tokens with the same position. Woul
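
The idea being asked about, several tokens sharing one position, can be illustrated with a small library-free sketch. It only models the mechanism: the first address advances the position by one and every later address uses a position increment of zero, which is how stacked synonym tokens are usually represented. The class `SamePositionTokens` and its `Tok` type are hypothetical names for this illustration, and the sketch takes no stance on whether this approach suits the poster's case.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical model of tokens stacked at one position; not Lucene code. */
public class SamePositionTokens {

    /** A token plus the increment that places it relative to the previous token. */
    public static final class Tok {
        public final String text;
        public final int positionIncrement;

        public Tok(String text, int positionIncrement) {
            this.text = text;
            this.positionIncrement = positionIncrement;
        }
    }

    /** Splits on whitespace; the first token gets increment 1, the rest 0. */
    public static List<Tok> tokenize(String recipients) {
        List<Tok> out = new ArrayList<>();
        for (String part : recipients.trim().split("\\s+")) {
            if (part.isEmpty()) {
                continue;
            }
            out.add(new Tok(part, out.isEmpty() ? 1 : 0));
        }
        return out;
    }

    /** Resolves increments to absolute positions, starting from -1 before the first token. */
    public static int[] positions(List<Tok> tokens) {
        int[] result = new int[tokens.size()];
        int current = -1;
        for (int i = 0; i < tokens.size(); i++) {
            current += tokens.get(i).positionIncrement;
            result[i] = current;
        }
        return result;
    }
}
```

With three addresses in the input, all three tokens resolve to position 0 in this model.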

Re: indexing emails

2006-06-18 Thread Michael J. Prichard
From: karl wettin [mailto:[EMAIL PROTECTED] Sent: 16 June 2006 21:13 To: java-user@lucene.apache.org Subject: Re: indexing emails On Fri, 2006-06-16 at 15:20 -0400, Michael J. Prichard wrote: I am working on indexing emails and want to have a "to" field. I am currently putting all the

RE: indexing emails

2006-06-17 Thread Rob Staveley (Tom)
d last name still need a PrefixQuery or WildcardQuery, if you want to search for last names, but it does make some queries possible which would otherwise blow up. -Original Message- From: karl wettin [mailto:[EMAIL PROTECTED] Sent: 16 June 2006 21:13 To: java-user@lucene.apache.org Subje

Re: indexing emails

2006-06-16 Thread karl wettin
On Fri, 2006-06-16 at 15:20 -0400, Michael J. Prichard wrote: > I am working on indexing emails and want to have a "to" field. I am > currently putting all the emails on one line separated with spaces, for example: > > [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] > > Then I index that with

Re: Indexing emails from Exchange

2005-06-22 Thread Otis Gospodnetic
Peter, see this: http://www.tropo.com/techno/java/lucene/imap.html http://guests.evectors.it/zoe/ Otis --- Peter Lewis <[EMAIL PROTECTED]> wrote: > Hi all > > Has anyone got an example of how to index emails from Microsoft > Exchange using Lucene? > > TIA > > William > > >