Re: Store/Index Email Address in Lucene

2008-07-06 Thread miztaken
Hi there, sorry for the delay.
> Q. Can there be multiple addresses in a single document?
A. Yes, there can be multiple addresses in a single document, in a single field.
> Q. Do you add any other data to the document that you mean to query for?
A. Yes, there can be other fields as well, if this is

Re: Store/Index Email Address in Lucene

2008-07-05 Thread Karl Wettin
On 5 Jul 2008 at 03:29, miztaken wrote: Hi there, for an email address string such as "[EMAIL PROTECTED]; jack smith" Can there be multiple addresses in a single document? Do you add any other data to the document that you mean to query for? I might do a wildcard search like john* or jack* or [

Re: Store/Index Email Address in Lucene

2008-07-04 Thread miztaken
Hi there, for an email address string such as "[EMAIL PROTECTED]; jack smith", I might do a wildcard search like john* or jack* or [EMAIL PROTECTED] for *wherever.com. For a phrase search I can do "jack smith". For a general search I might do "[EMAIL PROTECTED]". I tokenized the string and indexed it in
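
The wildcard cases described above (john*, jack*, *wherever.com) can be sketched outside Lucene as plain pattern matching over tokens. This is only an illustration, not Lucene's WildcardQuery: the address `john.doe@wherever.com` is a hypothetical stand-in for the obfuscated "[EMAIL PROTECTED]", and `tokenize` and `wildcard_search` are names made up for this sketch.

```python
from fnmatch import fnmatchcase


def tokenize(addresses: str) -> list[str]:
    """Lowercase the string and split it on ';' and whitespace."""
    return addresses.lower().replace(";", " ").split()


def wildcard_search(tokens: list[str], pattern: str) -> list[str]:
    """Return the tokens that match a shell-style wildcard pattern."""
    return [t for t in tokens if fnmatchcase(t, pattern.lower())]


# Hypothetical input; the real address in the thread is obfuscated.
tokens = tokenize("john.doe@wherever.com; jack smith")

print(wildcard_search(tokens, "john*"))
print(wildcard_search(tokens, "jack*"))
print(wildcard_search(tokens, "*wherever.com"))
```

Note that a leading wildcard such as *wherever.com has to scan every token here. In an index that cost is usually the reason to also store the domain as its own token.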

Re: Store/Index Email Address in Lucene

2008-07-04 Thread Karl Wettin
Please show us a couple of examples from the "various type of search" you want to be able to handle. The information you supply says nothing about your use cases. In what way do you want to use phrase queries on email addresses? Do you want to tokenize parts of a single email address? Or do

Re: Store/Index Email Address in Lucene

2008-07-03 Thread miztaken
Hi there, thanks for the comment. So basically it would be lame to add a new field for each email address, wouldn't it? How about getting unique tokens from the string of email addresses using the EmailFilter.java class and storing them in a single field? Jamie-52 wrote: > > Hi miztaken > > Check out: >
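
A minimal sketch of the "unique tokens in a single field" idea, assuming each address should be searchable as a whole, by its local part, and by its domain. This is not the logic of EmailFilter.java, which is not shown in this thread; `unique_email_tokens` and the sample addresses are hypothetical.

```python
def unique_email_tokens(addresses: str) -> list[str]:
    """Split a ';'-separated address string into unique, lowercased tokens.

    Each word is kept as a token. Words containing '@' also contribute
    their local part and their domain, so both can be searched directly.
    """
    tokens: dict[str, None] = {}  # dict keeps insertion order and drops repeats
    for word in addresses.lower().replace(";", " ").split():
        tokens[word] = None
        if "@" in word:
            local, _, domain = word.partition("@")
            for part in (local, domain):
                if part:
                    tokens[part] = None
    return list(tokens)


# Hypothetical input with a repeated address in different case.
field_value = " ".join(
    unique_email_tokens("john.doe@wherever.com; jack smith; JOHN.DOE@wherever.com")
)
print(field_value)
```

The joined string is what would go into the single field. Whether the analyzer used at index time keeps these tokens intact is a separate question.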

Re: Store/Index Email Address in Lucene

2008-07-03 Thread Jamie
Hi miztaken, check out: http://openmailarchiva.svn.sourceforge.net/viewvc/openmailarchiva/Server/trunk/src/com/stimulus/archiva/search/EmailFilter.java?view=markup I think it's what you want. I want to index email address in such a way that i can do WildCard, Phrase and Simple search on those it

RE: Store/Index Email Address in Lucene

2008-07-03 Thread John Griffin
Miz, the StandardAnalyzer recognizes email addresses as is. That is, it pays attention to the '@' symbol. Just store each email address in a field and search it normally. This assumes you are going to store the different emails in separate fields. There is an alternative strategy if you need it.