RE: Indexing puncutation

2005-06-29 Thread Chris Hostetter
Thomas" <[EMAIL PROTECTED]> : Reply-To: java-user@lucene.apache.org : To: java-user@lucene.apache.org : Subject: RE: Indexing puncutation : : Thanks for the advice. I have replaced punctuation before the index is : built and then queried on the same lack of punctuation. I had to create : a

RE: Indexing puncutation

2005-06-29 Thread Aigner, Thomas
punctuation but keep the contents the original. Tom -Original Message- From: Ken Krugler [mailto:[EMAIL PROTECTED] Sent: Wednesday, June 29, 2005 10:39 AM To: java-user@lucene.apache.org Subject: Re: Indexing puncutation >I do a vaguely similar thing; I have to strip accents f

Re: Indexing puncutation

2005-06-29 Thread Ken Krugler
I do a vaguely similar thing; I have to strip accents from characters such as e-acute out of both my input data and my incoming search queries to put them into a standard form. I do this with a custom TokenFilter subclass. I have an analyzer that includes this filter along with some of the s

Re: Indexing puncutation

2005-06-29 Thread Peter Pimley
I'm not sure how useful this reply is, but hey ;) me too! I do a vaguely similar thing; I have to strip accents from characters such as e-acute out of both my input data and my incoming search queries to put them into a standard form. I do this with a custom TokenFilter subclass. I have a

Re: Indexing puncutation

2005-06-28 Thread Erik Hatcher
On Jun 28, 2005, at 3:37 PM, Chris D wrote: Lastly, and someone should correct me if I'm wrong, but you should always use the same analyzer to create and to query the index. Otherwise queries that should return hits wont. For instance the following. The canoist paddles Could be indexed as [

Re: Indexing puncutation

2005-06-28 Thread Chris D
On 6/28/05, Aigner, Thomas <[EMAIL PROTECTED]> wrote: > Thanks for the info Chris. > > > > I'd thought I'd provide some more infomation. One problem is the > descriptions are not easily formatted. In other words, the description > doesn't follow a certain set of rules (num num - alpha alpha etc

RE: Indexing puncutation

2005-06-28 Thread Aigner, Thomas
g my SynonymnAnalyzer for some aliases to build the index and the SnowballAnalyzer to query the index (nice stemming in it) Tom -Original Message- From: Chris D [mailto:[EMAIL PROTECTED] Sent: Tuesday, June 28, 2005 2:41 PM To: java-user@lucene.apache.org Subject: Re: Indexing puncutati

Re: Indexing puncutation

2005-06-28 Thread Chris D
On 6/28/05, Aigner, Thomas <[EMAIL PROTECTED]> wrote: > Hello all, > > I am VERY new to Lucene and we are trying out Lucene to see if > it will accomplish the vast majority of our search functions. > > I have a question about a good way to index some of our product > description c