Re: Customer TokenFilter

2010-05-27 Thread tsuraan
> Looks correct! Wrapping by CharBuffer is very intelligent! In Lucene 3.1 the > new Term Attribute will implement CharSequence, then its even simplier. You > may also look at 3.1's ICU contrib that has support even for Normalizer2. Ok, I've only been looking at 3.0.1 so far; I'll check out the 3.

RE: Customer TokenFilter

2010-05-27 Thread Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: tsuraan [mailto:tsur...@gmail.com] > Sent: Thursday, May 27, 2010 10:38 PM > To: java-user > Subject: Re: Customer TokenFilter > > > I'd like to

Re: Customer TokenFilter

2010-05-27 Thread tsuraan
> I'd like to have all my queries and terms run through Unicode > Normalization prior to being executed/indexed.  I've been using the > StandardAnalyzer with pretty good luck for the past few years, so I > think I'd like to write an analyzer that wraps that, and tacks a > custom TokenFilter onto th

Customer TokenFilter

2010-05-26 Thread tsuraan
I'd like to have all my queries and terms run through Unicode Normalization prior to being executed/indexed. I've been using the StandardAnalyzer with pretty good luck for the past few years, so I think I'd like to write an analyzer that wraps that, and tacks a custom TokenFilter onto the chain pr