Re: Lucene Analyzer that can handle C++ vs C#

2009-12-24 Thread AHMET ARSLAN
> > That is awesome, just one thing, and forgive me if i sound > ignorant.  What is > "FastZemberek zemberek"? Oh, it's just a copy-paste accident, delete it :) - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.o

Re: Lucene Analyzer that can handle C++ vs C#

2009-12-24 Thread maxSchlein
That is awesome, just one thing, and forgive me if i sound ignorant. What is "FastZemberek zemberek"? Ahmet Arslan wrote: > > >> public class CustomFilter extends TokenFilter >> { >>     protected CustomFilter(TokenStream >> tokenStream) >>     { >>         super(tokenStream); >>     } >>    

RE: Lucene Analyzer that can handle C++ vs C#

2009-12-24 Thread Uwe Schindler
Hi, > Here is the the one that uses new token stream api: > > public final class CustomFilter extends TokenFilter { > >private final TermAttribute termAtt; > > public CustomFilter(TokenStream in, FastZemberek zemberek) { > super(in); > termAtt = (TermAttribute) addAttrib

Re: Lucene Analyzer that can handle C++ vs C#

2009-12-24 Thread AHMET ARSLAN
> public class CustomFilter extends TokenFilter > { >     protected CustomFilter(TokenStream > tokenStream) >     { >         super(tokenStream); >     } >     @Override >     public Token next(final Token reusableToken) > throws IOException >     { >         Token nextToken = > input.next(reusabl

Re: Lucene Analyzer that can handle C++ vs C#

2009-12-24 Thread maxSchlein
Here is the solution. I used a CustomAnalyzer that calls CustomFilter. Easy enough, but now if I want to use the current version of lucene, 3.0 these methods are no longer there. TokenStream.next() or TokenStream.next(Token). In 2.9.0 these methods were deprecated as are Token.setTermText()

Re: Lucene Analyzer that can handle C++ vs C#

2009-12-15 Thread Weiwei Wang
; Sent: Saturday, December 12, 2009 5:27 AM > Subject: Re: Lucene Analyzer that can handle C++ vs C# > > > > What we did in DBSight is to provide a reserved list of words for every > > Lucene Analyzer. > > This way you can handle any special characters like C++ and C#.

Re: Lucene Analyzer that can handle C++ vs C#

2009-12-15 Thread Ganesh
How about KeywordAnalyzer? It will treat C++ and C# as single term. Regards Ganesh - Original Message - From: "Chris Lu" To: Sent: Saturday, December 12, 2009 5:27 AM Subject: Re: Lucene Analyzer that can handle C++ vs C# > What we did in DBSight is to provide a res

答复: Lucene Analyzer that can handle C++ vs C#

2009-12-11 Thread 王巍巍
Mappingcharfilter may bring you some ideas - 原始邮件 - 发件人: maxSchlein 发送时间: 2009年12月12日 星期六 1:09 收件人: java-user@lucene.apache.org 主题: Lucene Analyzer that can handle C++ vs C# Can someone please point me in the right direction. We are creating an application that needs to beable to

Re: Lucene Analyzer that can handle C++ vs C#

2009-12-11 Thread Chris Lu
What we did in DBSight is to provide a reserved list of words for every Lucene Analyzer. This way you can handle any special characters like C++ and C#. Any common analyzers usually are not suitable for these special words. -- Chris Lu - Instant Scalable Full-Text Search

Re: Lucene Analyzer that can handle C++ vs C#

2009-12-11 Thread AHMET ARSLAN
> Can someone please point me in the right direction. > > We are creating an application that needs to beable to > search on C++ and get > back doc's that have C++ in it.  The StandardAnalyzer > does not seem to index > the "+", so a search for "C++" will bring back docs that > contain, C++, C, >

Lucene Analyzer that can handle C++ vs C#

2009-12-11 Thread maxSchlein
Can someone please point me in the right direction. We are creating an application that needs to beable to search on C++ and get back doc's that have C++ in it. The StandardAnalyzer does not seem to index the "+", so a search for "C++" will bring back docs that contain, C++, C, C#, etc. The