>
> That is awesome, just one thing, and forgive me if i sound
> ignorant. What is
> "FastZemberek zemberek"?
Oh, it's just a copy-paste accident, delete it :)
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.o
That is awesome, just one thing, and forgive me if i sound ignorant. What is
"FastZemberek zemberek"?
Ahmet Arslan wrote:
>
>
>> public class CustomFilter extends TokenFilter
>> {
>> protected CustomFilter(TokenStream
>> tokenStream)
>> {
>> super(tokenStream);
>> }
>>
Hi,
> Here is the the one that uses new token stream api:
>
> public final class CustomFilter extends TokenFilter {
>
>private final TermAttribute termAtt;
>
> public CustomFilter(TokenStream in, FastZemberek zemberek) {
> super(in);
> termAtt = (TermAttribute) addAttrib
> public class CustomFilter extends TokenFilter
> {
> protected CustomFilter(TokenStream
> tokenStream)
> {
> super(tokenStream);
> }
> @Override
> public Token next(final Token reusableToken)
> throws IOException
> {
> Token nextToken =
> input.next(reusabl
Here is the solution. I used a CustomAnalyzer that calls CustomFilter.
Easy enough, but now if I want to use the current version of lucene, 3.0
these methods are no longer there. TokenStream.next() or
TokenStream.next(Token). In 2.9.0 these methods were deprecated as are
Token.setTermText()
; Sent: Saturday, December 12, 2009 5:27 AM
> Subject: Re: Lucene Analyzer that can handle C++ vs C#
>
>
> > What we did in DBSight is to provide a reserved list of words for every
> > Lucene Analyzer.
> > This way you can handle any special characters like C++ and C#.
How about KeywordAnalyzer? It will treat C++ and C# as single term.
Regards
Ganesh
- Original Message -
From: "Chris Lu"
To:
Sent: Saturday, December 12, 2009 5:27 AM
Subject: Re: Lucene Analyzer that can handle C++ vs C#
> What we did in DBSight is to provide a res
Mappingcharfilter may bring you some ideas
- 原始邮件 -
发件人: maxSchlein
发送时间: 2009年12月12日 星期六 1:09
收件人: java-user@lucene.apache.org
主题: Lucene Analyzer that can handle C++ vs C#
Can someone please point me in the right direction.
We are creating an application that needs to beable to
What we did in DBSight is to provide a reserved list of words for every
Lucene Analyzer.
This way you can handle any special characters like C++ and C#.
Any common analyzers usually are not suitable for these special words.
--
Chris Lu
-
Instant Scalable Full-Text Search
> Can someone please point me in the right direction.
>
> We are creating an application that needs to beable to
> search on C++ and get
> back doc's that have C++ in it. The StandardAnalyzer
> does not seem to index
> the "+", so a search for "C++" will bring back docs that
> contain, C++, C,
>
Can someone please point me in the right direction.
We are creating an application that needs to beable to search on C++ and get
back doc's that have C++ in it. The StandardAnalyzer does not seem to index
the "+", so a search for "C++" will bring back docs that contain, C++, C,
C#, etc. The
11 matches
Mail list logo