Re: Stripping plurals in Spanish

Rafael Capdevielle Fri, 26 Feb 2010 05:15:09 -0800

Thank you very much Andi! it work like a charm!

Thanks!


Rafa

On Fri, Feb 26, 2010 at 12:22 AM, Andi Vajda <va...@apache.org> wrote:

>
> On Tue, 23 Feb 2010, Rafael Capdevielle wrote:
>
>  I want to ask if there is any predefine way to strip plurals for spanish
>> language.
>>
>> I already have a StopSpanishAnalyzer for Stop words:
>>
>> class StopSpanishAnalyzer(lucene.
>> PythonAnalyzer):
>>
>>   def __init__(self, stopWords):
>>       super(StopSpanishAnalyzer, self).__init__()
>>       self.stopWords = stopWords
>>
>>   def tokenStream(self, fieldName, reader):
>>       result = lucene.StandardTokenizer(reader)
>>       result = lucene.LowerCaseFilter(result)
>>       result = lucene.StopFilter(result, self.stopWords)
>>       result = lucene.ISOLatin1AccentFilter(result)
>>
>
> Yes, add something like:
>
>         result = lucene.SnowballFilter(result, lucene.SpanishStemmer())
>
> The Snowball contrib is normally built along with PyLucene.
>
> Andi..
>
>
>        return result
>>
>> is there any way to modify this Analyzer to use the SpanishStemer or
>> another
>> solution to strip plurals in order to improved accuracy of queries?
>>
>> Thanks!
>>
>> Rafael
>>
>>

Re: Stripping plurals in Spanish

Reply via email to