date:20160314

Re: Problem with porter stemming

2016-03-14 Thread Benson Margulies

Stemming is an inherently limited process. It doesn't know about the word 'news', it just has a rule about 's'. Some of us sell commercial products that do more complex linguistic processing that knows about which words are which. There may be open source implementations of similar technology.

Re: Problem with porter stemming

2016-03-14 Thread Ahmet Arslan

Hi Dwaipayan, Another way is to use KeywordMarkerFilter. Stemmer implementations respect this attribute. If you want to supply your own mappings, StemmerOverrideTokenFilter could be used as well. ahmet On Monday, March 14, 2016 4:31 PM, Dwaipayan Roy wrote: I am using EnglishAnalyzer wi

RE: Problem with porter stemming

2016-03-14 Thread Markus Jelsma

Hi - if you don't want specific words passed through a stemmer, you need to supply a CharArraySet with exclusions as the second argument to its constructor. Markus -Original message- > From:Dwaipayan Roy > Sent: Monday 14th March 2016 15:31 > To: java-user@lucene.apache.org > Subject: Pr

Problem with porter stemming

2016-03-14 Thread Dwaipayan Roy

I am using EnglishAnalyzer with my own stopword list. EnglishAnalyzer uses the porter stemmer (snowball) to stem the words. But using the EnglishAnalyzer, I am getting erroneous result for 'news'. 'news' is getting stemmed into 'new'. Any help would be appreciated.

Re: Problem with porter stemming

Re: Problem with porter stemming

RE: Problem with porter stemming

Problem with porter stemming

4 matches

Site Navigation

Mail list logo

Footer information