Hi
I am working on using Arabic Stemmer 
https://lucene.apache.org/core/3_6_0/api/all/org/apache/lucene/analysis/ar/ArabicStemmer.html
in suffixes there is a character  THE_MARBUTA (\u0629)
when this Stemmer applies stemSuffix it will remove THE_MARBUTA(ة) which will 
change some words for example car will be changed with mobile (سيارة => سيار) 
truck will be changed to charger (شاحنة => شاحن)
My question is. Is this a correct behavior or it is bug
Thanks
Suleman Mubarik | Software Development Engineer | SDL | smuba...@sdl.com

</pre><font face="arial" size="2" color="#736F6E">



<a 
href="http://www.sdl.com/?utm_source=Email&utm_medium=Email%2BSignature&utm_campaign=SDL%2BStandard%2BEmail%2BSignature";>
<img src="http://www.sdl.com/Content/images/SDLlogo2014.png"; 
border=0><br><br>www.sdl.com
</a><br><br>

<font face="arial" size="1" color="#736F6E">

<b>SDL PLC confidential, all rights reserved.</b>

If you are not the intended recipient of this mail SDL requests and requires 
that you delete it without acting upon or copying any of its contents, 
and we further request that you advise us.<BR><BR>
SDL PLC is a public limited company registered in England and Wales.  
Registered number: 02675207.

<br>

Registered address: Globe House, Clivemont Road, Maidenhead, Berkshire SL6 7DY, 
UK.</font>


This message has been scanned for malware by Websense. www.websense.com

Reply via email to