On Tue, Jul 13, 2010 at 07:35:36PM -0500, Chris Owen wrote: > On Jul 13, 2010, at 7:32 PM, Jason Haar wrote: > > > For some weird reason I seem to get a lot of Chinese spam - and even > > with TextCat enabled, SA is unable to recognise it as Chinese (ie I want > > to score on X-Spam-Languages:). I've Googled around and it looks like > > TextCat ceased development some time ago, so I was wondering if there is > > any known alternative that is more capable? > > Well according to the TextCat web site: > > http://www.let.rug.nl/~vannoord/TextCat/competitors.html
It's more of the implementation that needs an update than TextCat algorithm itself. Charset/case awareness: https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6229 Better database: https://issues.apache.org/SpamAssassin/show_bug.cgi?id=4152 Etc.. feel free to chime in..