I’m looking to get some more information on how reliable TextCat can be
considered at this point.

We are running 3.4.0, and have enabled TextCat with some more aggressive
scoring a few month ago based on user requests. For the most part, people
are very happy with this, we had some very bizarre spam that was sailing
through postscreen and spamass and this has taken care of that problem.

It has however introduced a new problem - false positives. I see a bunch of
my daily run cron outputs ending up in the spam box and we find users here
and there that find perfectly valid email (in their allowed languages)
tagged as another language.

Are there any other options for filtering based on language, or any known
patches/fixes for TextCat to make it a bit less aggressive when it runs
across gibberish that is probably not any particular language?

Thanks,

Charles

Reply via email to