On 07/02/2014 08:00 AM, Steve Bergman wrote:


On 07/02/2014 12:52 AM, Axb wrote:
Site wide bayes works VERY well even under such ugly conditions as
traffic with multiple languages, for ham as well as spam.

Please tell me more.

This goes against Paul Graham's original advice, IIRC. And it goes
against intuition. Then again, Bayesian statistics go against intuition.

It's hard to let go and trust a system-wide Bayes. But I'm listening...

It works, trust me. SA's Bayes implementation is incredibly robust.


My site wide Bayes DB is not exactly small.

0.000          0   23850755          0  non-token data: nspam
0.000          0   10702302          0  non-token data: nham

Would I run a monster this size if it didn't work? Nope.

I waited a long time to be able to use something really 100% site wide (not per server), until we got the ability to use Redis, which is FAST, robust, and doesn't cause me headaches the way SQL, file permission issues, etc. do.

I can't give you a scientific reason for not using per-user Bayes.
Site wide works for my 2000+ corp domains, which include .tr, .ru, .cn, .ua, .es, .fr, .de plus a ton of other major ccTLD domains.

AND: I only run autolearn. NO manual/scheduled training.
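
For anyone wanting to try a similar arrangement, a minimal local.cf sketch of a Redis-backed Bayes store trained only by autolearn might look like the following. The server address, database number and TTL values are placeholder assumptions, not values taken from the setup described above, and the Redis backend requires SpamAssassin 3.4.0 or later.

```
# local.cf sketch: one shared Bayes DB in Redis, autolearn only.
# All values below are illustrative assumptions.
use_bayes           1
bayes_auto_learn    1

# Redis backend; it reuses the bayes_sql_dsn option for its connection string.
bayes_store_module  Mail::SpamAssassin::BayesStore::Redis
bayes_sql_dsn       server=127.0.0.1:6379;database=2

# Token expiry is handled through Redis TTLs.
bayes_token_ttl     21d
bayes_seen_ttl      8d
bayes_auto_expire   1
```

Pointing every scanning host at the same Redis instance is what would make the database site wide rather than per server. The nspam/nham counters quoted above are the kind of output reported by `sa-learn --dump magic`.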


