On Tue, 18 Jul 2006, Dirk Bonengel wrote:
> did you investigate auto-learning? This might let your system learn ham 
> as well as spam. Works fine here (same situation  - gateway server to a 
> Lotus Notes system, no feedback loop possible)

May be I should change the threshholds for autolearning
different from the default? (I never touched them so far).
I just found *lots* 'autolearn=ham' in my log,
and I can not believe that so many are correct.

Out of the current log I see Mail classified as
   21805 ham
   11493 autolearned as ham   (this seems suspiciously high?)
   85963 spam
   52977 autolearned as spam

So I fear the 'skew' in my database comes form autoloearning
'bayes-fodder' of spammers and not fron 'skewed explicite learning'.

WHat may make it even worse is, that 'inhouse mail==ham' is
never learned, because it's never spamchecked (users did complain
too much about the slowdown, so only the 'outside' goes through the
Spamfilter).

Stucki

-- 
Christoph von Stuckrad      * * |nickname |<[EMAIL PROTECTED]>   \
Freie Universitaet Berlin   |/_*|'stucki' |Tel(days):+49 30 838-5 57 78|
Mathematik & Informatik EDV |\ *|if online|Tel(else):+49 30 77 39 66 00|
Arnimallee 6 / 14195 Berlin * * |on IRCnet|Fax(alle):+49 30 838-75 454/

Reply via email to