Re: Autolearning from rules rather than score

Karsten Bräckelmann Wed, 28 Jan 2009 13:03:31 -0800

On Wed, 2009-01-28 at 12:16 -0800, John Hardin wrote:
> On Wed, 28 Jan 2009, RW wrote:
> 
> > I was wondering if it's possible to control autolearning based on rules.


No. And even tweaking the various thresholds will not help, since
auto-learning is based on the score *without* Bayes, etc.
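
To illustrate the point about the score *without* Bayes, here is a rough toy model in Python. It is only a sketch, not SA's actual code: the function name and the HYPOTHETICAL_* rule names and scores are made up, the two thresholds are meant to mirror the default bayes_auto_learn_threshold_spam and bayes_auto_learn_threshold_nonspam values, and the real plugin applies further conditions that are left out here.

```python
# Toy model only. Rule names and scores are invented for illustration;
# the real auto-learn check has additional conditions not shown here.
SPAM_THRESHOLD = 12.0  # assumed default of bayes_auto_learn_threshold_spam
HAM_THRESHOLD = 0.1    # assumed default of bayes_auto_learn_threshold_nonspam


def autolearn_decision(hits, excluded_prefixes=("BAYES_",)):
    """Return 'spam', 'ham' or None, judging by the score without Bayes rules.

    hits maps rule names to the score each rule contributed.
    """
    # Sum every hit except the Bayes rules themselves.
    score = sum(
        points
        for name, points in hits.items()
        if not name.startswith(excluded_prefixes)
    )
    if score >= SPAM_THRESHOLD:
        return "spam"
    if score < HAM_THRESHOLD:
        return "ham"
    return None  # in between: the message is not auto-learned


# Full score is 12.5, but only 9.0 of it counts for auto-learning.
hits = {"BAYES_99": 3.5, "HYPOTHETICAL_RULE_A": 5.0, "HYPOTHETICAL_RULE_B": 4.0}
print(autolearn_decision(hits))
```

In this model the message scores above the spam auto-learn threshold overall, yet is not learned, because the Bayes contribution is left out of the sum.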

> > I'm scoring DSPAM into Spamassassin, and since DSPAM autolearns 

By that you mean... Using the DSPAM plugin for SA? And the rule you want
to base auto-learning upon is the DSPAM plugin one?

> > everything, it would be sensible to synchronise the learning, so when I 
> > correct dspam the same set of mails need to be corrected in 
> > Spamassassin.
> 
> "correcting" implies a manual process, does it not? If you have a corpus 
> of mislearned messages you're feeding to DSPAM to correct it, can you not 
> do the same thing via sa-learn to retrain SA?

SA will not learn any message twice unless its classification has
changed. In that case there is no need to explicitly forget the
previously mislearned tokens first, either. So, yeah, I guess 'RW' could...
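
As a minimal sketch of that retraining step, assuming the corrected messages already sit in a directory or mbox of their own, a small Python wrapper around sa-learn might look like this. The helper names and the paths are hypothetical; the only real interface used is sa-learn with its --spam and --ham options.

```python
import subprocess


def sa_learn_command(kind, path):
    """Build the sa-learn command line for relearning messages as spam or ham."""
    if kind not in ("spam", "ham"):
        raise ValueError("kind must be 'spam' or 'ham'")
    return ["sa-learn", "--" + kind, path]


def relearn(kind, path):
    """Run sa-learn on path; raises CalledProcessError if sa-learn fails."""
    return subprocess.run(sa_learn_command(kind, path), check=True)


# Hypothetical location of the messages that were corrected in DSPAM.
print(sa_learn_command("spam", "/path/to/corrected-spam"))
```

Calling relearn("ham", ...) on the false positives and relearn("spam", ...) on the missed spam would cover the same set of corrections on the SA side.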


-- 
char *t="\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4";
main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1:
(c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}
