On Wed, 2009-01-28 at 12:16 -0800, John Hardin wrote:
> On Wed, 28 Jan 2009, RW wrote:
> 
> > I was wondering if it's possible to control autolearning  based on rules.

No. And even tweaking the various thresholds will not help, since
auto-learning is based on the score *without* Bayes, etc.

> > I'm scoring DSPAM into Spamassassin, and since DSPAM autolearns 

By that you mean... Using the DSPAM plugin for SA? And the rule you want
to base auto-learning upon is the DSPAM plugin one?

> > everything, it would be sensible to synchronise the learning, so when I 
> > correct dspam the same set of mails need to be corrected in 
> > Spamassassin.
> 
> "correcting" implies a manual process, does it not? If you have a corpus 
> of mislearned messages you're feeding to DSPAM to correct it, can you not 
> do the same thing via sa-learn to retrain SA?

SA will not learn any message twice, unless the classification changed.
In that case it isn't necessary to forget the previous wrong learned
tokens either. So, yeah, I guess 'RW' could...


-- 
char *t="\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4";
main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1:
(c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}

Reply via email to