On Sat, 16 May 2020 09:26:23 -0400 micah anderson wrote: > RW <rwmailli...@googlemail.com> writes:
> >> Would it be better to use it for mass-check and contribute some to > >> the overall rule scoring? > > > > If you use it for Bayes or mass-checks I'd suggest not relaxing any > > pre-SpamAssassin checks. Some people do that to keep the numbers up, > > but optimizing around spam that doesn't reach SpamAssassin seems > > like a bad idea to me. > > Each of the mails is 100% spam, or backscatter presumably. Just to be clear, by "pre-SpamAssassin checks" I meant checks that would be run on the MTA before mail is passed to SpamAssassin, e.g. IP blocklists, rDNS etc. > so what I'd like to do is have an > automated way to tune my rule scoring, or improve/add rules based on > what gets sent there. You can't expect to do that without a large ham corpus.