Re: How to reject mails with special message-id (Debian, Amavis, Spamassassin)

Joe Quinn Tue, 20 Sep 2016 09:09:40 -0700

On 9/20/2016 9:46 AM, Thomas Barth wrote:

Am 20.09.2016 um 15:27 schrieb Bowie Bailey:
X-Spam-Status: Yes, score=14.009 tag=2 tag2=6.31 kill=6.31
        tests=[HTML_MESSAGE=0.001, MESSAGEID_LOCAL=8,
MIME_HTML_ONLY=1.105,
        PYZOR_CHECK=1.985, RCVD_IN_BRBL_LASTEXT=1.644, RDNS_NONE=1.274]
        autolearn=no autolearn_force=no
The base SA ruleset is optimized to detect spam with a score of 5.0.  If
you raise that score, you will allow more spam to come through. If you
lower that score, you will see more legitimate messages blocked as
spam. Make sure you know what you are doing before you change thisscore.
I read that 5.0 is aggressive and suitable for single user setup,conservative values are 8.0 or 11.0.
required_score n.nn (default: 5)
https://spamassassin.apache.org/full/3.2.x/doc/Mail_SpamAssassin_Conf.html
I ve checked most of the mails recognized as spam. The lowest scorewas 8.6x so far.
Here is another mail from ...local. It definitely was spam with zipattachment. Common is a sender address with digits.<wynn.54...@allfromboats.com> -> <tba...@txbweb.de>, quarantine:l/spam-lEHVGcheLkyq.gz, Message-ID:<20160920202635.6b90ec7...@allfromboats.com.local>, mail_id:lEHVGcheLkyq, Hits: 19.118
May be I also should block sender adresses with more than 2 digits inthe name?

My experience has been that spam scoring gets error-dominated prettyrapidly outside the range near 5.0. That is to say, the difference inactual spamminess between messages scored 4 and 6 is far morepredictable and significant than between -1 and 1, or 10 and 12. Even ascore of 8.0 I would expect to take months of tuning to get right,between rescoring rules and RBLs appropriately and then giving the bayesthresholds accurate scores on top of that. The furthest I would probablygo is 4.5 to 6.0. Outside that range, it's easy to run intounpredictable "why was this spam blocked and that spam wasn't" scenarios.

Many of the stock published rules are scored by AI, which runs anoptimization problem to get the most spam on the right side of 5.0 andthe most ham on the left side. For the purposes of solving that problem,the difference between a message scoring 4.8 and 4.9 is the same as thedifference between 4.0 and 4.9, or -50 and 4.9. Developers smooth outthe scoring curve by determining what rules the AI gets to score and forhow much, but that effect is strongest where we can quantify itsusefulness (near the default threshold).


Bayes is scored with a similar consideration, built around probability.

Re: How to reject mails with special message-id (Debian, Amavis, Spamassassin)

Reply via email to