Re: Calling spamassassin directly yields very different results than calling spamassassin via amavis-new

John Hardin Wed, 16 Jan 2013 11:16:37 -0800

On Wed, 16 Jan 2013, Ben Johnson wrote:

On 1/16/2013 11:00 AM, John Hardin wrote:


That's odd. That suggests you SA wasn't looking up those DNSBLs, or they
would have contributed to the score.

Check your trusted networks setting. One difference between SMTP-time
and SA-time DNSBL checks is that SMTP-time checks the IP address of the
client talking to the MTA, while SA-time can go back up the relay chain
if necessary (e.g. to check the client IP submitting to your ISP if your
ISP's MTA is between your MTA and the Internet, rather than always
checking your ISP's MTA IP address).


Are you referring to SA's "trusted_networks" directive?


Yes.

If so, it is commented-out (presumably by default). Does this need to beset? I've read the info re: trusted_networks athttp://spamassassin.apache.org/full/3.3.x/doc/Mail_SpamAssassin_Conf.html, but I'm struggling to understand it.


It means "which MTAs are trusted to not forge Received headers".

There is a related one: internal_networks, which lists networks that areconsidered "internal" to your inbound mail topology. Sorry I missed thatone in my first message. This one you'd set if you were retrieving youremail from your ISP rather than directly exposing a MTA to the Internet.

If the info is helpful, I have a very simple setup here: a single server
with a single public IP address and a single MTA.

That's the assumed default environment. If you aren't explicitly settingtrusted_networks and internal_networks you should be okay.

That said, the Bayes scores seem to be much more accurate now, too. I
was hardly ever seeing BAYES_99 before, but now almost all spam messages
have BAYES_99.


Odd. SMTP-time hard rejects shouldn't change that.


That's what I figured. I wonder if feeding all of the messages that I
"auto-learned manually" -- messages that were tagged as spam (but for
reasons unrelated to Bayes) -- contributed significantly to this change.


Quite possibly.

That shouldn't be the case. SA and sa-learn both use a shared-access
database; if you're training the database that SA is learning, the
results of training should be effective immediately.


Okay, good. Bowie's response to this question differed (he suggested
that Amavis would need to be restarted for Bayes to be updated),

No, he didn't, he said that in a situation where you'd have to restartspamd, you instead need to restart amavisd. One such situation is afterrunning sa-update and getting updated rules.

but I'm pretty sure that restarting Amavis is not necessary. It seemsunlikely that Amavis would copy the entire Bayes DB (which is stored inMySQL on this server) into memory every time that the Amavis service isstarted. To do so seems self-defeating: more RAM usage, worseperformance, etc.


Right.

So, I emptied the Bayes DB and re-trained ham and spam on my hand-sorted
corpus. The net result was to discard all previous end-user training, if
I understand correctly.


That is correct.

Everything still looks good; mostly BAYES_99 on the messages that are
and should be marked as spam, and no false-positives at all.


yay!

I've disabled the Antispam plug-in for now, for the reasons we've
already discussed. I have asked the Dovecot mailing list for suggestions
regarding how best to pre-screen end-user training submissions.

I think I'm in pretty good shape here, unless setting trusted_networks
is a must, in which case I could use some guidance.


No, sounds like you're good for that.

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  It is criminal to teach a man not to defend himself when he is the
  constant victim of brutal attacks.              -- Malcolm X (1964)
-----------------------------------------------------------------------
 Tomorrow: Benjamin Franklin's 307th Birthday

Re: Calling spamassassin directly yields very different results than calling spamassassin via amavis-new

Reply via email to