From: "Mike Jackson" <[EMAIL PROTECTED]>
RANK  RULE NAME     COUNT  %OFRULES  %OFMAIL  %OFSPAM  %OFHAM
-------------------------------------------------------------
   1  HTML_MESSAGE  45870      5.13    27.72    70.37   55.36
Wait... so 27% of all mail is HTML, 70% of spam is HTML, and 55% of
ham is HTML?
That seems consistent with my numbers too. People like their pretty
email.
Maybe you'd better check your numbers again. :) I'm trying to point out
that these numbers imply a massive amount of "dark matter" email that is
neither spam, ham, nor HTML.
I'm guessing this is due to MAIL FROM commands that end up with all their
corresponding RCPT TO commands rejected. These never reach the DATA command,
so there's no message to spam-scan. If these are counted in %OFMAIL, but not
in %OFSPAM or %OFHAM, that would explain the apparent discrepancy.
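
To make the discrepancy concrete, here is a rough Python sketch. It is not
part of sa-stats, and the function names are made up for illustration. It
assumes every scanned message is either spam or ham, in which case the overall
HTML rate is a weighted average of the two class rates and has to fall between
them. It then assumes, as guessed above, that unscanned transactions are
counted in the %OFMAIL denominator but can never hit HTML_MESSAGE, and bounds
the share of counted mail that was actually scanned.

```python
def is_consistent(pct_mail, pct_spam, pct_ham):
    """True if the overall rate lies between the spam and ham rates.

    If mail = spam + ham, the overall rate is a weighted average of the
    two class rates, so it cannot fall outside that interval.
    """
    low, high = sorted((pct_spam, pct_ham))
    return low <= pct_mail <= high


def scanned_fraction_bounds(pct_mail, pct_spam, pct_ham):
    """Bound the fraction of counted mail that was actually scanned.

    Assumption (hypothetical, from the guess in this thread): unscanned
    messages are included in the %OFMAIL denominator and never match the
    rule. The lower bound is the all-spam case, the upper bound the
    all-ham case.
    """
    low, high = sorted((pct_spam, pct_ham))
    return pct_mail / high, pct_mail / low


if __name__ == "__main__":
    # Figures from the OP's HTML_MESSAGE row.
    print(is_consistent(27.72, 70.37, 55.36))
    print(scanned_fraction_bounds(27.72, 70.37, 55.36))
```

With the OP's figures the consistency check fails, and under the stated
assumption only about 39% to 50% of counted mail would have been scanned,
leaving roughly half or more as "dark matter".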
My sa-stats script produces output in a different format:
Total messages:    Ham:     Spam:
-------------------------------------
5408               3346     2062

Top ham rules:     Ham:     Spam:    % Ham:    % Spam:
----------------------------------------------------------------------
HTML_MESSAGE       762      1050     22.77     50.92
It looks like 33.5% of all email was HTML:
(762 + 1050) / 5408
Okay, so my numbers are radically different than the OP's, but I'd still say
that people like their pretty email.
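
For anyone who wants to check the arithmetic, a small Python sketch using only
the counts from the sa-stats output above (the variable names are mine, not
sa-stats'). It recomputes the per-class percentages and shows that the overall
HTML rate is just the ham and spam rates weighted by each class's share of the
total.

```python
# Counts taken from the sa-stats output quoted above.
ham_total, spam_total = 3346, 2062
ham_hits, spam_hits = 762, 1050


def pct(part, whole):
    """Return part/whole as a percentage."""
    return 100.0 * part / whole


pct_ham = pct(ham_hits, ham_total)      # HTML rate among ham
pct_spam = pct(spam_hits, spam_total)   # HTML rate among spam
pct_all = pct(ham_hits + spam_hits, ham_total + spam_total)

# Weighted average of the two class rates, weighted by class share.
ham_share = ham_total / (ham_total + spam_total)
weighted = ham_share * pct_ham + (1 - ham_share) * pct_spam

print(f"ham {pct_ham:.2f}%  spam {pct_spam:.2f}%  all {pct_all:.2f}%")
print(f"weighted average of class rates: {weighted:.2f}%")
```

Here the overall figure sits between the ham and spam rates, as it must when
every counted message is one or the other.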
Grasshopper, go and read up on the parallel resistor formula. Then
contemplate upon its possible application here. Enlightenment may follow.
{^_^}