Mark said:

> A few weeks ago I thought of an interesting new "statistical" way of
> fighting spam. Having collected a few weeks worth of maillog data (about 500
> MB), I wrote a small Perl script, matching IP addresses by the following
> simple rule (in words):
> 
> "Look at SMTP connections, and consider all IP addresses spam that, in one
> session, deliver to 4 or more local recipients simultaneously."
> 
> Naturally, I ran a post-process pass, so as to filter out white-list data
> (mailing lists and such). Based on the idea that, apart from mailing lists,
> there is hardly ever a decent reason for a remote host to deliver to more
> than 4 recipients simultaneously, the result was, indeed, astounding. :)
> Near 98% was pure spam.
> 
> So, I was wondering, regardless of this particular idea, have the SA
> developers ever seriously thought about doing maillog analysis? That may
> make things a bit platform/MTA dependent, but you really can learn a great
> deal about spammers based on "history" too, and not merely a per-message
> analysis.

The difficulty there is the manual whitelisting.  as far as I can see,
you're measuring bulkiness there; the hard part for a useful SpamAssassin
test is to come up with methods that can work without whitelisting.

--j.


-------------------------------------------------------
This SF.net email is sponsored by: eBay
Get office equipment for less on eBay!
http://adfarm.mediaplex.com/ad/ck/711-11697-6916-5
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to