Yay, a long-ish post. But I believe it's worth it. On Tue, 2009-05-12 at 13:14 -0700, an anonymous Nabble user wrote: > Karsten Bräckelmann wrote:
> > The problem is with the design itself. Only the real sender can and will > > confirm. The challenge to the *forged* sender of spam will not be > > responded to. Good for you, bad for everyone else. You are sending > > backscatter! Spammers are using the very same addresses the are spamming > > as the sender. Thus your glorious solution to finally end the spam once > > and forever is SENDING SPAM to innocent humans, bystanders, mirroring > > your spam to them. > > No more than the standard backscatter you get when someone has joe-jobbed > your email address. Every couple of months, for a day or two, I'll get > 400-500 bounces a day sent from my hijacked address. See, that's exactly the point. What is that backscatter? Badly setup SMTPs, first accepting the mail, then checking and bouncing spam. And any type of C/R verification and other auto-responders. Every single newly setup C/R system will add to the backscatter load. The more folks argue "hey, others are backscattering C/Rs, so why shouldn't I" the worse it gets. And yes, it *adds* to the noise. > > Seems to explain why you're using Nabble instead of subscribing to the > > mailing list. You do not want my email. > > But it'd be whitelisted, I think. And the only reason I use nabble is that Whitelisted? Maybe. If you do it manually. Well, you need to COI your subscription request, so there'd be the first hurdle. But what about off-list hints I'd prefer to sent privately? > often technical lists are full of people who spend their time doing weird [...] > Anyway, I digress.. Indeed. And IMHO you'd better not judge about or insult the hackers that are willing to help you. ;) And FWIW, that is absolutely not my impression. Anyway, back on-topic. > > I seriously hope you're not getting much help on integrating such a > > horribly C/R backscatter beast. If, instead, you are willing to drop > > boxtrapper and need help with SA, we'd be glad to assist. > > OK, you make the point very clearly! So, if I don't use the boxtrapper > method, what do I do with the 1 in 10,000 emails which is a false positive > and isn't on my whitelist? How do I give that email the extra chance? Thanks. :) Part of the problem and my statement is, that around here we're every now and then discussing exactly that problem. Backscatter. You'll find the guys active in here will never ever do that. We are, however, victims. What about the *rare* FPs? Well, no system is perfect. If you want to reject spam (at the SMTP level, not after accepting!), do it at a score above the required threshold. Say, a score of 10 or 15. Or just dump them in some dedicated folder, to be searched if need be only. Everything between the spam threshold (default 5) and that high value simply goes to a folder for review (by Subject generally is sufficient). With a second threshold of 10, you should be getting < 1% of your spam in that review folder. And it's done quickly, because you only need to look out for potentially good stuff. Ah, and probably try not even to filter known good mail, like mailing lists, through SA. Btw, in my experience your estimate is quite correct. If you're getting significantly more FPs in 10k hams (not overall), you've customized SA much too aggressively. The default SA scores are optimized to avoid FPs like the plague at the cost of /slightly/ more spam slipping through. > >> And while I'm here, how come there's a large and rapidly growing binary > >> file in /home/myaccount/.spamassassin/auto-whitelist which currently > >> has 5mb of spammy addresses I've never emailed? > > > > It's the AWL, a historical score averager [2] for the senders addresses. > > But AFAICS (or the docs suggest) this is for whitelisting email addresses > sent FROM my account, but the logs show my account definitely DIDN'T send > this email. No, it isn't. For the moment, ignore the "white" in AWL. It really is just a historical score averager, keeping track of previous SA scores per sender and netblock. Unless you feed your own, sent mail through SA, AWL doesn't know about anything like you mentioned above. It does know about senders of everything it scans, which usually is the incoming mail. Now what's with that "white"? The original idea is, to allow senders who generally send low scoring mail to occasionally send something spammy looking without going beyond the spam threshold. This historic averaging typically doesn't apply to spam, because the sender is forged, mostly unique, and the originating netblocks are widely spread. Unlike with real, good mail. > >> Thanks, and apology for the length of this, but over the 1.5 months I've > >> been battling this, I've built up quite some info! > > > > Unfortunately, that info didn't include some real hints about the > > challenge response sender-verification pest, and why it is BAD. > > > > Please, do NOT use challenge response sender-verification, do NOT use > > boxtrapper. > > I can tell you're hinting at something...just say what you mean :) I did. Please do not use a C/R verification system. And explained, why you shouldn't. :) > Anyway, thanks for the help on this. You've persuaded me - I'll find another > method than BoxTrapper. It'll still involve spamassassin, I just can't think > of how to catch the false positives. Yay. :) Good. Just use SA. Start with the default. Tweak it a little, you'll find plenty of repeated advice for good third-party additional stuff in the last couple weeks archives. And don't whitelist, unless necessary. Usually, there's absolutely no need for it. There shouldn't be any real FPs, and you shouldn't worry about it right now. A low-volume review folder makes it easy to grab the very rare FPs. You'll find you will not need a backscattering line of defense. guenther -- char *t="\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4"; main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1: (c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}