On Tue, June 18, 2013 1:01 pm, Martin Gregorie wrote: > The main thing I notice is that there are only two Received: headers, > and no envelope-From so IMO you're hoping for too much from the > header-related SA rules simply because there's very little for SA to get > its teeth into.
Well, I'm not really concerned about getting any header-related SA rules to hit, for these tests. As I mentioned previously, my primary concern right now is the disconnect between the Bayes score during the automatic MTA delivery and during a manual spamc processing. I'm going to try training my database in a different way, using the on-server Spam mbox instead of the Eudora mbox, to see if I can get better results (e.g. if Eudora's mbox format is simply not correct). [The lack of envelope From is an artifact of copy/paste from Eudora... and in Eudora's mbox format, the envelope From is also stripped for some unknown reason. I'm really beginning to doubt Eudora's storage format for purposes of spam identification, though maybe I'm just being paranoid and the real cause is something else.] I'll probably add a .pw ban as well, but that's a separate issue. And, the _original_ subject of this email was about a new rule for HTML comment gibberish, which I would still love, but which is also unrelated to headers. Thanks. =) --- Amir