On Tue, 18 Jun 2013 13:13:56 -0600 (MDT) Amir Caspi wrote: > Well, I'm not really concerned about getting any header-related SA > rules to hit, for these tests. As I mentioned previously, my primary > concern right now is the disconnect between the Bayes score during > the automatic MTA delivery and during a manual spamc processing. I'm > going to try training my database in a different way, using the > on-server Spam mbox instead of the Eudora mbox, to see if I can get > better results (e.g. if Eudora's mbox format is simply not correct). > [The lack of envelope From is an artifact of copy/paste from > Eudora... and in Eudora's mbox format, the envelope From is also > stripped for some unknown reason.
That's set on delivery into a spool file, but IIRC it's not transmitted in POP or IMAP (IMAP has a concept of an envelope but it's not the same thing). Some clients put an address there for the sake of form, but it's a bit pointless. One thing to watch out for is that a mailbox may contain hidden deleted mail that remains there until the mail client compacts/expunges the mailbox. For that reason I prefer explicit training folders rather than folders where misclassified mails have been moved-out.