Hello Reindl,

There will be a point where the decision to drop the mail is made based on the headers. Can't we log it there?
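To make the idea concrete, here is a minimal sketch of what logging at that point could look like. It assumes the message still carries the X-Spam-Status header in SpamAssassin's usual form ("Yes, score=6.3 required=5.0 tests=..."). The function name domain_and_score is hypothetical, and it reads the From: header rather than the envelope sender, so it does not solve the envelope-address problem you describe.

```python
import re
from email import message_from_string
from email.utils import parseaddr

# Matches the "score=6.3" part of an X-Spam-Status header value.
SCORE_RE = re.compile(r"score=(-?\d+(?:\.\d+)?)")


def domain_and_score(raw_message):
    """Return (sender_domain, score) for a scanned message, or None.

    Assumption: the message was already scanned and has an
    X-Spam-Status header. The domain comes from the From: header,
    which is NOT the envelope sender.
    """
    msg = message_from_string(raw_message)
    match = SCORE_RE.search(msg.get("X-Spam-Status", ""))
    if match is None:
        return None  # not scanned, or header in an unexpected form
    _, addr = parseaddr(msg.get("From", ""))
    domain = addr.rpartition("@")[2].lower()
    return domain, float(match.group(1))
```

Whatever glue makes the drop decision could call something like this and write the pair to its own log just before discarding the mail.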
On Tue, May 31, 2016 at 10:30 AM, Reindl Harald <h.rei...@thelounge.net> wrote:

> On 31.05.2016 at 19:25, Shivram Krishnan wrote:
>
>> Thanks guys.
>>
>> What I am going to ask might be a longshot.
>>
>> But is it possible for anyone who is running a mailserver to give a list
>> of source of SPAM (recent , anytime this year)and the SA score
>> associated? It will be extremely useful for my research and credit would
>> be given. Example:-
>> efetunisie.org ,6.3
>> abcxcf.com ,5.7
>
> problem is that SpamAssassin don't log envelope-adresses at all in the
> "spamd: result:" lines and so you even have no point to anything else than
> this line in case of mails with a missing message-ID
>
> discussed this here - nobody cares "use a glue with it's own logging - bla"
>
>> You might think that there will be privacy issues, but I am asking only
>> for SPAM mails which would be filtered anyways. I need a large corpus of
>> mails for evaluating my technique.
>>
>> On Tue, May 31, 2016 at 8:55 AM, Bowie Bailey <bowie_bai...@buc.com> wrote:
>>
>>> On 5/31/2016 1:38 AM, @lbutlr wrote:
>>>
>>>> On May 30, 2016, at 11:06 PM, Shivram Krishnan
>>>> <rorryk...@gmail.com> wrote:
>>>>
>>>>> 2) I have set a threshold of -10 to see how spamassassin
>>>>> assigns a score for every mail.
>>>>
>>>> No. Do not do this.
>>>
>>> Instead, set this option in your local.cf file:
>>>
>>> add_header all Report _REPORT_
>>>
>>> This will make SA add a report header to all emails so you can see
>>> how they score. As a plus, you will also see whether it's marked as
>>> ham or spam, which you lose by artificially lowering the threshold.
>>>
>>> You can also use this if you want more info on Bayes scoring (should
>>> be all one line):
>>>
>>> add_header all Bayes bayes=_BAYES_,
>>> N=_BAYESTC_(_BAYESTCLEARNED_-_BAYESTCHAMMY_+_BAYESTCSPAMMY_),
>>> ham=(_HAMMYTOKENS(5,short)_), spam=(_SPAMMYTOKENS(5,short)_)