Am 31.05.2016 um 19:25 schrieb Shivram Krishnan:
Thanks guys. What I am going to ask might be a longshot. But is it possible for anyone who is running a mailserver to give a list of source of SPAM (recent , anytime this year)and the SA score associated? It will be extremely useful for my research and credit would be given. Example:- efetunisie.org ,6.3 abcxcf.com ,5.7
problem is that SpamAssassin don't log envelope-adresses at all in the "spamd: result:" lines and so you even have no point to anything else than this line in case of mails with a missing message-ID
discussed this here - nobody cares "use a glue with it's own logging - bla"
You might think that there will be privacy issues, but I am asking only for SPAM mails which would be filtered anyways. I need a large corpus of mails for evaluating my technique. On Tue, May 31, 2016 at 8:55 AM, Bowie Bailey <bowie_bai...@buc.com <mailto:bowie_bai...@buc.com>> wrote: On 5/31/2016 1:38 AM, @lbutlr wrote: On May 30, 2016, at 11:06 PM, Shivram Krishnan <rorryk...@gmail.com <mailto:rorryk...@gmail.com>> wrote: 2) I have set a threshold of -10 to see how spamassassin assigns a score for every mail. No. Do not do this. Instead, set this option in your local.cf <http://local.cf> file: add_header all Report _REPORT_ This will make SA add a report header to all emails so you can see how they score. As a plus, you will also see whether it's marked as ham or spam, which you lose by artificially lowering the threshold. You can also use this if you want more info on Bayes scoring (should be all one line): add_header all Bayes bayes=_BAYES_, N=_BAYESTC_(_BAYESTCLEARNED_-_BAYESTCHAMMY_+_BAYESTCSPAMMY_), ham=(_HAMMYTOKENS(5,short)_), spam=(_SPAMMYTOKENS(5,short)_)
signature.asc
Description: OpenPGP digital signature