Am 31.05.2016 um 19:25 schrieb Shivram Krishnan:
Thanks guys.

What I am going to ask might be a longshot.

But is it possible for anyone who is running a mailserver to give a list
of source of SPAM (recent , anytime this year)and the SA score
associated? It will be extremely useful for my research and credit would
be given. Example:-
efetunisie.org ,6.3
abcxcf.com ,5.7

problem is that SpamAssassin don't log envelope-adresses at all in the "spamd: result:" lines and so you even have no point to anything else than this line in case of mails with a missing message-ID

discussed this here - nobody cares "use a glue with it's own logging - bla"

You might think that there will be privacy issues, but I am asking only
for SPAM mails which would be filtered anyways. I need a large corpus of
mails for evaluating my technique.

On Tue, May 31, 2016 at 8:55 AM, Bowie Bailey <bowie_bai...@buc.com
<mailto:bowie_bai...@buc.com>> wrote:

    On 5/31/2016 1:38 AM, @lbutlr wrote:

        On May 30, 2016, at 11:06 PM, Shivram Krishnan
        <rorryk...@gmail.com <mailto:rorryk...@gmail.com>> wrote:

            2) I have set a threshold of -10 to see how spamassassin
            assigns a score for every mail.

        No. Do not do this.


    Instead, set this option in your local.cf <http://local.cf> file:

    add_header all Report _REPORT_

    This will make SA add a report header to all emails so you can see
    how they score.  As a plus, you will also see whether it's marked as
    ham or spam, which you lose by artificially lowering the threshold.

    You can also use this if you want more info on Bayes scoring (should
    be all one line):

    add_header all Bayes bayes=_BAYES_,
    N=_BAYESTC_(_BAYESTCLEARNED_-_BAYESTCHAMMY_+_BAYESTCSPAMMY_),
    ham=(_HAMMYTOKENS(5,short)_), spam=(_SPAMMYTOKENS(5,short)_)


Attachment: signature.asc
Description: OpenPGP digital signature

Reply via email to