I would like to start contributing to spamassassin and help to fight spam.
http://au.spamassassin.org/hacking.html lists how to submit mass-check results. I have a couple of questions: * The CORPUS_POLICY lists that you should use hand-verified spam/ham tiles, but the CORPUS_SUBMIT lists that you should only check the top 20 spam/ham messages. I'm pretty sure my corpus is quite good, but I don't want to check every message by hand. Can anybody elaborate on this policy? * I get about 4000 genuine spams per month and have a couple of mailboxes that I'm sure of only contain ham-mail. I receive both a lot of English and Dutch e-mails. * Are there any other contributors already submitting dutch/english corpora results? * Should the corpora be approx. 50% ham and 50% spam? * How many people submit their mass-check results? How many messages are in their corpora? Regards, Pieter BTW: is there a estimated release date set for spamassassin 2.70? -- http://zwiki.org/PieterB ------------------------------------------------------- This SF.net email is sponsored by: Perforce Software. Perforce is the Fast Software Configuration Management System offering advanced branching capabilities and atomic changes on 50+ platforms. Free Eval! http://www.perforce.com/perforce/loadprog.html _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk