On 03/10, Jason Bertoch wrote: > Discussion on the dev list points to a lack of sufficient ham in the > corpus which is necessary to generate score updates and publish new > rules. There was a recent drive for new submitters, but I'm still > trying to figure out how I can rearrange my configuration in order > to help.
Actually I believe the problem is currently insufficient spam: HAM: 188008 (150000 required) SPAM: 51330 (150000 required) Insufficient spam corpus to generate scores; aborting. - http://mail-archives.apache.org/mod_mbox/spamassassin-dev/201101.mbox/%3c4d40f924.3070...@dostech.ca%3E On 03/10, Adam Moffett wrote: > I get the impression that they want a representative sample of your > spam, and i will skew things in a bad way if I only submit the spam > that spamassassin already scored low. I got that impression too, but then somebody said if I'm already hand filtering spam that spamassassin missed (false negatives), I might as well submit it, and nobody objected. And I've been submitting it for a while. And yes, only contributing your ham is definitely also useful. Diversity of ham, as John mentions, is definitely a problem. On 03/10, John Hardin wrote: > Spam is easy to get, diverse ham much less so. That's funny, since the sa-updates are currently not happening due to a lack of spam. Also, I agree that https://fedorahosted.org/auto-mass-check/ is a better place to go for participating in the mass checks, since it's more maintained. It should be moved to the SA site eventually. My only change would be to move emailing priv...@spamassassin.apache.org to request a nightly mass check rsync account to the first step, because it can take a while. -- "Wash daily from nose-tip to tail-tip; drink deeply, but never too deep; And remember the night is for hunting, and forget not the day is for sleep." - The Law of the Jungle, Rudyard Kipling http://www.ChaosReigns.com