On 09/17/2009 08:34 AM, Mark Martinec wrote:
Austin,
now hope to do this Thursday/Friday. I should be able to scan my
million or so messages in a day on my cluster.
Wow, that makes me feel inadequate :) I'm struggling to clean up my
little ham sample of 3600 messages, and looking at another couple
thousand that I'll do if I've got time...
Thanks, that will be nice to have. As the rulesqa site can distinguish
results based on a corpus submitter, even a small but carefully checked
collection is worth having.
I found it valuable to double check ham samples which fire rules
URIBL_JP_SURBL, URIBL_WS_SURBL, URIBL_OB_SURBL,
RCVD_IN_PBL, RCVD_IN_XBL, RCVD_IN_PSBL, RCVD_IN_SSBL
https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6156
Be aware that gmail, yahoo.co.jp and rr.com were whitelisted from new
inclusion only 5 days ago. IP's from prior could still be listed before
the 2 week timeout. Auto-whitelisting of yahoo.com is not yet
implemented. riel is working on DKIM checking in order to whitelist
yahoo.com.
FP's of PSBL are already rare, but they should become rarer.
Please let us know if you see FP's from a legitimate ISP MTA server.
That MTA can be whitelisted from PSBL by either listing itself in DNSWL,
or letting us know to check it by SPF or DKIM.
Warren Togami
wtog...@redhat.com