Yes, I'm the guy to talk to about GA-related stuff, or submitting data to the spam/nonspam corpus. "Spam scattered through mailboxes" means that frequently over the course of years, people accidentally store the occasional spam in an otherwise non-spam folder. When you run mass-check on those supposedly nonspam folders, the output will get "contaminated" by those scattered spams. Often you can find these by sorting the output of mass-check and then manually verifying the high-scoring messages, removing the spam from your mail archive.
C David T-G wrote: DT> Craig -- DT> DT> ...and then Craig R Hughes said... DT> % DT> % Check out the /masses directory in the distribution. It has all that stuff in DT> % there, and a README which I think is more or less up to date. DT> DT> Ah, cool. Thanks. DT> DT> I have some questions about what constitutes "spam scattered through DT> mailboxes" and what's worth sending in. Do I ask you (since I think I've DT> seen you talk about the GA before), the submission address (the only DT> address in the README), or the list? DT> DT> DT> % DT> % C DT> DT> DT> TIA & HAND DT> DT> :-D DT> _______________________________________________________________ Hundreds of nodes, one monster rendering program. Now that's a super model! Visit http://clustering.foundries.sf.net/ _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk