On 01/24/2011 04:42 PM, J4 wrote:
> Dear all,
>
> I am sure this question has come up before on this list, yet after
> spending a little while trawling Google, I did not find any sites :(
> So I ask here!
>
> Are there any recent (<6 months) ham or spam corpora out there that
> I can download and feed into sa-learn? I would like to give the
> server a small head-start.
>
> Ciao, s.

And a little more searching came up with these. Phew:

http://plg.uwaterloo.ca/~gvcormac/treccorpus07/ (old)
http://www.cc.gatech.edu/projects/doi/WebbSpamCorpus.html
http://spamassassin.apache.org/publiccorpus/

So, I think I have some to start with.

Cheers,
s
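
For anyone finding this thread later, here is a rough sketch of how unpacked corpora might be fed to sa-learn from a small Python wrapper. It assumes the archives have already been extracted into one directory of ham and one of spam; the directory names and the helper functions (`sa_learn_command`, `train`) are made up for illustration. Only the `--ham` and `--spam` options of sa-learn are used, and by default the script just prints the commands rather than running them.

```python
import subprocess
from pathlib import Path


def sa_learn_command(kind: str, corpus_dir: Path) -> list[str]:
    """Build the sa-learn argument list for one directory of messages.

    kind is "ham" or "spam"; corpus_dir is a hypothetical directory
    holding one message per file.
    """
    if kind not in ("ham", "spam"):
        raise ValueError(f"kind must be 'ham' or 'spam', got {kind!r}")
    return ["sa-learn", f"--{kind}", str(corpus_dir)]


def train(corpora: dict[str, Path], dry_run: bool = True) -> list[list[str]]:
    """Return the sa-learn commands for each corpus; run them unless dry_run."""
    commands = [sa_learn_command(kind, path) for kind, path in corpora.items()]
    if not dry_run:
        for cmd in commands:
            # check=True raises CalledProcessError if sa-learn exits non-zero
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Placeholder paths: point these at wherever the corpora were unpacked.
    corpora = {"ham": Path("corpus/ham"), "spam": Path("corpus/spam")}
    for cmd in train(corpora, dry_run=True):
        print(" ".join(cmd))
```

Calling `train(corpora, dry_run=False)` would actually invoke sa-learn, so it is worth checking the printed commands first. Old corpora may not resemble current mail, so treat this as a head-start and keep training on your own messages.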