On 01/24/2011 04:42 PM, J4 wrote:
> Dear all,
>
>     I am cure this question has come up before on this list, yet after
> spending a little while trawling Google, I did not find any sites :( 
> So I ask here!
>
> Are there are any recent (<6 months) ham or spam corporaout there that
> I can download and feed into sa-learn?  I would like to give the
> server a small head-start.
>
> Ciao,s.

And a little more seaching came  up with these. Phew:

http://plg.uwaterloo.ca/~gvcormac/treccorpus07/  (old)
http://www.cc.gatech.edu/projects/doi/WebbSpamCorpus.html
http://spamassassin.apache.org/publiccorpus/

So, I think I have some to start with.

Cheers, s

Reply via email to