Bart Schaefer said:

>   formail -s spamassassin -L < rawmbox > taggedmbox

Another way to do it, BTW, is to use "mass-check", which comes with
SpamAssassin in the "masses" subdir (or the "tools" rpm).

That will scan every message in an mbox, MH directory, Maildir, Cyrus,
etc. efficiently and output, for each one, its message ID, whether it's
spam or not, its score in points, and the tests it hit.

We use it for corpus scanning and reporting.

Since you want to sort into 2 piles, though, and you have such a large
number of mails, I think you're going to have to do some scripting either
way.  Personally, I would split the mbox up into a tree of directories and
write a shell script to iterate over each dir, running mass-check on each
one; then, using the results, mv the spam candidates into another tree.
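
A rough sketch of that last step (moving the spam candidates), in case it
helps.  It assumes the mass-check results have already been saved to a file,
and that each result line looks like "<verdict> <score> <path> <tests>" with
"Y" as the spam verdict.  That line format is my assumption here, so check it
against what your mass-check really prints.  The function name and arguments
are made up for illustration.

```shell
#!/bin/sh
# Sketch only: move messages flagged as spam in a results file into a
# separate tree, keeping the name of the directory each message came from.
# ASSUMPTION: result lines look like "<verdict> <score> <path> <tests>",
# where verdict "Y" means spam.  Verify against your real mass-check output.
# Paths containing whitespace are not handled.

move_spam() {
    results=$1    # file holding the scan results (hypothetical name)
    spamtree=$2   # root of the destination tree for spam candidates

    # Skip comment lines; read verdict, score, and path; ignore the rest.
    grep -v '^#' "$results" | while read -r verdict score path rest; do
        if [ "$verdict" = "Y" ] && [ -f "$path" ]; then
            # Mirror the source subdirectory so file names do not collide.
            sub=$(basename "$(dirname "$path")")
            mkdir -p "$spamtree/$sub" && mv "$path" "$spamtree/$sub/"
        fi
    done
}
```

Called as something like `move_spam results.log spamtree`, this leaves the
non-spam messages where they were, so the original tree ends up as the other
pile.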

Working with big mboxes is a pain.  Big dirs are not ;)

--j.


_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
