As an aside,
formail -D 20000 /tmp/dup_id_cache.$$ -s < mbox.txt > mbox_no_dupes.txt rm -f /tmp/dup_id_cache.$$ will do a decent job of weeding out duplicates (based upon message id), where 20000 is the size of the id cache. ------------------------------------------------------- This SF.net email is sponsored by: Perforce Software. Perforce is the Fast Software Configuration Management System offering advanced branching capabilities and atomic changes on 50+ platforms. Free Eval! http://www.perforce.com/perforce/loadprog.html _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk