Re: Best practice for learning submissions

David B Funk Mon, 23 Jul 2018 19:27:37 -0700

On Mon, 23 Jul 2018, Nick Bright wrote:

On 7/23/2018 7:55 PM, Reindl Harald wrote:
and even if - whats the point to store the surrounding messages in thecorpus which you should keep forever if you need rebuild from scratchlater?what is the problem you try to solveand why can't you just store theattachment instead the whole mail containg it?
The problem I'm trying to solve is "how to implement a training system on myserver".
I suppose i could de-encapsulate an attachment with a script, before feedingit to sa-learn?

If your mail-box server is imap, has public folders capability and you haveaccess to the back-end storage (EG Dovecot) then you could implement areport-spam folder submission system.

EG your users drop spam messages into the report-spam folder and your scriptruns on the back-side, extracting the messages, feeding them to "spamc -l" andthen moving them into a "report-done" folder for archival purposes.

That or you have to glue together some kind of de-mimifying scripts insideprocmail to feed 'spamc -l' and hope that your users use some predictable kindof mime labeling so you can automate the unwrapping process. (good luck).

Either way you are at the mercy of your users to make valid judgments aboutwhether a particular message is actual spam (and not just somemarketing/newsletter thing they signed up for and then forgot).




--
Dave Funk                                  University of Iowa
<dbfunk (at) engineering.uiowa.edu>        College of Engineering
319/335-5751   FAX: 319/384-0549           1256 Seamans Center
Sys_admin/Postmaster/cell_admin            Iowa City, IA 52242-1527
#include <std_disclaimer.h>
Better is not better, 'standard' is better. B{

Re: Best practice for learning submissions

Reply via email to