I've been running SA for about a week now, and need to sa-(un)learn the FPs.
My system is Windoze/IMail (5sp4/8.13) and the harry and susan (shouldn't call them Ham and Spam, should I) folders contain all mis-identified email in one giant flat file each. Does this work? Must I bust them up into separate emails before calling sa-learn? The doc mentions the folders but says diddly-squat/infinity about the contents of those folders. Dan Barker Format of a big flat file: >>From <[EMAIL PROTECTED]> Thu Oct 21 17:17:58 2004 Received: from dan [172.27.0.30] by visioncomm.net with ESMTP (SMTPD32-8.13) id A7823A3001E; Thu, 21 Oct 2004 17:17:54 -0400 From: "Dan Barker" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> ... rest of headers <HTML> <TITLE></TITLE> <BODY > ... rest of message >>From <[EMAIL PROTECTED]> Thu Oct 21 17:44:42 2004 Received: from dan [172.27.0.30] by visioncomm.net with ESMTP (SMTPD32-8.13) id ADCA1BD007C; Thu, 21 Oct 2004 17:44:42 -0400 From: "Dan Barker" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Subject: Dbarker, Served in the MlLlTARY? ... rest of headers This is a multi-part message in MIME format. ------=_NextPart_000_03B4_01C4B795.A68C01E0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit ... rest of message ... ... for every email in the "box". The Headers stop and Body begins on the first blank line. I haven't figured out how the body ends yet. It appears to be the "From < in column 1". Yeah, that's it. I just ran a test with "From <" in column 1, and the email is stored with ">From <" instead. So, a splitter will be trivial to write, but must I?