I can put up my bayes_seen and bayes_toks files, if it will help for debugging purposes.how about just a -D output?
I assume you mean spamassassin -D, not sa-learn -D, but I've attached both a run of spamassassin -D on a spam (which was missed due to Bayes not being available), and a run of sa-learn -D on a directory containing spam (some of which was already learned), just in case.
Ben
[EMAIL PROTECTED]:~/Maildir/.Missed Spam/cur$ spamassassin -D < 1061926646.27508_1074.squeak,S\=2481\:2,S debug: Score set 0 chosen. debug: running in taint mode? no debug: using "/usr/share/spamassassin" for default rules dir debug: using "/etc/spamassassin" for site rules dir debug: using "/home/ben/.spamassassin" for user state dir debug: using "/home/ben/.spamassassin/user_prefs" for user prefs file debug: Failed to parse line in SpamAssassin configuration, skipping: defang_mime 1 debug: using "/home/ben/.spamassassin" for user state dir debug: bayes: 2297 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_toks debug: bayes: 2297 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_seen debug: bayes: found bayes db version 2 debug: bayes: Not available for scanning, only 64 spam(s) in Bayes DB < 200 debug: bayes: 2297 untie-ing debug: bayes: 2297 untie-ing db_toks debug: bayes: 2297 untie-ing db_seen debug: Score set 1 chosen. debug: Initialising learner debug: using "/home/ben/.spamassassin" for user state dir debug: bayes: 2297 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_toks debug: bayes: 2297 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_seen debug: bayes: found bayes db version 2 debug: bayes: Not available for scanning, only 64 spam(s) in Bayes DB < 200 debug: bayes: 2297 untie-ing debug: bayes: 2297 untie-ing db_toks debug: bayes: 2297 untie-ing db_seen debug: received-header: parsed as [ ip=134.56.3.131 rdns=mx01.net.com helo=mx01.net.com by=mx01-int.net.com ident= ] debug: received-header: parsed as [ ip=204.42.44.151 rdns=mx1.renewedmails.com helo=mx1.renewedmails.com by=mx01.net.com ident= ] debug: received-header: 'from' 134.56.3.131 is near to first 'by' debug: received-header: relay 134.56.3.131 trusted? yes debug: received-header: 'by' mx01.net.com has public IP 134.56.3.131 debug: received-header: relay 204.42.44.151 trusted? no debug: Loading languages file... debug: Can't determine language uniquely enough debug: is Net::DNS::Resolver available? yes debug: trying (3) leo.org... debug: looking up MX for 'leo.org' debug: MX for 'leo.org' exists? 1 debug: MX lookup of leo.org succeeded => Dns available (set dns_available to hardcode) debug: is DNS available? 1 debug: all '*From' addrs: [EMAIL PROTECTED] debug: running header regexp tests; score so far=0 debug: running body-text per-line regexp tests; score so far=0 debug: running raw-body-text per-line regexp tests; score so far=2.245 debug: running uri tests; score so far=2.245 debug: uri tests: Done uriRE debug: running full-text regexp tests; score so far=2.245 debug: DCCifd is not available: no r/w dccifd socket found. debug: Current PATH is: /usr/local/bin:/usr/bin:/bin:/usr/bin/X11:/usr/games debug: DCC is not available: no executable dccproc found. debug: Pyzor is not available: pyzor not found debug: all '*To' addrs: [EMAIL PROTECTED] [EMAIL PROTECTED] debug: forged-HELO: from=renewedmails.com helo=renewedmails.com by=net.com debug: DNS MX records found: 5 debug: RBL: success for 10 of 11 queries debug: RBL: timeout for osirusoft-notfirsthop,osirusoft after 3 seconds debug: running meta tests; score so far=2.245 debug: auto-learn? ham=0.1, spam=12, body-hits=2.245, head-hits=0 debug: auto-learn: currently using scoreset 1. no need to recompute. debug: auto-learn? no: inside auto-learn thresholds debug: is spam? score=2.245 required=4 tests=HTML_IMAGE_ONLY_02,HTML_MESSAGE Return-Path: <[EMAIL PROTECTED]> Received: from localhost ([EMAIL PROTECTED] [127.0.0.1]) by net.com (8.12.3/8.12.3/Debian-8) with ESMTP id h7QJW59A005143 for <[EMAIL PROTECTED]>; Tue, 26 Aug 2003 12:32:05 -0700 Received: from west-mail.net.com by localhost with IMAP (fetchmail-5.9.11) for [EMAIL PROTECTED] (single-drop); Tue, 26 Aug 2003 12:32:05 -0700 (PDT) Received: from mx01-int.net.com ([134.56.112.13]) by west-mail.net.com (Netscape Messaging Server 4.15) with ESMTP id HK8RNN00.9UQ for <[EMAIL PROTECTED]>; Tue, 26 Aug 2003 12:33:23 -0700 Received: from mx01.net.com (mx01.net.com [134.56.3.131]) by mx01-int.net.com (Switch-2.2.4/Switch-2.2.4) with ESMTP id h7QJXNJ01533 for <[EMAIL PROTECTED]>; Tue, 26 Aug 2003 12:33:23 -0700 (PDT) Received: from mx1.renewedmails.com (mx1.renewedmails.com [204.42.44.151]) by mx01.net.com (Switch-2.2.5/Switch-2.2.5) with ESMTP id h7QJXMU10746 for <[EMAIL PROTECTED]>; Tue, 26 Aug 2003 12:33:22 -0700 (PDT) Received: by mx1.renewedmails.com (PowerMTA(TM) v2.0r3) id h9eqa003tuca; Tue, 26 Aug 2003 14:27:26 -0500 (envelope-from <[EMAIL PROTECTED]>) From: "ariella" <[EMAIL PROTECTED]> To: [EMAIL PROTECTED] Subject: Update pjj8THlM Date: 26 Aug 2003 15:27:25 -0500 Message-ID: <[EMAIL PROTECTED]> Reply-To: [EMAIL PROTECTED] X-ppt: +8F5N0&YE="YC;VT` MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_XUI8ihQ7keol7mwVErlO1" X-Spam-Checker-Version: SpamAssassin 2.60-rc2-benrules1 (1.198-2003-08-22-exp) on squeak.shout.net.com X-Spam-Status: No, hits=2.2 required=4.0 tests=HTML_IMAGE_ONLY_02,HTML_MESSAGE autolearn=no version=2.60-rc2-benrules1 X-Spam-Level: **
------=_XUI8ihQ7keol7mwVErlO1 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 8bit ------=_XUI8ihQ7keol7mwVErlO1 Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: 8bit <html><body><center><p><a href="http://www.renewedmails.com/cgi-bin/tx/l.cgi?xix=tuf"><img src="http://www.renewedmails.com/tuf/tuf01.gif" border="0"><br><img src="http://www.renewedmails.com/tuf/tuf02.gif" border="0"></a><p><img src="http://www.renewedmails.com/tuf/q8F5N0lYE=dYCaVTr.gif"><p><a href="http://www.renewedmails.com/cgi-bin/vmtaplm/[EMAIL PROTECTED]"><img src="http://www.renewedmails.com/tuf/tuf03.gif" border="0"></a></center></body></html> ------=_XUI8ihQ7keol7mwVErlO1--
[EMAIL PROTECTED]:~/Maildir/.Missed Spam/cur$ sa-learn -D --spam . debug: Score set 0 chosen. debug: running in taint mode? no debug: using "/usr/share/spamassassin" for default rules dir debug: using "/etc/spamassassin" for site rules dir debug: using "/home/ben/.spamassassin/user_prefs" for user prefs file debug: Failed to parse line in SpamAssassin configuration, skipping: defang_mime 1 debug: bayes: 3712 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_toks debug: bayes: 3712 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_seen debug: bayes: found bayes db version 2 debug: bayes: Not available for scanning, only 64 spam(s) in Bayes DB < 200 debug: bayes: 3712 untie-ing debug: bayes: 3712 untie-ing db_toks debug: bayes: 3712 untie-ing db_seen debug: Score set 0 chosen. debug: Initialising learner debug: Initialising learner debug: Syncing Bayes journal and expiring old tokens... debug: lock: 3712 created /home/ben/.spamassassin/bayes.lock.squeak.shout.net.com.3712 debug: lock: 3712 trying to get lock on /home/ben/.spamassassin/bayes with 0 retries debug: lock: 3712 link to /home/ben/.spamassassin/bayes.lock: link ok debug: bayes: 3712 tie-ing to DB file R/W /home/ben/.spamassassin/bayes_toks debug: bayes: 3712 tie-ing to DB file R/W /home/ben/.spamassassin/bayes_seen debug: bayes: found bayes db version 2 debug: Syncing complete. debug: Removing Markup debug: Learning Spam debug: uri tests: Done uriRE debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice debug: Removing Markup debug: Learning Spam debug: uri tests: Done uriRE debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice debug: Learning Spam debug: decoded MIME header: " דואר אלקטרוני:על הדבש והעוקץ - מידעון רנסאנס בנושא "דואר זבל"" debug: uri tests: Done uriRE debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice debug: Learning Spam debug: uri tests: Done uriRE debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice debug: Removing Markup debug: Learning Spam debug: uri tests: Done uriRE debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice debug: Removing Markup debug: Learning Spam debug: uri tests: Done uriRE debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice debug: Removing Markup debug: Learning Spam debug: uri tests: Done uriRE debug: tokenize: header tokens for *p = "U*s.douglass D*scievents.com D*com" debug: tokenize: header tokens for X-WM-Posted-At = "ic.SCIevents.com; Wed, 27 Aug 03 10:02:30 -0700" debug: tokenize: header tokens for *F = "U*s.douglass D*scievents.com D*com" debug: tokenize: header tokens for To = "U*BEN D*net.com D*com" debug: tokenize: header tokens for *M = " OEA0f43 OEB19b0 OEC026ea8c0 seand1l1jfwmmy " debug: tokenize: header tokens for MIME-Version = "" debug: tokenize: header tokens for *c = "multipart/alternative; ----=_ NHxtPHrt _ HHH _ HHHH _ HHHHHHHH . HHHHHHHH" debug: tokenize: header tokens for X-Priority = "3 (Normal)" debug: tokenize: header tokens for X-MSMail-Priority = "Normal" debug: tokenize: header tokens for *x = "Microsoft Outlook, Build 10.0.4510" debug: tokenize: header tokens for Importance = "Normal" debug: tokenize: header tokens for x-mimeole = "Produced By Microsoft MimeOLE V6.00.2800.1165" debug: tokenize: header tokens for *r = " [192.168.110] ([EMAIL PROTECTED]); " debug: tokenize: header tokens for *r = " [192.168.110] ([EMAIL PROTECTED]); ic.SCIevents.com (dsl12.argotech.net [209.76.234] (may be forged)) by mx02.net.com (Switch-2.2.5/Switch-2.2.5) <[EMAIL PROTECTED]>; " debug: Removing Markup debug: Learning Spam debug: uri tests: Done uriRE debug: tokenize: header tokens for *p = "U*rachel32480 D*yahoo.com D*com" debug: tokenize: header tokens for *m = " 200308271750 h7RHocU06883 mx01 net com " debug: tokenize: header tokens for *F = "U*journeyman81_317 D*gelrevision.nl D*nl" debug: tokenize: header tokens for To = "U*ben D*net.com D*com" debug: tokenize: header tokens for Mime-Version = "1.0" debug: tokenize: header tokens for *c = "; charset=ISO-8859-1" debug: tokenize: header tokens for Content-Transfer-Encoding = "7bit" debug: tokenize: header tokens for *r = " yahoo.com (adsl-64-109-198-100.dsl.yntwoh.ameritech.net [64.109.198]) by mx01.net.com (Switch-2.2.5/Switch-2.2.5) <[EMAIL PROTECTED]>; " debug: tokenize: header tokens for *r = " yahoo.com (adsl-64-109-198-100.dsl.yntwoh.ameritech.net [64.109.198]) by mx01.net.com (Switch-2.2.5/Switch-2.2.5) <[EMAIL PROTECTED]>; mx01.net.com (mx01.net.com [134.56.3]) by mx02-int.net.com (Switch-2.2.4/Switch-2.2.4) <[EMAIL PROTECTED]>; " Learned from 2 messages (8 messages examined). debug: bayes: 3712 untie-ing debug: bayes: 3712 untie-ing db_toks debug: bayes: 3712 untie-ing db_seen debug: bayes: files locked, now unlocking lock debug: unlock: 3712 unlink /home/ben/.spamassassin/bayes.lock