Re: [SAtalk] 2.60rc2: Getting "bayes: Not available for scanning,only X spam(s) < 200" when trained on thousands of spams

Ben Gertzfield Wed, 27 Aug 2003 19:02:43 +0000

Theo Van Dinter wrote:

>> I can put up my bayes_seen and bayes_toks files, if it will help for debugging purposes.
>
> how about just a -D output?

I assume you mean spamassassin -D, not sa-learn -D, but just in case I've attached both: a run of spamassassin -D on a spam (which was missed because Bayes was not available), and a run of sa-learn -D on a directory of spam, some of which had already been learned.

Ben

[EMAIL PROTECTED]:~/Maildir/.Missed Spam/cur$ spamassassin -D < 1061926646.27508_1074.squeak,S\=2481\:2,S
debug: Score set 0 chosen.
debug: running in taint mode? no
debug: using "/usr/share/spamassassin" for default rules dir
debug: using "/etc/spamassassin" for site rules dir
debug: using "/home/ben/.spamassassin" for user state dir
debug: using "/home/ben/.spamassassin/user_prefs" for user prefs file
debug: Failed to parse line in SpamAssassin configuration, skipping: defang_mime 1
debug: using "/home/ben/.spamassassin" for user state dir
debug: bayes: 2297 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_toks
debug: bayes: 2297 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_seen
debug: bayes: found bayes db version 2
debug: bayes: Not available for scanning, only 64 spam(s) in Bayes DB < 200
debug: bayes: 2297 untie-ing
debug: bayes: 2297 untie-ing db_toks
debug: bayes: 2297 untie-ing db_seen
debug: Score set 1 chosen.
debug: Initialising learner
debug: using "/home/ben/.spamassassin" for user state dir
debug: bayes: 2297 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_toks
debug: bayes: 2297 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_seen
debug: bayes: found bayes db version 2
debug: bayes: Not available for scanning, only 64 spam(s) in Bayes DB < 200
debug: bayes: 2297 untie-ing
debug: bayes: 2297 untie-ing db_toks
debug: bayes: 2297 untie-ing db_seen
debug: received-header: parsed as [ ip=134.56.3.131 rdns=mx01.net.com helo=mx01.net.com by=mx01-int.net.com ident= ]
debug: received-header: parsed as [ ip=204.42.44.151 rdns=mx1.renewedmails.com helo=mx1.renewedmails.com by=mx01.net.com ident= ]
debug: received-header: 'from' 134.56.3.131 is near to first 'by'
debug: received-header: relay 134.56.3.131 trusted? yes
debug: received-header: 'by' mx01.net.com has public IP 134.56.3.131
debug: received-header: relay 204.42.44.151 trusted? no
debug: Loading languages file...
debug: Can't determine language uniquely enough
debug: is Net::DNS::Resolver available? yes
debug: trying (3) leo.org...
debug: looking up MX for 'leo.org'
debug: MX for 'leo.org' exists? 1
debug: MX lookup of leo.org succeeded => Dns available (set dns_available to hardcode)
debug: is DNS available? 1
debug: all '*From' addrs: [EMAIL PROTECTED]
debug: running header regexp tests; score so far=0
debug: running body-text per-line regexp tests; score so far=0
debug: running raw-body-text per-line regexp tests; score so far=2.245
debug: running uri tests; score so far=2.245
debug: uri tests: Done uriRE
debug: running full-text regexp tests; score so far=2.245
debug: DCCifd is not available: no r/w dccifd socket found.
debug: Current PATH is: /usr/local/bin:/usr/bin:/bin:/usr/bin/X11:/usr/games
debug: DCC is not available: no executable dccproc found.
debug: Pyzor is not available: pyzor not found
debug: all '*To' addrs: [EMAIL PROTECTED] [EMAIL PROTECTED]
debug: forged-HELO: from=renewedmails.com helo=renewedmails.com by=net.com
debug: DNS MX records found: 5
debug: RBL: success for 10 of 11 queries
debug: RBL: timeout for osirusoft-notfirsthop,osirusoft after 3 seconds
debug: running meta tests; score so far=2.245
debug: auto-learn? ham=0.1, spam=12, body-hits=2.245, head-hits=0
debug: auto-learn: currently using scoreset 1.  no need to recompute.
debug: auto-learn? no: inside auto-learn thresholds
debug: is spam? score=2.245 required=4 tests=HTML_IMAGE_ONLY_02,HTML_MESSAGE
Return-Path: <[EMAIL PROTECTED]>
Received: from localhost ([EMAIL PROTECTED] [127.0.0.1])
        by net.com (8.12.3/8.12.3/Debian-8) with ESMTP id h7QJW59A005143
        for <[EMAIL PROTECTED]>; Tue, 26 Aug 2003 12:32:05 -0700
Received: from west-mail.net.com
        by localhost with IMAP (fetchmail-5.9.11)
        for [EMAIL PROTECTED] (single-drop); Tue, 26 Aug 2003 12:32:05 -0700 (PDT)
Received: from mx01-int.net.com ([134.56.112.13]) by
          west-mail.net.com (Netscape Messaging Server 4.15) with ESMTP id
          HK8RNN00.9UQ for <[EMAIL PROTECTED]>; Tue, 26 Aug
          2003 12:33:23 -0700
Received: from mx01.net.com (mx01.net.com [134.56.3.131])
        by mx01-int.net.com (Switch-2.2.4/Switch-2.2.4) with ESMTP id h7QJXNJ01533
        for <[EMAIL PROTECTED]>; Tue, 26 Aug 2003 12:33:23 -0700 (PDT)
Received: from mx1.renewedmails.com (mx1.renewedmails.com [204.42.44.151])
        by mx01.net.com (Switch-2.2.5/Switch-2.2.5) with ESMTP id h7QJXMU10746
        for <[EMAIL PROTECTED]>; Tue, 26 Aug 2003 12:33:22 -0700 (PDT)
Received: by mx1.renewedmails.com (PowerMTA(TM) v2.0r3) id h9eqa003tuca; Tue, 26 Aug 2003 14:27:26 -0500 (envelope-from <[EMAIL PROTECTED]>)
From: "ariella" <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Subject: Update pjj8THlM
Date: 26 Aug 2003 15:27:25 -0500
Message-ID: <[EMAIL PROTECTED]>
Reply-To: [EMAIL PROTECTED]
X-ppt: +8F5N0&YE="YC;VT`
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_XUI8ihQ7keol7mwVErlO1"
X-Spam-Checker-Version: SpamAssassin 2.60-rc2-benrules1 (1.198-2003-08-22-exp) on
        squeak.shout.net.com
X-Spam-Status: No, hits=2.2 required=4.0 tests=HTML_IMAGE_ONLY_02,HTML_MESSAGE
        autolearn=no version=2.60-rc2-benrules1
X-Spam-Level: **



------=_XUI8ihQ7keol7mwVErlO1
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: 8bit



 &nbsp;&nbsp;&nbsp;&nbsp;





------=_XUI8ihQ7keol7mwVErlO1
Content-Type: text/html; charset="iso-8859-1"
Content-Transfer-Encoding: 8bit

<html><body><center><p><a 
href="http://www.renewedmails.com/cgi-bin/tx/l.cgi?xix=tuf"><img 
src="http://www.renewedmails.com/tuf/tuf01.gif" border="0"><br><img 
src="http://www.renewedmails.com/tuf/tuf02.gif" border="0"></a><p><img 
src="http://www.renewedmails.com/tuf/q8F5N0lYE=dYCaVTr.gif"><p><a 
href="http://www.renewedmails.com/cgi-bin/vmtaplm/[EMAIL PROTECTED]"><img 
src="http://www.renewedmails.com/tuf/tuf03.gif" border="0"></a></center></body></html>


------=_XUI8ihQ7keol7mwVErlO1--
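
In the output above, the line that matters is the Bayes availability message: the DB reports only 64 spams against a minimum of 200. The following is a minimal Python sketch for pulling that line out of a saved -D log. It assumes the message format is exactly as printed in this run; the `ham` alternative in the pattern is an assumption, since only the spam form appears here, and `find_bayes_gate` is a hypothetical helper name, not part of SpamAssassin.

```python
import re

# Pattern for the availability message as it appears in the debug output above.
# The "ham" alternative is an assumption; only the "spam" form is shown in this log.
BAYES_GATE = re.compile(
    r"bayes: Not available for scanning, only (\d+) (spam|ham)\(s\) in Bayes DB < (\d+)"
)


def find_bayes_gate(debug_text):
    """Return (kind, count, minimum) for the first gate message, or None if absent."""
    match = BAYES_GATE.search(debug_text)
    if match is None:
        return None
    count, kind, minimum = match.groups()
    return kind, int(count), int(minimum)


# Line copied from the spamassassin -D run above.
sample = "debug: bayes: Not available for scanning, only 64 spam(s) in Bayes DB < 200"
print(find_bayes_gate(sample))  # ('spam', 64, 200)
```

Running this over the full debug output would report the same count each time the message repeats, which makes it easy to compare the count before and after a training run.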

[EMAIL PROTECTED]:~/Maildir/.Missed Spam/cur$ sa-learn -D --spam .
debug: Score set 0 chosen.
debug: running in taint mode? no
debug: using "/usr/share/spamassassin" for default rules dir
debug: using "/etc/spamassassin" for site rules dir
debug: using "/home/ben/.spamassassin/user_prefs" for user prefs file
debug: Failed to parse line in SpamAssassin configuration, skipping: defang_mime 1
debug: bayes: 3712 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_toks
debug: bayes: 3712 tie-ing to DB file R/O /home/ben/.spamassassin/bayes_seen
debug: bayes: found bayes db version 2
debug: bayes: Not available for scanning, only 64 spam(s) in Bayes DB < 200
debug: bayes: 3712 untie-ing
debug: bayes: 3712 untie-ing db_toks
debug: bayes: 3712 untie-ing db_seen
debug: Score set 0 chosen.
debug: Initialising learner
debug: Initialising learner
debug: Syncing Bayes journal and expiring old tokens...
debug: lock: 3712 created /home/ben/.spamassassin/bayes.lock.squeak.shout.net.com.3712
debug: lock: 3712 trying to get lock on /home/ben/.spamassassin/bayes with 0 retries
debug: lock: 3712 link to /home/ben/.spamassassin/bayes.lock: link ok
debug: bayes: 3712 tie-ing to DB file R/W /home/ben/.spamassassin/bayes_toks
debug: bayes: 3712 tie-ing to DB file R/W /home/ben/.spamassassin/bayes_seen
debug: bayes: found bayes db version 2
debug: Syncing complete.
debug: Removing Markup
debug: Learning Spam
debug: uri tests: Done uriRE
debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice
debug: Removing Markup
debug: Learning Spam
debug: uri tests: Done uriRE
debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice
debug: Learning Spam
debug: decoded MIME header: " דואר אלקטרוני:על הדבש והעוקץ - מידעון רנסאנס בנושא "דואר זבל""
debug: uri tests: Done uriRE
debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice
debug: Learning Spam
debug: uri tests: Done uriRE
debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice
debug: Removing Markup
debug: Learning Spam
debug: uri tests: Done uriRE
debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice
debug: Removing Markup
debug: Learning Spam
debug: uri tests: Done uriRE
debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice
debug: Removing Markup
debug: Learning Spam
debug: uri tests: Done uriRE
debug: tokenize: header tokens for *p = "U*s.douglass D*scievents.com D*com"
debug: tokenize: header tokens for X-WM-Posted-At = "ic.SCIevents.com; Wed, 27 Aug 03 10:02:30 -0700"
debug: tokenize: header tokens for *F = "U*s.douglass D*scievents.com D*com"
debug: tokenize: header tokens for To = "U*BEN D*net.com D*com"
debug: tokenize: header tokens for *M = " OEA0f43 OEB19b0 OEC026ea8c0 seand1l1jfwmmy "
debug: tokenize: header tokens for MIME-Version = ""
debug: tokenize: header tokens for *c = "multipart/alternative;   ----=_ NHxtPHrt _ HHH _ HHHH _ HHHHHHHH . HHHHHHHH"
debug: tokenize: header tokens for X-Priority = "3 (Normal)"
debug: tokenize: header tokens for X-MSMail-Priority = "Normal"
debug: tokenize: header tokens for *x = "Microsoft Outlook, Build 10.0.4510"
debug: tokenize: header tokens for Importance = "Normal"
debug: tokenize: header tokens for x-mimeole = "Produced By Microsoft MimeOLE V6.00.2800.1165"
debug: tokenize: header tokens for *r = "   [192.168.110] ([EMAIL PROTECTED]); "
debug: tokenize: header tokens for *r = "   [192.168.110] ([EMAIL PROTECTED]);    ic.SCIevents.com (dsl12.argotech.net [209.76.234] (may be forged)) by mx02.net.com (Switch-2.2.5/Switch-2.2.5)         <[EMAIL PROTECTED]>; "
debug: Removing Markup
debug: Learning Spam
debug: uri tests: Done uriRE
debug: tokenize: header tokens for *p = "U*rachel32480 D*yahoo.com D*com"
debug: tokenize: header tokens for *m = " 200308271750 h7RHocU06883 mx01 net com "
debug: tokenize: header tokens for *F = "U*journeyman81_317 D*gelrevision.nl D*nl"
debug: tokenize: header tokens for To = "U*ben D*net.com D*com"
debug: tokenize: header tokens for Mime-Version = "1.0"
debug: tokenize: header tokens for *c = "; charset=ISO-8859-1"
debug: tokenize: header tokens for Content-Transfer-Encoding = "7bit"
debug: tokenize: header tokens for *r = "  yahoo.com (adsl-64-109-198-100.dsl.yntwoh.ameritech.net [64.109.198]) by mx01.net.com (Switch-2.2.5/Switch-2.2.5)         <[EMAIL PROTECTED]>; "
debug: tokenize: header tokens for *r = "  yahoo.com (adsl-64-109-198-100.dsl.yntwoh.ameritech.net [64.109.198]) by mx01.net.com (Switch-2.2.5/Switch-2.2.5)         <[EMAIL PROTECTED]>;    mx01.net.com (mx01.net.com [134.56.3]) by mx02-int.net.com (Switch-2.2.4/Switch-2.2.4)         <[EMAIL PROTECTED]>; "
Learned from 2 messages (8 messages examined).
debug: bayes: 3712 untie-ing
debug: bayes: 3712 untie-ing db_toks
debug: bayes: 3712 untie-ing db_seen
debug: bayes: files locked, now unlocking lock
debug: unlock: 3712 unlink /home/ben/.spamassassin/bayes.lock
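
The sa-learn run above can be cross-checked by counting the "already learnt correctly" lines against the final summary. Below is a rough Python sketch of that tally, assuming the debug and summary lines look exactly as they do in this run; `tally_sa_learn` is a hypothetical helper name, and the sample text is a condensed stand-in for the real log rather than the full output.

```python
import re

# Summary line format as printed at the end of the sa-learn run above.
SUMMARY = re.compile(r"Learned from (\d+) messages \((\d+) messages examined\)")


def tally_sa_learn(debug_text):
    """Count skipped messages and read the learned/examined totals from sa-learn -D output."""
    skipped = 0
    learned = examined = None
    for line in debug_text.splitlines():
        if "already learnt correctly, not learning twice" in line:
            skipped += 1
        match = SUMMARY.search(line)
        if match:
            learned, examined = int(match.group(1)), int(match.group(2))
    return {"skipped": skipped, "learned": learned, "examined": examined}


# Condensed stand-in for the log above: six skipped messages plus the summary line.
sample = "\n".join(
    ["debug: [EMAIL PROTECTED]: already learnt correctly, not learning twice"] * 6
    + ["Learned from 2 messages (8 messages examined)."]
)
result = tally_sa_learn(sample)
print(result)  # {'skipped': 6, 'learned': 2, 'examined': 8}
```

In this run the numbers add up (6 skipped plus 2 learned equals 8 examined), so sa-learn itself considers most of the directory already trained even though the scanner sees only 64 spams in the DB.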
