Paul Fielding wrote:
> 
> Quoting Bryan Hoover <[EMAIL PROTECTED]>:
> 
> > You could set these scripts' spamassassin, sa-learn commands with -D,
> > and use standard error redirection to a text file. The output will tell
> > you which Bayes database it's using.  You'd see such like:
> 
> I did this and learned a few things.  The following went to the log:
> 
> debug: using "/home/spamtrap/.spamassassin" for user state dir
> debug: using "/home/spamtrap/.spamassassin/user_prefs" for user prefs file
> debug: bayes: 2391 tie-ing to DB file
> R/O /home/sharedspam/.spamassassin/bayes_toks
> debug: bayes: 2391 tie-ing to DB file
> R/O /home/sharedspam/.spamassassin/bayes_seen
> debug: bayes: found bayes db version 2
> debug: bayes: Not available for scanning, only 19 spam(s) in Bayes DB < 200
> debug: bayes: 2391 untie-ing
> debug: bayes: 2391 untie-ing db_toks
> debug: bayes: 2391 untie-ing db_seen
> debug: Score set 1 chosen.
> debug: Initialising learner
> 
> The good thing is that it appears I'm hitting the correct bayes database.  The
> bit I don't really understand is the part about not being available for
> scanning.  I do understand from reading that the bayes database that it's most
> effective when it's learned a large volume of messages.  But how can I have it
> learn messages if it ignores the spam I'm trying to give it to learn?  It looks
> to me like it's opening the database in Read-Only mode (R/O?), decides the db
> is too small, and releases the database. Nothing gets written to the database,
> so I assume nothing gets learned.

The spamassassin run won't be able to use Bayes for testing a mail, as
the debug output says, until there's 200 each of spam, ham.  And though
I've only used sa-learn for Bayes training, I assume the linked spamtrap
outline is sound, Bayes learning as expected :) - handy to know
spamassassin options provide for this.

Ham/spam limits are a factor in terms of using the Bayes analysis in
detection, not training.

And obviously, the sa-learn run will learn what's sent it.  

As a follow-up check, you should see the corresponding ham/spam database
counts increase with each test you run.

Bryan

> Is it perhaps that the 19 spams it's referring to are spams that the regular
> rules have caught since I set up the database?  If so, then is it fair for me
> to assume that once it's caught +200 spams via regular rules then it will start
> actually using the bayes database and allow me to teach it?
> 
> Thanks muchly for the help...
> 
> regards,
> 
> Paul
> 
> Quoting Bryan Hoover <[EMAIL PROTECTED]>:
> 
> > Paul Fielding wrote:
> > >
> > > I recently set up a shared database with spamtrap and hamtrap accounts, as
> > per:
> > >
> > > http://www.stearns.org/doc/spamassassin-setup.current.html#autoreporting
> > >
> > > You can see the details of the procmail and local.cf files at the link
> > above,
> > > but the sort story is that the database is in
> > /home/sharedspam/.spamassassin,
> > > and accounts spamtrap and hamtrap have their .spamassassin dir linked back
> > to
> > > it.
> > >
> > > /etc/procmail points everyone to the shared database, and the .procmail
> > scripts
> > > for spamtrap and hamtrap take the incoming mail, process it though
> > spamassassin
> > > and sa-learn to teach the spam and ham, and then dump the messages into
> > another
> > > folder for me.
> >
> > You could set these scripts' spamassassin, sa-learn commands with -D,
> > and use standard error redirection to a text file. The output will tell
> > you which Bayes database it's using.  You'd see such like:
> >
> > debug: bayes: 1890297 tie-ing to DB file R/O
> > /home/Bryan/.spamassassin/bayes_toks
> > debug: bayes: 1890297 tie-ing to DB file R/O
> > /home/Bryan/.spamassassin/bayes_seen
> >
> > Bryan
> >
> >
> >
> > -------------------------------------------------------
> > The SF.Net email is sponsored by EclipseCon 2004
> > Premiere Conference on Open Tools Development and Integration
> > See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
> > http://www.eclipsecon.org/osdn
> > _______________________________________________
> > Spamassassin-talk mailing list
> > [EMAIL PROTECTED]
> > https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
> >
> 
> ----------
> [EMAIL PROTECTED]
> http://www.fielding.ca
> 
> -------------------------------------------------
> This mail sent through IMP: http://horde.org/imp/
> 
> -------------------------------------------------
> This mail sent through IMP: http://horde.org/imp/
> 
> -------------------------------------------------------
> The SF.Net email is sponsored by EclipseCon 2004
> Premiere Conference on Open Tools Development and Integration
> See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
> http://www.eclipsecon.org/osdn

-- 
One should be an enigma not just to others but to oneself too.  I study
myself.  When I'm tired of that I light a cigar to pass the time, and
think:  God only knows what the good Lord really meant with me, or what
He meant to make of me. - (Soren Kierkegaard - Either/Or)

http://www.wecs.com/content.htm

This signature file is generated by Pick-a-Tag !
Written by Jeroen van Vaarsel
http://www.google.com/search?hl=en&ie=ISO-8859-1&q=pick-a-tag



-------------------------------------------------------
The SF.Net email is sponsored by EclipseCon 2004
Premiere Conference on Open Tools Development and Integration
See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
http://www.eclipsecon.org/osdn
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to