On Tue, Dec 02, 2003 at 01:06:55PM -0600, Bob Apthorpe wrote:
> > 1. Can anybody tell me how to read the values obtained from sa-learn --dump?
> > The results of my dump are below:
> >
> > 0.000          0          2          0  non-token data: bayes db version
> > 0.000          0      49175          0  non-token data: nspam
> 
> You've trained on 49175 pieces of spam...

Yep, and you're running with DB version 2, which means you're using 2.6x. :)

> > 0.000          0          0          0  non-token data: nham
> 
> ... but no ham. Learn from at least 200 pieces of ham (sa-learn --ham ...)
> so Bayesian scoring kicks in.

Yep.  Actually, with 49k spam learned, you'll want far more than 200 ham
for the results to be accurate.
Most everything else is self-explanatory, I think.
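
If you want to check those counts from a script, here is a rough sketch in
Python.  It assumes the column layout shown in the dump above, where the value
for each "non-token data" line sits in the third column.  parse_magic and
MIN_HAM are names made up for this example; they are not part of
SpamAssassin.  The 200 figure is the one quoted in this thread.

```python
MIN_HAM = 200  # threshold mentioned in this thread, not read from any config


def parse_magic(dump_text):
    """Collect the 'non-token data' lines of a dump into a dict."""
    prefix = "non-token data: "
    values = {}
    for line in dump_text.splitlines():
        # Four numeric columns, then a free-text label.
        fields = line.split(None, 4)
        if len(fields) == 5 and fields[4].startswith(prefix):
            name = fields[4][len(prefix):].strip()
            values[name] = int(fields[2])  # value is in the third column
    return values


sample = """\
0.000          0          2          0  non-token data: bayes db version
0.000          0      49175          0  non-token data: nspam
0.000          0          0          0  non-token data: nham
"""

magic = parse_magic(sample)
if magic["nham"] < MIN_HAM:
    print("only", magic["nham"], "ham learned; train more ham")
```

In practice you would feed it the real dump output instead of the pasted
sample.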

0.000          0     125566          0  non-token data: ntokens

That's the number of tokens in the db.

0.000          0 1069458380          0  non-token data: oldest atime
0.000          0 1070374969          0  non-token data: newest atime

These are the atimes (as time_t values) of the oldest and newest tokens in the db.

0.000          0          0          0  non-token data: last journal sync atime

Your journal hasn't been synced yet.  Once it has, this will be an atime value as well.

0.000          0 1070114468          0  non-token data: last expiry atime

Your last expiry run occurred at this atime (Sat Nov 29 09:01:08 2003 EST).
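
To turn any of these atime values into something readable, a small Python
sketch like the one below works.  It treats EST as a fixed UTC-5 offset, which
is an assumption that holds for late November but ignores daylight saving.
atime_to_text is just an illustrative helper name.

```python
from datetime import datetime, timedelta, timezone

# Assumption: EST as a fixed UTC-5 offset, no daylight saving handling.
EST = timezone(timedelta(hours=-5), "EST")


def atime_to_text(atime, tz=EST):
    """Format a time_t value (seconds since the Unix epoch) as text."""
    return datetime.fromtimestamp(atime, tz).strftime("%a %b %d %H:%M:%S %Y %Z")


print(atime_to_text(1070114468))  # Sat Nov 29 09:01:08 2003 EST
```

The same helper can be used on the oldest and newest atime values from the
dump.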

0.000          0     571530          0  non-token data: last expire atime delta
0.000          0      67720          0  non-token data: last expire reduction count

"man sa-learn" has more information about how these values are used.
They're kept so the next expire run can use them for its estimates.
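
For a feel of the size of that delta, here is a quick conversion in Python.
It assumes the atime delta is in seconds, the same unit as the atime values
themselves; seconds_to_span is a made-up helper name.

```python
from datetime import timedelta


def seconds_to_span(seconds):
    """Render a number of seconds as days and hh:mm:ss."""
    return str(timedelta(seconds=seconds))


# "last expire atime delta" from the dump
print(seconds_to_span(571530))  # 6 days, 14:45:30

# span between the oldest and newest token atimes from the dump
print(seconds_to_span(1070374969 - 1069458380))  # 10 days, 14:36:29
```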

-- 
Randomly Generated Tagline:
"If Linux is really to gain "world domination" making sure that NT's
 strengths are Linux's strengths as well as making sure that NT's
 weaknesses are Linux's strengths is required."  - Paul Flinders
