On Mon, Dec 08, 2003 at 01:58:47PM -0600, [EMAIL PROTECTED] wrote:
> So rather than backing up all three files, I just need to back up the
> bayes_toks file periodicly.  And I could use a file lock for this
> purpose, as you confirmed below.

Arguably you should backup seen as well, it has the message-ids that have
been learned already so that you don't double learn (or if you learn again
the other way (ham/spam), it'll deal with the token counts correctly.)

> Further questions: what config option controls how often the _journal
> and _toks files are sync'd?  would the sa-learn --rebuild do the sync
> of the journal and db manually?

yes, --rebuild syncs the journal.  I forget the exact name,
but it's something like bayes_journal_max_size.  the default is
to sync at 100k (filesize).  if you're not learning to the journal
(bayes_learn_to_journal), then how often it syncs doesn't really matter
since the data would only be used for an expiry run.  if you do learn
to journal, then you want it to sync more often.

> Thanks for the quick response, Theo!

no problem. :)

-- 
Randomly Generated Tagline:
"President Bush met with Pakistani President Musharraf today. And you
 could tell Bush has limited experience with Pakistanis. Every time the
 conversation with Musharraf came to a stop, Bush would ask him to turn
 off the meter."
         - Bill Maher, Politically Incorrect

Attachment: pgp00000.pgp
Description: PGP signature

Reply via email to