On Apr 21, 2007, at 1:34 PM, Matt Kettler wrote:
time sa-learn on it, and feed it the WHOLE DIRECTORY at once. Do not
iterate messages, do not specify filenames, just give sa-learn the
name
of the directory.
Doing this on a directory with 6 messages takes about a second more
than doing it for a single message, which is promising. That said,
it isn't noticeably faster (tenths of a second) the second time
(timed using /usr/bin/time).
If it's not, and the first pass did learn messages, you've got a
problem.
That's promising (I have a problem, but problems can be found).
The other possibility is you've got write-lock contention. You can
avoid
a lot of this by using the bayes_learn_to_journal option, at the
expense
of causing your training to not take effect until the next sync.
For batch scripts I'm pretty comfortable doing everything with --no-
sync, with a --sync at the end.
-faisal