And there is one of these for each user, this is just for one user.  Sounds like we may have to abandon Bayes or possibly use mysql.  Not sure we are ready to invest in setting that all up...

Theo Van Dinter <[EMAIL PROTECTED]> wrote:
On Fri, Oct 27, 2006 at 03:01:45PM -0700, Sammy Anderson wrote:
> I manually ran sa-learn --force-expire, and it hammered the box. Here is a debug and timing information (for just a 5 MB file!):
>
> [18002] dbg: bayes: token count: 161725, final goal reduction size: 49225

want to get rid of (max) 49225 tokens

> [18002] dbg: bayes: can't use estimation method for expiry, unexpected result, calculating optimal atime delta (first pass)

have to do step 1 and can't estimate

> [18002] dbg: bayes: expiry max exponent: 9
> ------ about 20 seconds elapsed

it's going through every token in your db

> [18002] dbg: bayes: atime token reduction
> [18002] dbg: bayes: ======== ===============
> [18002] dbg: bayes: 43200 144256
> [18002] dbg: bayes: 86400 133029
> [18002] dbg: bayes: 172800 111350
> [18002] dbg: bayes: 345600 72306
> [18002] dbg: bayes: 691200 9457
> [18002] dbg: bayes: 1382400 0
[...]
> [18002] dbg: bayes: first pass decided on 691200 for atime delta

691200 wins the Price Is Right (9457 is the closest without going over)

> ------ about 40 seconds elapsed [a sort going on here???]

It's creating a new DB file, going back through every token in the original
DB, and for any that are newer than 9457 seconds ago, it copies the entry to
the new DB.

> expired old bayes database entries in 60 seconds <= YIKES

yep. expiry is relatively resource intensive and slow w/ DBMs, but
there's no other good way to do it (or at least, no one has suggested
a really better way to do it...)

--
Randomly Selected Tagline:
I believe it's not butter, I just can't believe it's $1.59!


Get your email and see which of your friends are online - Right on the new Yahoo.com

Reply via email to