Hi,

> I would instead, in order of effectiveness:
>
> a) expire old tokens;
> b) eliminate tokens with very few ham/spam occurrences;
> c) eliminate tokens with very close nham to nspam values;

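To check that I am reading those three criteria correctly, here is a rough sketch over a hypothetical in-memory token table. The `Token` fields, the `prune` function, and all three thresholds are my own assumptions for illustration, not the real Bayes DB's schema or tooling:

```python
from dataclasses import dataclass

SECONDS_PER_DAY = 86400


@dataclass
class Token:
    nspam: int    # times seen in spam (assumed field name)
    nham: int     # times seen in ham (assumed field name)
    atime: float  # last-seen time, seconds since epoch (assumed field name)


def prune(tokens, now, max_age_days=90, min_count=3, min_skew=0.2):
    """Return only the tokens that survive all three criteria.

    Thresholds are illustrative guesses, not recommended values.
    """
    kept = {}
    for name, tok in tokens.items():
        total = tok.nspam + tok.nham
        # a) expire tokens not seen for longer than max_age_days
        if now - tok.atime > max_age_days * SECONDS_PER_DAY:
            continue
        # b) drop tokens with very few occurrences (also avoids total == 0)
        if total < max(min_count, 1):
            continue
        # c) drop tokens whose nham and nspam are nearly equal,
        #    measured here as a relative difference (my assumption)
        if abs(tok.nspam - tok.nham) / total < min_skew:
            continue
        kept[name] = tok
    return kept
```

With these made-up thresholds, a token seen 100 times in spam and 95 times in ham would be dropped under c), while one seen 50 times in spam and once in ham would be kept.
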
Can you explain how to do this, or point me to documentation that explains it? My bayes DB is far too big, but mostly effective. I'd just like to trim it by removing the tokens that occur infrequently.

Thanks,
Alex