Hello srunschke,

Monday, January 16, 2006, 2:33:42 AM, you wrote:

sad> I'm currently trying to build up a new bayes DB here, ...
sad> ... yet it poses a problem to build up the ham part.
sad> ... Much of the inbound smtp mail either contains private or
sad> confidential information, so I cannot use them as I keep the
sad> source of the bayes messages in a Notes DB serverside - I'd run
sad> into privacy issues.

If you keep the source of your bayes messages in a Notes DB, then you
should have had enough ham to retrain your bayes with, no?

Bigger problem: bayes can only learn what it's taught.  If you have
ham that really should be trained, and because of privacy issues it
should not be kept after training, then you really should develop a
system which will enable you to train without retaining.  Bayes works
best when properly and fully trained, not just trained on "those
unimportant non-private emails are ham".

I can't make recommendations on how to do so in your system, but
you'll get better results from bayes if you figure out how to manage
it.

Bob Menschel



Reply via email to