On Sat, Oct 30, 2004 at 11:24:30AM -0400, [EMAIL PROTECTED] wrote:
> I have a development idea. How about the tokens db storing not only the hash
> and frequency, but also the actual plaintext string. The string would only
> be used for database dumps and reports, while the hash would be used for the
> actual matching and scoring.
> I think this would give the best of both worlds, the only potential issue
> being privacy. Given that words aren't associated with user accounts or
> messages in the DB, I don't really see any merit to the privacy argument.
>
> JP
>

Ahhh, you mean this:

http://bugzilla.spamassassin.org/show_bug.cgi?id=3331

Lots of work went into this and it turns out that the performance hit
is too great. So, hooks were added to the Plugin API that allow anyone
to write a plugin to extract the data.

Michael
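
For anyone following along, here is a rough sketch of the layout JP describes: the hash is the lookup key used for matching and counting, and the plaintext is an optional extra column that only the dump/report path reads. This is an illustration in Python, not SpamAssassin code. The names TokenRecord, TokenStore, and token_hash are made up for the example, and the truncated SHA-1 is just an assumed hash.

```python
import hashlib
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class TokenRecord:
    """Hypothetical per-token row: counts plus optional plaintext."""
    spam_count: int = 0
    ham_count: int = 0
    plaintext: Optional[str] = None  # read only by dump(), never by counts()


def token_hash(token: str) -> bytes:
    # Assumption for illustration: last 5 bytes of the SHA-1 digest.
    return hashlib.sha1(token.encode("utf-8")).digest()[-5:]


class TokenStore:
    """Hypothetical in-memory token db keyed by hash."""

    def __init__(self, keep_plaintext: bool = True) -> None:
        self.keep_plaintext = keep_plaintext
        self.records: Dict[bytes, TokenRecord] = {}

    def learn(self, token: str, is_spam: bool) -> None:
        # Matching and counting go through the hash only.
        rec = self.records.setdefault(token_hash(token), TokenRecord())
        if is_spam:
            rec.spam_count += 1
        else:
            rec.ham_count += 1
        # The extra write per new token is where added cost would come from.
        if self.keep_plaintext and rec.plaintext is None:
            rec.plaintext = token

    def counts(self, token: str) -> Tuple[int, int]:
        rec = self.records.get(token_hash(token))
        return (rec.spam_count, rec.ham_count) if rec else (0, 0)

    def dump(self) -> List[Tuple[str, int, int]]:
        # Reports show plaintext when stored, else the hash in hex.
        return sorted(
            (rec.plaintext or key.hex(), rec.spam_count, rec.ham_count)
            for key, rec in self.records.items()
        )
```

With keep_plaintext=False the store behaves like a hash-only db and dump() falls back to hex hashes, which is roughly the trade-off being discussed: readable reports versus extra storage and write cost per token.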