On Sat, Oct 30, 2004 at 11:24:30AM -0400, [EMAIL PROTECTED] wrote:
> I have a development idea.  How about the tokens db storing not only the hash 
> and frequency, but also the actual plaintext string?  The string would only 
> be used for database dumps and reports, while the hash would be used for the 
> actual matching and scoring.
> I think this would give the best of both worlds, the only potential issue 
> being privacy.  Given that words aren't associated with user accounts or 
> messages in the DB, I don't really see any merit to the privacy argument.
> 
> JP
> 

Ahhh, you mean this:
http://bugzilla.spamassassin.org/show_bug.cgi?id=3331

A lot of work went into this, and it turned out that the performance
hit was too great.  Instead, hooks were added to the Plugin API so that
anyone can write a plugin to extract the data.
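
To make the trade-off concrete, here is a minimal sketch of the idea in
Python.  It is not SpamAssassin code and does not use the real Plugin
API: TokenStore, token_hash, and the on_token callback are hypothetical
names, and the truncated SHA-1 is only a stand-in for whatever hash the
real database uses.  The core store keeps hashes and counts only, and an
optional callback lets outside code record the plaintext for dumps.

```python
import hashlib
from collections import defaultdict


def token_hash(token: str) -> bytes:
    # Illustrative stand-in hash (assumption): last 5 bytes of SHA-1.
    return hashlib.sha1(token.encode("utf-8")).digest()[-5:]


class TokenStore:
    """Hypothetical token database keyed by hash, never by plaintext."""

    def __init__(self):
        # hash -> [spam_count, ham_count]
        self.counts = defaultdict(lambda: [0, 0])

    def learn(self, tokens, is_spam, on_token=None):
        # on_token plays the role of a plugin hook (hypothetical name):
        # it sees (hash, plaintext) but the store itself keeps no plaintext.
        for tok in tokens:
            h = token_hash(tok)
            self.counts[h][0 if is_spam else 1] += 1
            if on_token is not None:
                on_token(h, tok)

    def dump(self, names=None):
        # Show plaintext where the caller supplies it, else the hex hash.
        names = names or {}
        return sorted(
            (names.get(h, h.hex()), spam, ham)
            for h, (spam, ham) in self.counts.items()
        )
```

Under these assumptions, a site that wants readable reports passes a
callback such as names.__setitem__ on a plain dict and hands that dict
to dump(); everyone else pays nothing for plaintext storage.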

Michael
