Re: sa-learn dump showing only binary tokens

2004-10-30 Thread Michael Parker
On Sat, Oct 30, 2004 at 11:24:30AM -0400, [EMAIL PROTECTED] wrote: > I have a development idea. How about the tokens db storing not only the hash > and frequency, but also the actual plaintext string. The string would only > be used for database dumps and reports, while the hash would be used f

Re: sa-learn dump showing only binary tokens

2004-10-30 Thread spamassassin
I have a development idea. How about the tokens db storing not only the hash and frequency, but also the actual plaintext string. The string would only be used for database dumps and reports, while the hash would be used for the actual matching and scoring. I think this would give the best of

Re: sa-learn dump showing only binary tokens

2004-10-29 Thread Matt Kettler
At 04:42 PM 10/29/2004, [EMAIL PROTECTED] wrote: Thanks for the responses. Good explanations that make perfect sense. SO.. now that I'm past the hex-in-db issue, I clearly do have some issue nonetheless. The following spam got through with a score of -4.3, seemingly because of the AWL. My AWL,

Re: sa-learn dump showing only binary tokens

2004-10-29 Thread spamassassin
Thanks for the responses. Good explanations that make perfect sense. SO.. now that I'm past the hex-in-db issue, I clearly do have some issue nonetheless. The following spam got through with a score of -4.3, seemingly because of the AWL. My AWL, however is empty per tools/check_whitelist. How

Re: sa-learn dump showing only binary tokens

2004-10-29 Thread Michael Parker
On Fri, Oct 29, 2004 at 03:49:13PM -0400, Matt Kettler wrote: > > Originally I responded to some emails from the Devs while 3.0 whas still in > development, and they agreed to have an option to allow you to force SA to > use plain-text bayes DBs, but apparently that option slowed things down too

Re: sa-learn dump showing only binary tokens

2004-10-29 Thread Michael Parker
On Fri, Oct 29, 2004 at 03:36:26PM -0400, [EMAIL PROTECTED] wrote: > > In short, when I run sa-learn --dump, I see a slew of binary tokens. I've > isolated the problem by creating a test directory, pointing sa-dump to it via > --dbpath, and creating a new db. Even after loading only a single s

Re: sa-learn dump showing only binary tokens

2004-10-29 Thread Matt Kettler
At 03:36 PM 10/29/2004, [EMAIL PROTECTED] wrote: I've searches low and high for answers to this problem, but I believe they answers out there don't have regular predictable keywords to find them. SA 3.0.1 Redhat FC2 Yes, that's what it's supposed to do in SA 3.x. SA 3.0.x does not store bayes toke

sa-learn dump showing only binary tokens

2004-10-29 Thread spamassassin
I've searches low and high for answers to this problem, but I believe they answers out there don't have regular predictable keywords to find them. SA 3.0.1 Redhat FC2 In short, when I run sa-learn --dump, I see a slew of binary tokens. I've isolated the problem by creating a test directory, poi