Re: bayes/txrep questions

2025-02-16 Thread Bill Cole
On 2025-02-16 at 10:38:38 UTC-0500 (Sun, 16 Feb 2025 10:38:38 -0500) Alex is rumored to have said: [quoting me] TxRep (like AWL) is fed not by Bayes learning (sa-learn) but rather it tracks the combination of an address and a source IP range (/24) with a tally of the SA scores of messages usi
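
The bookkeeping described in this snippet (an address paired with a /24 source range, plus a running tally of scores) can be sketched roughly as follows. This is only an illustration in Python under stated assumptions, not TxRep's actual code: the names `reputation_key` and `ScoreTally` are hypothetical, only IPv4 is handled, and the real plugin's storage and weighting are not shown here.

```python
import ipaddress
from collections import defaultdict


def reputation_key(address: str, source_ip: str) -> tuple[str, str]:
    """Build a (sender address, /24 network) key. Hypothetical helper, IPv4 only."""
    # strict=False lets us pass a host address and get its enclosing /24 back.
    network = ipaddress.ip_network(f"{source_ip}/24", strict=False)
    return (address.lower(), str(network))


class ScoreTally:
    """Keep a running total and message count per key (assumed structure)."""

    def __init__(self) -> None:
        self._totals = defaultdict(lambda: [0.0, 0])  # key -> [score sum, count]

    def record(self, address: str, source_ip: str, score: float) -> None:
        entry = self._totals[reputation_key(address, source_ip)]
        entry[0] += score
        entry[1] += 1

    def mean(self, address: str, source_ip: str):
        """Return the mean recorded score for this key, or None if unseen."""
        key = reputation_key(address, source_ip)
        if key not in self._totals:
            return None
        total, count = self._totals[key]
        return total / count
```

Two messages from the same address arriving from 192.0.2.10 and 192.0.2.200 land under the same key in this sketch, because both hosts fall inside 192.0.2.0/24.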

Re: bayes/txrep questions

2025-02-16 Thread Alex
> > > > > Is there any benefit to training an email that's already hitting > > bayes99? > > Yes. The tokens which made it hit 99% are already doing their jobs, but > the rest of the message that Bayes isn't seeing as spammy may turn out > to be what makes the next spam hit 99.9% > I have noticed t

Re: bayes/txrep questions

2025-02-14 Thread Bill Cole
On 2025-02-14 at 17:00:03 UTC-0500 (Fri, 14 Feb 2025 17:00:03 -0500) Alex is rumored to have said: Hi, I'm using SA v4 and trying to find ways to minimize the amount of junk that isn't tagged. Emails like "1-hour free consultation" or "buy this event list" or "salesforce optimization" or "HR

Re: bayes/txrep questions

2025-02-14 Thread Greg Troxel
Alex writes: > These also aren't always one-offs, but maybe a dozen or twenty of each over > a short period that get through, likely before the URIs are blocked through > other means. Other times they don't have a link at all. Sounds like fairly aggressive greylisting is in order.

Re: bayes/txrep questions

2025-02-14 Thread John Hardin
On Fri, 14 Feb 2025, Alex wrote: Hi, I'm using SA v4 and trying to find ways to minimize the amount of junk that isn't tagged. Emails like "1-hour free consultation" or "buy this event list" or "salesforce optimization" or "HR consulting" that already hit bayes99 (and bayes999) but are still jus

Re: Bayes in V4 compared to V3

2024-11-13 Thread Grega via users
day difference now :) Regards, Gregor From: hg user Sent: Tuesday, 12 November 2024 13:58 To: Grega Cc: users@spamassassin.apache.org Subject: Re: Bayes in V4 compared to V3 Yes On Tue, 12 Nov 2024, 13:53 Grega via users mailto:users@spamassassin.apache.org

Re: Bayes in V4 compared to V3

2024-11-12 Thread Shawn Iverson
> > As documented: > > # perldoc Mail::SpamAssassin::BayesStore::SQL > NAME > Mail::SpamAssassin::BayesStore::SQL - SQL Bayesian Storage Module > Implementation > > DESCRIPTION > This module implements a SQL based bayesian storage module. It's > compatible with SQLite and possibly other standard SQ

Re: Bayes in V4 compared to V3

2024-11-12 Thread hg user
Yes On Tue, 12 Nov 2024, 13:53 Grega via users wrote: > If we used SQL and now switched to MySQL do we have to re-train bayes? > > -- > *From:* Bill Cole > *Sent:* Tuesday, 12 November 2024 13:35 > *To:* users@spamassassin.apache.org > *Subj

Re: Bayes in V4 compared to V3

2024-11-12 Thread Grega via users
If we used SQL and now switched to MySQL do we have to re-train bayes? From: Bill Cole Sent: Tuesday, 12 November 2024 13:35 To: users@spamassassin.apache.org Subject: Re: Bayes in V4 compared to V3 On 2024-11-12 at 00:33:13 UTC-0500 (Tue, 12 Nov 2024 00:33:13

Re: Bayes in V4 compared to V3

2024-11-12 Thread Bill Cole
On 2024-11-12 at 00:33:13 UTC-0500 (Tue, 12 Nov 2024 00:33:13 -0500) Shawn Iverson is rumored to have said: [...] > The "-D bayes" parameter was quite informative. Thank you. Turns out the > database wasn't being read properly with the bayes_store_module in use. > Maybe it has something to do wit

Re: Bayes in V4 compared to V3

2024-11-11 Thread Grega via users
: users@spamassassin.apache.org; Grega Cc: hg user Subject: Re: Bayes in V4 compared to V3 On Mon, Nov 11, 2024 at 4:48 PM hg user mailto:mercurialu...@gmail.com>> wrote: In spamassassin 3 you could debug bayes points running command line spamassassin with "-D bayes" parameter. I

Re: Bayes in V4 compared to V3

2024-11-11 Thread Shawn Iverson
On Mon, Nov 11, 2024 at 4:48 PM hg user wrote: > In spamassassin 3 you could debug bayes points running command line > spamassassin with "-D bayes" parameter. I think you can in version 4 too. > > In the log all the tokens extracted from the message are listed with the > points assigned, so you c

Re: Bayes in V4 compared to V3

2024-11-11 Thread hg user
values like BAYES_05 and >BAYES_99 ? > > > Thanks for listening and any input 😊 > > Regards,G > > -- > *From:* Grega via users > *Sent:* Wednesday, 25 September 2024 20:02 > *To:* users@spamassassin.apache.org > *Subject:* Re: Baye

Re: Bayes in V4 compared to V3

2024-11-05 Thread Grega via users
via users Sent: Wednesday, 25 September 2024 20:02 To: users@spamassassin.apache.org Subject: Re: Bayes in V4 compared to V3 Oh god I`m idiot... I had: score BAYES_20 0.0 So now every mail has bayes score in it (changed it to score BAYES_20 0.1) Still puzzling why I have no extreme low or

Re: Bayes in V4 compared to V3

2024-09-25 Thread Grega via users
off. From: Matija Nalis Sent: Wednesday, 25 September 2024 18:23 To: users@spamassassin.apache.org Subject: Re: Bayes in V4 compared to V3 On Mon, Sep 23, 2024 at 01:14:25PM +, Grega via users wrote: > Why one has "BAYES_60" and other 2 not? &

Re: Bayes in V4 compared to V3

2024-09-25 Thread Grega via users
Hi. Im on mysql backend. Load is none .. From: Matija Nalis Sent: Wednesday, September 25, 2024 18:24 To: users@spamassassin.apache.org Subject: Re: Bayes in V4 compared to V3 On Mon, Sep 23, 2024 at 01:14:25PM +, Grega via users wrote: > Why one

Re: Bayes in V4 compared to V3

2024-09-25 Thread Matija Nalis
On Tue, Sep 24, 2024 at 08:10:38AM +, Grega via users wrote: > Also this: > > Rule | Description | Score | Total | Ham | Col6 | Spam | Col8 > BAYES_40 | Bayes spam probability is 20 to 40% | 0.00 | 2,784 | 2,721 | 97.7 | 63 | 2.3 > BAYES_50 | Bayes spam probability is 40 to 6

Re: Bayes in V4 compared to V3

2024-09-25 Thread Matija Nalis
On Mon, Sep 23, 2024 at 01:14:25PM +, Grega via users wrote: > Why one has "BAYES_60" and other 2 not? > > 4. Race condition (IDK I`m not coder) What backend are you using for storing Bayes data? I'm not yet on 4.x (Debian Stable FTW), but in SA 3.x default was a local file storage (BDB?

Re: Bayes in V4 compared to V3

2024-09-24 Thread Grega via users
via users Sent: Monday, 23 September 2024 15:14 To: users@spamassassin.apache.org Subject: Re: Bayes in V4 compared to V3 Hi again. In V4 there is something wrong with bayes... I received 3 identical mails (1 external sender, 3 internal recipients) and scores are like this: 2 X like

Re: Bayes in V4 compared to V3

2024-09-23 Thread Grega via users
es to skip some? 4. Race condition (IDK I`m not coder) 5. Bayes behaves non consistent on BOTH installs I have it on From: John Hardin Sent: Friday, 13 September 2024 20:38 To: SpamAssassin-Users Subject: Re: Bayes in V4 compared to V3 On Fri, 13 Sep 2024,

Re: Bayes in V4 compared to V3

2024-09-13 Thread John Hardin
On Fri, 13 Sep 2024, Bill Cole wrote: Please send any replies to the list only. ...or to Harald only. -- John Hardin KA7OHZ http://www.impsec.org/~jhardin/ jhar...@impsec.org pgpk -a jhar...@impsec.org key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C

Re: Bayes in V4 compared to V3

2024-09-13 Thread Benny Pedersen
Grega via users wrote on 2024-09-13 16:16: Sorry guys if I replied to all, my intentions were not to spam :) top posters :) imho not impossible to request 3rd party list archives to make a password for users, never mind eggs came before chickens :=)

Re: Bayes in V4 compared to V3

2024-09-13 Thread Grega via users
Sorry guys if I replied to all, my intentions were not to spam :) From: Benny Pedersen Sent: Friday, 13 September 2024 15:13 To: users@spamassassin.apache.org Subject: Re: Bayes in V4 compared to V3 Bill Cole wrote on 2024-09-13 15:03: > Please send

Noise Around This List (was Re: Bayes in V4 compared to V3)

2024-09-13 Thread Bill Cole
On 2024-09-13 at 09:13:58 UTC-0400 (Fri, 13 Sep 2024 15:13:58 +0200) Benny Pedersen is rumored to have said: Bill Cole wrote on 2024-09-13 15:03: Please send any replies to the list only. unsubscribe listarchivers ? and make archived on apache.org with bugzilla login don't know if it wil

Re: Bayes in V4 compared to V3

2024-09-13 Thread Antony Stone
On Friday 13 September 2024 at 15:13:58, Benny Pedersen wrote: > Bill Cole wrote on 2024-09-13 15:03: > > Please send any replies to the list only. > > unsubscribe listarchivers ? > and make archived on apache.org with bugzilla login > don't know if it will help or not, but chicken and egg I do

Re: Bayes in V4 compared to V3

2024-09-13 Thread Benny Pedersen
Bill Cole wrote on 2024-09-13 15:03: Please send any replies to the list only. unsubscribe listarchivers ? and make archived on apache.org with bugzilla login don't know if it will help or not, but chicken and egg

Re: Bayes in V4 compared to V3

2024-09-13 Thread Bill Cole
9-13 at 05:00:17 UTC-0400 (Fri, 13 Sep 2024 09:00:17 +) Grega is rumored to have said: Do you have V3 or V4 SA? From: Reindl Harald (privat) Sent: Friday, 13 September 2024 10:57 To: Grega; Bill Cole; Grega via users Subject: Re: Bayes in V4 compared to V3

Re: Bayes in V4 compared to V3

2024-09-13 Thread Grega via users
Do you have V3 or V4 SA? From: Reindl Harald (privat) Sent: Friday, 13 September 2024 10:57 To: Grega; Bill Cole; Grega via users Subject: Re: Bayes in V4 compared to V3 autolearn was always a blackbox that below are the stats for the current month and that

Re: Bayes in V4 compared to V3

2024-09-13 Thread Grega via users
September 2024 10:22 To: Grega; Bill Cole; Grega via users Subject: Re: Bayes in V4 compared to V3 On 13.09.24 at 06:53, Grega via users wrote: > And I`m reconfiguring autolearn to -4 for HAM and 12 for SPAM to really > auto-train on correct mails... this is even more nonsense than autolearn

Re: Bayes in V4 compared to V3

2024-09-12 Thread Grega via users
September 2024 21:38 To: Grega via users Subject: Re: Bayes in V4 compared to V3 On 2024-09-12 at 14:05:11 UTC-0400 (Thu, 12 Sep 2024 18:05:11 +) Grega via users is rumored to have said: Hi. I have SA 4.0.1 configured it, all is good, except for bayes. It IS working, it IS learning b

Re: Bayes in V4 compared to V3

2024-09-12 Thread Bill Cole
On 2024-09-12 at 14:05:11 UTC-0400 (Thu, 12 Sep 2024 18:05:11 +) Grega via users is rumored to have said: Hi. I have SA 4.0.1 configured it, all is good, except for bayes. It IS working, it IS learning but when it classifies mail it is really not so decisive as it was in V3. I have: db

Re: Bayes "corpus" - how old?

2024-01-31 Thread Bill Cole
On 2024-01-31 at 08:16:13 UTC-0500 (Wed, 31 Jan 2024 14:16:13 +0100) Matus UHLAR - fantomas is rumored to have said: On 2024-01-30 at 12:08:18 UTC-0500 (Tue, 30 Jan 2024 18:08:18 +0100) Matus UHLAR - fantomas is rumored to have said: [...] autolearn may help if your DB is well maintained, alt

Re: Bayes "corpus" - how old?

2024-01-31 Thread Matus UHLAR - fantomas
On 2024-01-30 at 12:08:18 UTC-0500 (Tue, 30 Jan 2024 18:08:18 +0100) Matus UHLAR - fantomas is rumored to have said: [...] autolearn may help if your DB is well maintained, although I have disabled nearly all rules with negative scores, like RCVD_IN_DNSWL_* RCVD_IN_IADB_* DKIMWL_WL_* RCVD_IN_

Re: Bayes "corpus" - how old?

2024-01-30 Thread Bill Cole
On 2024-01-30 at 12:08:18 UTC-0500 (Tue, 30 Jan 2024 18:08:18 +0100) Matus UHLAR - fantomas is rumored to have said: [...] autolearn may help if your DB is well maintained, although I have disabled nearly all rules with negative scores, like RCVD_IN_DNSWL_* RCVD_IN_IADB_* DKIMWL_WL_* RCVD_IN_

Re: Bayes "corpus" - how old?

2024-01-30 Thread Matus UHLAR - fantomas
On 30.01.24 09:59, joe a wrote: Advisable to "prune" Bayes data based on age? While cleaning up recent Ham/Spam, found my "saved SPAM" goes back to 2013. Why that's over . . . wait, I need to take off my socks . . . So, how old is "too old".  For saved SPAM? On 1/30/2024 10:58:52, Matus U

Re: Bayes "corpus" - how old?

2024-01-30 Thread joe a
On 1/30/2024 10:58:52, Matus UHLAR - fantomas wrote: On 30.01.24 09:59, joe a wrote: Advisable to "prune" Bayes data based on age? While cleaning up recent Ham/Spam, found my "saved SPAM" goes back to 2013. Why that's over . . . wait, I need to take off my socks . . . So, how old is "too ol

Re: Bayes "corpus" - how old?

2024-01-30 Thread Bill Cole
On 2024-01-30 at 09:59:52 UTC-0500 (Tue, 30 Jan 2024 09:59:52 -0500) joe a is rumored to have said: Advisable to "prune" Bayes data based on age? Yes. That is why it has an expiration model. Expiration may be de facto blocked on some busy systems so you may need to explicitly force it occas

Re: Bayes "corpus" - how old?

2024-01-30 Thread Matus UHLAR - fantomas
On 30.01.24 09:59, joe a wrote: Advisable to "prune" Bayes data based on age? While cleaning up recent Ham/Spam, found my "saved SPAM" goes back to 2013. Why that's over . . . wait, I need to take off my socks . . . So, how old is "too old". For saved SPAM? I did retrain on old spam a fe

Re: Bayes Stopword

2023-12-29 Thread Jimmy
This is what I believe: the words need to be trimmed or separated, and careful consideration is required to determine the language in order to perform accurate cutoffs. Jimmy On Fri, Dec 29, 2023 at 5:16 PM wrote: > "ทุก" is not considered a word because it's part of the token > "ทุกวันพุธเล่นช

Re: Bayes Stopword

2023-12-29 Thread giovanni
"ทุก" is not considered a word because it's part of the token "ทุกวันพุธเล่นชนะรับเพิ่ม". Words must be separated by spaces, otherwise we should skip the word "theme" just because "the" is in english stopword list. No idea if this makes sense for asian languages. Giovanni On 12/29/23 11:04, J
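
The whole-token rule described in this snippet can be illustrated with a short Python sketch. It is not the code in Plugin/Bayes.pm (which is Perl and regexp-based); the function names and the tiny stopword sets are assumptions made up for the example. It only shows why "theme" survives an English stopword list containing "the", and why a Thai stopword inside an unspaced run of text is not matched when tokens are split on whitespace.

```python
def drop_stopwords(text: str, stopwords: set[str]) -> list[str]:
    """Split on whitespace and drop tokens that equal a stopword exactly."""
    return [tok for tok in text.split() if tok.lower() not in stopwords]


def substring_hits(text: str, stopwords: set[str]) -> list[str]:
    """For contrast: tokens that merely contain a stopword somewhere inside."""
    return [
        tok
        for tok in text.split()
        if any(sw in tok.lower() for sw in stopwords)
    ]


# English: "theme" is kept because only whole tokens are compared.
english = drop_stopwords("the theme from", {"the", "from"})

# Thai: the run has no spaces, so it is a single token and is kept,
# even though it begins with the stopword.
thai = drop_stopwords("ทุกวันพุธเล่นชนะรับเพิ่ม", {"ทุก"})
```

Substring matching (the second function) would flag both "theme" and the Thai run, which is the over-matching the whole-token rule avoids. Languages written without spaces would need a word segmenter before any stopword list could apply, as the later replies in the thread suggest.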

Re: Bayes Stopword

2023-12-29 Thread Jimmy
The sample email and word list should contain at least these words. ถูก เลย ทุก Jimmy On Fri, Dec 29, 2023 at 4:47 PM wrote: > I do not speak Thai but I cannot see any word in the sample email that > should match that list. > Which word do you think should match the regexp ? > Giovanni > > O

Re: Bayes Stopword

2023-12-29 Thread giovanni
I do not speak Thai but I cannot see any word in the sample email that should match that list. Which word do you think should match the regexp ? Giovanni On 12/29/23 10:08, Jimmy wrote: You can use this word list https://raw.githubusercontent.com/stopwords-iso/stopwords-th/master/stopwords-th

Re: Bayes Stopword

2023-12-29 Thread Jimmy
You can use this word list https://raw.githubusercontent.com/stopwords-iso/stopwords-th/master/stopwords-th.txt Jimmy On Fri, Dec 29, 2023 at 3:59 PM wrote: > To create the stopwords regexp I used the script I shared in a previous > email and a list of words one per line. > Could you share the

Re: Bayes Stopword

2023-12-29 Thread giovanni
To create the stopwords regexp I used the script I shared in a previous email and a list of words one per line. Could you share the list you are using ? Giovanni On 12/29/23 09:22, Jimmy wrote: I use SpamAssassin 4.0.0 (2022-12-14) $ spamassassin -D --lint 2>&1 | grep bayes: Dec 29 15:17:56

Re: Bayes Stopword

2023-12-29 Thread Jimmy
I use SpamAssassin 4.0.0 (2022-12-14) $ spamassassin -D --lint 2>&1 | grep bayes: Dec 29 15:17:56.919 [17420] dbg: bayes: stopword found lang=en Dec 29 15:17:56.919 [17420] dbg: bayes: stopword found lang=th Dec 29 15:17:56.919 [17420] dbg: bayes: stopword found lang=ru Dec 29 15:17:56.919 [17420]

Re: Bayes Stopword

2023-12-28 Thread giovanni
Config line produces a syntax error for me: config: failed to parse line in /etc/mail/spamassassin/local.cf (line 1): bayes_stopword_th Could you share the word list in utf8 ? I tried adding "บาท" to https://raw.githubusercontent.com/stopwords-iso/stopwords-th/master/stopwords-th.txt and it pr

Re: Bayes Stopword

2023-12-28 Thread Jimmy
bayes_stopword_th https://pastebin.pl/view/0838138d Sample mail https://pastebin.pl/view/e5a2c5b8 Jimmy On Thu, Dec 28, 2023 at 10:59 PM wrote: > Could you share a config line and a sample you are using ? > Giovanni > > On 12/28/23 16:26, Jimmy wrote: > > Yes, I have done that, and I am also

Re: Bayes Stopword

2023-12-28 Thread giovanni
Could you share a config line and a sample you are using ? Giovanni On 12/28/23 16:26, Jimmy wrote: Yes, I have done that, and I am also editing Plugin/Bayes.pm to investigate why it is not being skipped. I suspect that if words are not separated by spaces, longer words may not match those pa

Re: Bayes Stopword

2023-12-28 Thread Jimmy
Yes, I have done that, and I am also editing Plugin/Bayes.pm to investigate why it is not being skipped. I suspect that if words are not separated by spaces, longer words may not match those patterns. Jimmy On Thu, Dec 28, 2023 at 10:13 PM wrote: > "spamassassin -D bayes" will tell you, you sho

Re: Bayes Stopword

2023-12-28 Thread giovanni
"spamassassin -D bayes" will tell you, you should see a line like: bayes: skipped token 'from' because it's in stopword list for language 'en' Giovanni On 12/28/23 15:45, Jimmy wrote: The pattern has successfully passed the test script, but it needs to check whether Bayes learning will identi

Re: Bayes Stopword

2023-12-28 Thread Jimmy
The pattern has successfully passed the test script, but it needs to check whether Bayes learning will identify and possibly exclude the word from matching this pattern. Thank you. On Thu, Dec 28, 2023 at 9:22 PM wrote: > On 12/28/23 12:59, Jimmy wrote: > > Hi, > > > > I'm seeking assistance i

Re: Bayes Stopword

2023-12-28 Thread giovanni
On 12/28/23 12:59, Jimmy wrote: Hi, I'm seeking assistance in incorporating a stopword for Asian languages in Unicode. Although I possess comprehensive word lists, my attempts to generate a regex pattern and test it have been unsuccessful; the pattern fails to match or skips tokens in the new

Re: Bayes always reject.

2023-12-13 Thread Jeff Mincy
> From: Pierluigi Frullani > Date: Wed, 13 Dec 2023 07:49:24 +0100 > > Hello all, > I'm facing a strange problem. ... > tests=BAYES_95,MISSING_DATE,MISSING_HEADERS,NO_RECEIVED,NO_RELAYS,T_SCC_BODY_TEXT_LINE How did you feed this message into SpamAssassin? Did you do something to strip of

Re: Bayes always reject.

2023-12-13 Thread Bill Cole
On 2023-12-13 at 01:49:24 UTC-0500 (Wed, 13 Dec 2023 07:49:24 +0100) Pierluigi Frullani is rumored to have said: Hello all, I'm facing a strange problem. Not really. MANY people run into this issue... I've feed the bayes db for a while and now I would like to put it in use but all message

Re: BAYES scores

2023-03-01 Thread Benny Pedersen
joe a wrote on 2023-02-28 17:37: Curious as to why these scores, apparently "stock" are what they are. I'd expect BAYES_999 BODY to count more than BAYES_99 BODY. Noted in a header this morning: * 3.5 BAYES_99 BODY: Bayes spam probability is 99 to 100% * [score: 1.] * 0.2 BAYES_999

Re: BAYES scores

2023-02-28 Thread Loren Wilton
From: "Bill Cole" It is my understanding that an automated rescoring job was run quite some time ago (before I was on the PMC) to generate the Bayes scores, which determined that to be the best supplemental score to give to the greater certainty. I was around in those days. My memory isn't

Re: BAYES scores

2023-02-28 Thread Benny Pedersen
joe a wrote on 2023-02-28 17:37: Curious as to why these scores, apparently "stock" are what they are. I'd expect BAYES_999 BODY to count more than BAYES_99 BODY. Noted in a header this morning: * 3.5 BAYES_99 BODY: Bayes spam probability is 99 to 100% * [score: 1.] * 0.2 BAYES_999

Re: BAYES scores

2023-02-28 Thread Bill Cole
On 2023-02-28 at 13:38:35 UTC-0500 (Tue, 28 Feb 2023 13:38:35 -0500) joe a is rumored to have said: On 2/28/2023 12:05 PM, Jeff Mincy wrote: > From: joe a > Date: Tue, 28 Feb 2023 11:37:34 -0500 > > Curious as to why these scores, apparently "stock" are what they are. > I'd expect

Re: BAYES scores

2023-02-28 Thread hg user
From my small experience... I score BAYES_999 with 2.00, it was suggested to me months ago. But nowadays I'd be more careful and do some more testing: I'd check which messages have only BAYES_99 and which have BAYES_999. If you are absolutely certain that BAYES_999 are only and definitively spam,

Re: BAYES scores

2023-02-28 Thread joe a
On 2/28/2023 12:05 PM, Jeff Mincy wrote: > From: joe a > Date: Tue, 28 Feb 2023 11:37:34 -0500 > > Curious as to why these scores, apparently "stock" are what they are. > I'd expect BAYES_999 BODY to count more than BAYES_99 BODY. > > Noted in a header this morning: > > * 3.5

Re: BAYES scores

2023-02-28 Thread Jeff Mincy
> From: joe a > Date: Tue, 28 Feb 2023 11:37:34 -0500 > > Curious as to why these scores, apparently "stock" are what they are. > I'd expect BAYES_999 BODY to count more than BAYES_99 BODY. > > Noted in a header this morning: > > * 3.5 BAYES_99 BODY: Bayes spam probability is 99 to 1

Re: bayes in sqlite db

2022-08-16 Thread Matt Corallo
Heh, I know this thread is so old it might as well be dead, but this does work. Note that you may need to apply the patch from Bug 7932 until the next release. bayes_store_module Mail::SpamAssassin::BayesStore::SQL bayes_sql_dsn DBI:SQLite:/path/to/bayes.sqlite On 5/26/22 9:25 AM, Michael Grant

Re: Bayes autolearn: how does it resolve whether rules are body or header related?

2021-05-10 Thread RW
On Mon, 10 May 2021 20:39:31 +0200 Bert Van de Poel wrote: > Based on what I've read, I agree that this is indeed a bug (or > actually several). I've filed the following bug reports: > https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7904 (missing body > types, as mentioned by RW) > https://bz

Re: Bayes autolearn: how does it resolve whether rules are body or header related?

2021-05-10 Thread Bert Van de Poel
Dear Loren, Thank you very much for your email. Based on your message I could deduce there were earlier messages (which I then read through a web archive). For some unexplained reason I never received the previous 3 responses to my email. I hope the university network isn't randomly over-filte

Re: Bayes autolearn: how does it resolve whether rules are body or header related?

2021-05-09 Thread Loren Wilton
so you don't have points from body rules. your mentioned URI_DEOBFU_INSTR is a meta rule: meta URI_DEOBFU_INSTR __URI_DEOBFU_INSTR && !__MSGID_OK_HOST so maybe it's not considered. They are treated as header, or ignored if marked as net. I think a bug report should be submitted for this. E

Re: Bayes autolearn: how does it resolve whether rules are body or header related?

2021-05-09 Thread RW
On Sun, 9 May 2021 20:03:27 +0200 Matus UHLAR - fantomas wrote: > so you don't have points from body rules. > > your mentioned URI_DEOBFU_INSTR is a meta rule: > > meta URI_DEOBFU_INSTR __URI_DEOBFU_INSTR && !__MSGID_OK_HOST > > so maybe it's not considered. They are treated as header, or ign

Re: Bayes autolearn: how does it resolve whether rules are body or header related?

2021-05-09 Thread Matus UHLAR - fantomas
On 09.05.21 04:17, Bert Van de Poel wrote: Dear fellow SpamAssassin users, I recently noticed that quite a lot of spam emails with high scores weren't marked for Bayes autolearning. While some senders and receivers were a common match, explaining why autolearn was no, there was no clear expl

Re: Bayes autolearn: how does it resolve whether rules are body or header related?

2021-05-09 Thread RW
On Sun, 9 May 2021 04:17:26 +0200 Bert Van de Poel wrote: > Within the same realm, I'm also wondering whether these expected > numbers for body and header can be tweaked and if so, how. You can create a meta-rule for definite spam and set: tflags autolearn_force a hit on any rule with th

Re: Bayes converstion: SQL--> Redis?

2021-02-04 Thread Kevin A. McGrail
On 2/4/2021 5:32 AM, Giovanni Bechis wrote: On 2/4/21 10:47 AM, Dan Mahoney (Gushi) wrote: Hey there all, In looking at my sql server, it looks like the on-disk size of my MySQL DB's is like 9G (because of InnoDB, it's hard to glean just from the filesystem what tables are which). Anyway,

Re: Bayes converstion: SQL--> Redis?

2021-02-04 Thread Giovanni Bechis
On 2/4/21 10:47 AM, Dan Mahoney (Gushi) wrote: > Hey there all, > > In looking at my sql server, it looks like the on-disk size of my MySQL DB's > is like 9G (because of InnoDB, it's hard to glean just from the filesystem > what tables are which). > > Anyway, I'd like to move over to a global r

Re: bayes and InnoDB read locks

2020-11-30 Thread Benny Pedersen
Alex wrote on 2020-11-30 17:33: 2020-11-30 10:11:46 2772099 [Note] InnoDB: *** (2) WAITING FOR THIS LOCK TO BE GRANTED: have you tried running Bayes with AriaDB? note i am not expert at all

Re: Bayes files LOCK

2020-02-14 Thread RW
On Fri, 14 Feb 2020 10:46:00 -0800 (PST) John Hardin wrote: > On Fri, 14 Feb 2020, RW wrote: > > > I was looking at it in a bit more detail and it looks like there > > isn't a reader-writer lock, just write locks for the toks and seen > > files. As scans defer their writes through the journal t

Re: Bayes files LOCK

2020-02-14 Thread Pedro David Marco
On Friday, February 14, 2020, 7:46:18 PM GMT+1, John Hardin wrote: >> I was looking at it in a bit more detail and it looks like there isn't >> a reader-writer lock,  just write locks for the toks and seen >> files. As scans defer their writes through the journal they are >> lockless. >S

Re: Bayes files LOCK

2020-02-14 Thread John Hardin
On Fri, 14 Feb 2020, RW wrote: On Fri, 14 Feb 2020 14:38:05 + (UTC) Pedro David Marco wrote: On Friday, February 14, 2020, 1:17:29 PM GMT+1, RW wrote: That would defeat the object of having a journal file. Even if you are right, it doesn't really explain anything because it applies to ev

Re: Bayes files LOCK

2020-02-14 Thread RW
On Fri, 14 Feb 2020 14:38:05 + (UTC) Pedro David Marco wrote: > > On Friday, February 14, 2020, 1:17:29 PM GMT+1, RW wrote: > > That would defeat the object of having a journal file. > > Even if you are right, it doesn't really explain anything because it applies to everyone using BD

Re: Bayes files LOCK

2020-02-14 Thread Pedro David Marco
> On Friday, February 14, 2020, 1:17:29 PM GMT+1, RW wrote: > That would defeat the object of having a journal file. > Even if you are right, it doesn't really explain anything because it applies to everyone using BDB/DBM/SDBM. > IIWY I'd be looking at what's different for you. I basically

Re: Bayes files LOCK

2020-02-14 Thread RW
On Thu, 13 Feb 2020 15:29:46 + (UTC) Pedro David Marco wrote: > >>On 13.02.20 12:30, RW wrote: > >>Bayes doesn't write on scans (unless it does an opportunistic sync > >>or > >expiry): > > > >doesn't it record token access times to journal? > > i think SA always does an EXclusive

Re: Bayes files LOCK

2020-02-13 Thread RW
On Thu, 13 Feb 2020 15:05:57 +0100 Matus UHLAR - fantomas wrote: > >On Wed, 12 Feb 2020 18:03:01 + (UTC) > >Pedro David Marco wrote: > >> I am getting errors from Bayes because it is not able to lock Bayes > >> files... Error log is: > >> bayes: cannot open bayes databases > >> /etc/spama

Re: Bayes files LOCK

2020-02-13 Thread Pedro David Marco
>On 13.02.20 12:30, RW wrote: >>Bayes doesn't write on scans (unless it does an opportunistic sync or >expiry): > >doesn't it record token access times to journal? i think SA always does an EXclusive lock despite the parameters, probably because of that, Fantomas I will try with SDBM...

Re: Bayes files LOCK

2020-02-13 Thread Matus UHLAR - fantomas
On Wed, 12 Feb 2020 18:03:01 + (UTC) Pedro David Marco wrote: I am getting errors from Bayes because it is not able to lock Bayes files... Error log is: bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock failed: Interrupted system call SA tries to lock bayes fi

Re: Bayes files LOCK

2020-02-13 Thread RW
On Wed, 12 Feb 2020 18:03:01 + (UTC) Pedro David Marco wrote: > Hi.. > I am getting errors from Bayes because it is not able to lock Bayes > files... Error log is: > bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* > R/W: lock failed: Interrupted system call > > > SA tr

Re: Bayes files LOCK

2020-02-13 Thread Matus UHLAR - fantomas
On 12.02.20 19:02, Pedro David Marco wrote: Fully aligned with my suspicions... I love Redis...  i think someone posted sometime ago a performance table showing local / sql / redis  ... i will look for it... but it would not surprise me if remote Redis is even faster than local DB... On 13.02.

Re: Bayes files LOCK

2020-02-13 Thread Matus UHLAR - fantomas
On 12.02.20 19:02, Pedro David Marco wrote: Fully aligned with my suspicions... I love Redis...  i think someone posted sometime ago a performance table showing local / sql / redis  ... i will look for it... but it would not surprise me if remote Redis is even faster than local DB... last time

Re: Bayes files LOCK

2020-02-12 Thread Axb
On 2020-02-13 04:11, John Hardin wrote: On Wed, 12 Feb 2020, Pedro David Marco wrote: Hi.. I am getting errors from Bayes because it is not able to lock Bayes files... Error log is: bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock failed: Interrupted system call

Re: Bayes files LOCK

2020-02-12 Thread John Hardin
On Wed, 12 Feb 2020, Pedro David Marco wrote: Hi.. I am getting errors from Bayes because it is not able to lock Bayes files... Error log is: bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock failed: Interrupted system call SA tries to lock bayes files always in

Re: Bayes files LOCK

2020-02-12 Thread Benny Pedersen
Kevin A. McGrail wrote on 2020-02-12 23:14: I don't know for sure but Alex Broens aka axb is a master at Bayes with Redis. Definitely look for his postings. lets see sqlite with DBI: supported in spamassassin i am happy with postgresql

Re: Bayes files LOCK

2020-02-12 Thread Kevin A. McGrail
I don't know for sure but Alex Broens aka axb is a master at Bayes with Redis. Definitely look for his postings. -- Kevin A. McGrail Member, Apache Software Foundation Chair Emeritus Apache SpamAssassin Project https://www.linkedin.com/in/kmcgrail - 703.798.0171 On Wed, Feb 12, 2020 at 2:02 PM P

Re: Bayes files LOCK

2020-02-12 Thread Pedro David Marco
Fully aligned with my suspicions... I love Redis...  i think someone posted sometime ago a performance table showing local / sql / redis  ... i will look for it... but it would not surprise me if remote Redis is even faster than local DB... Thanks! Ďakujem mnohokrát Fantomas! Pedro. On Wedne

Re: Bayes files LOCK

2020-02-12 Thread Matus UHLAR - fantomas
On 12.02.20 18:03, Pedro David Marco wrote: I am getting errors from Bayes because it is not able to lock Bayes files... Error log is: bayes: cannot open bayes databases /etc/spamassassin/bayes/bayes_* R/W: lock failed: Interrupted system call SA tries to lock bayes files always in "EXclus

Re: bayes sql postgresql

2020-01-26 Thread Marc Stürmer
Quoting Benny Pedersen: i came after using it this way for long time now that it could be more optimized with bayes user id vars, currently it creates new ids each time there is a new user, but it does not reuse old ids that is not used anymore after sa-learn --username f...@example.org,

Re: bayes sql postgresql

2020-01-18 Thread RW
On Sat, 18 Jan 2020 13:31:10 +0100 Benny Pedersen wrote: > i came after using it this way for long time now that it could be > more optimized with bayes user id vars, currently it creates new ids > each time there is a new user, but it does not reuse old ids that is > not used anymore after sa-lea

Re: Bayes

2019-11-29 Thread Jerry Malcolm
Can I bump this one to the top again?  I had great bayes reports in every email for 30 minutes.  Then nothing for the last three days, even after restarting SA.  Is it possible that my bayes db got corrupted?  The sa-learn --dump magic looks ok as far as I can tell. Thanks for any suggestions.

Re: Bayes

2019-11-26 Thread Jerry Malcolm
This is getting stranger by the minute... After playing around and verifying permissions and everything, I actually started getting a bayes score item for each email.  So I celebrated and went to dinner.  Came back a few hours later and checked the logs.  Bayes consistently added a score line t

Re: Bayes

2019-11-26 Thread Jerry Malcolm
On 11/25/2019 3:02 PM, Mikael Syska wrote: Try and run: sa-learn --dump magic Should give you some information like: 0.000  0  3  0  non-token data: bayes db version 0.000  0 493422  0  non-token data: nspam 0.000  0    3867414  0  non-

Re: Bayes

2019-11-25 Thread RW
On Mon, 25 Nov 2019 22:02:03 +0100 Mikael Syska wrote: > Try and run: > > sa-learn --dump magic That needs to be run as the spamd user (using su or otherwise) for db files.

Re: Bayes

2019-11-25 Thread Mikael Syska
Try and run: sa-learn --dump magic Should give you some information like: 0.000  0  3  0  non-token data: bayes db version 0.000  0 493422  0  non-token data: nspam 0.000  0    3867414  0  non-token data: nham 0.000  0
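
For anyone who wants to check these counters from a script, the lines quoted above can be parsed with a few lines of Python. This is a sketch under assumptions: the function name `parse_dump_magic` is made up, and the column layout (value in the third column, label after "non-token data:") is inferred only from the sample lines in this message. The output would first have to be captured from `sa-learn --dump magic`, run as the user that owns the Bayes files.

```python
import re

# Four whitespace-separated columns, then "non-token data: <label>".
# The third column is assumed to carry the value, as in the quoted sample.
LINE_RE = re.compile(
    r"^\s*\S+\s+\S+\s+(\d+)\s+\S+\s+non-token data:\s+(.+?)\s*$"
)


def parse_dump_magic(text: str) -> dict[str, int]:
    """Map each 'non-token data' label to its integer value."""
    values: dict[str, int] = {}
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match:
            values[match.group(2)] = int(match.group(1))
    return values


# Sample lines taken from the message above.
sample = (
    "0.000  0        3  0  non-token data: bayes db version\n"
    "0.000  0   493422  0  non-token data: nspam\n"
    "0.000  0  3867414  0  non-token data: nham\n"
)
counts = parse_dump_magic(sample)
```

With the sample above, `counts["nspam"]` and `counts["nham"]` hold the learned spam and ham message counts, which is usually the first thing to look at when Bayes rules stop appearing in results.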

Re: Bayes

2019-11-24 Thread RW
On Sun, 24 Nov 2019 12:35:16 -0600 Jerry Malcolm wrote: > Thanks again to everyone who helped me get everything up and running > over the past couple of days. > > Now that I have SA finding my bayes database, I'm curious about bayes > reporting.  My bayes db is not new.  I migrated my previous

Re: Bayes underperforming, HTML entities?

2018-12-07 Thread John Hardin
On Fri, 7 Dec 2018, Amir Caspi wrote: On Dec 6, 2018, at 12:14 PM, John Hardin wrote: Runaway backtracking that was killing masscheck for several people. Hrm, that is disconcerting. I'm not sure where any backtracking might be occurring... This sort of thing is risky, especially in a ra

Re: Bayes underperforming, HTML entities?

2018-12-07 Thread Amir Caspi
On Dec 6, 2018, at 12:14 PM, John Hardin wrote: > > Runaway backtracking that was killing masscheck for several people. Hrm, that is disconcerting. I'm not sure where any backtracking might be occurring... Can anyone help improve this suggested rule? rawbody AC_HTML_ENTITY_BONANZA_NEW

Re: Bayes underperforming, HTML entities?

2018-12-06 Thread John Hardin
On Tue, 4 Dec 2018, Amir Caspi wrote: On Dec 1, 2018, at 10:31 AM, John Hardin wrote: On Thu, 29 Nov 2018, Amir Caspi wrote: A) Could you sandbox the proposed rule change (AC_HTML_ENTITY_BONANZA_NEW) and see how it performs, including possible FPs? Done. Any preliminary results? Run
