Re: bayes scroing too low

2014-08-31 Thread Ian Zimmerman
On Sun, 31 Aug 2014 12:20:41 +0200, Axb wrote: Axb> get the source from http://razor.sourceforge.net/ I don't recommend Axb> installing via some rpm. The last version mentioned on that site is 2.84, from May 2007. Strangely, the version in the current Debian packages is 2.85. Anyone know what's go

Re: random low contrast text with bayes

2014-08-31 Thread Eric Shubert
On 08/31/2014 10:26 PM, John Hardin wrote: > On Sun, 31 Aug 2014, Eric Shubert wrote: >> I've seen an uptick of spam lately with random low contrast (hidden) text. This appears to be lowering bayes probabilities. > Learn them as spam. That will tend to eliminate that effect. Been doing that (lear

A rule for Phil

2014-08-31 Thread Luciano Rinetti
I need a rule that, when a message is sent to p...@example.com and the Subject contains "CV" or "Curriculum", scores the message with -9, and a rule that, when a message is sent to p...@example.com and the Subject doesn't contain "CV" or "Curriculum", scores the message with 7. Regards
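The decision logic in this request can be sketched in a few lines of Python. This is only an illustration of the two cases, not a SpamAssassin rule: the recipient address is elided in the post, so `RECIPIENT` is a hypothetical placeholder, and `phil_score` is a made-up name. In SpamAssassin itself the same idea would normally be expressed with header rules combined in a meta rule.

```python
import re

# Hypothetical placeholder: the real address is elided in the original post.
RECIPIENT = "recipient@example.com"

# "CV" or "Curriculum" as whole words, case-insensitive (an assumption;
# the post does not say whether case matters).
CV_SUBJECT = re.compile(r"\b(?:CV|Curriculum)\b", re.IGNORECASE)


def phil_score(to_header: str, subject: str) -> float:
    """Return the score adjustment described in the request.

    -9 if the message is for RECIPIENT and the Subject mentions a CV,
    +7 if it is for RECIPIENT and the Subject does not,
     0 for any other recipient.
    """
    if RECIPIENT not in to_header.lower():
        return 0.0
    return -9.0 if CV_SUBJECT.search(subject) else 7.0
```

Note that the two cases are mutually exclusive, so a single check on the recipient followed by one check on the Subject covers both requested rules.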

Re: random low contrast text with bayes

2014-08-31 Thread John Hardin
On Sun, 31 Aug 2014, Eric Shubert wrote: > I've seen an uptick of spam lately with random low contrast (hidden) text. This appears to be lowering bayes probabilities. Learn them as spam. That will tend to eliminate that effect. -- John Hardin KA7OHZ http://www.impsec.org/~jh

random low contrast text with bayes

2014-08-31 Thread Eric Shubert
I've seen an uptick of spam lately with random low contrast (hidden) text. This appears to be lowering bayes probabilities. I'd like to strip low contrast text from messages before they're learned by sa-learn in order to combat this. 1) does anyone have some guidance for building such a filte
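As a rough starting point for such a filter, here is a minimal Python sketch that drops text whose inline colour is close to the background before the message body is handed to a learner. It rests on several assumptions that are not in the original post: a white page background, colours given only as six-digit hex values in `style="color:..."` or `<font color="...">`, and an arbitrary brightness threshold. The names `VisibleText` and `strip_low_contrast` are hypothetical. Real spam uses many more tricks (CSS classes, tiny fonts, background colours on parent elements), so treat this as a sketch rather than a complete solution.

```python
import re
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}
HEX_COLOR = re.compile(r"#[0-9a-fA-F]{6}")
# Match "color:" but not "background-color:".
STYLE_COLOR = re.compile(r"(?<![-\w])color\s*:\s*(#[0-9a-fA-F]{6})", re.IGNORECASE)


def luminance(hex_color):
    """Approximate perceived brightness of '#rrggbb' on a 0..1 scale."""
    r, g, b = (int(hex_color[i:i + 2], 16) for i in (1, 3, 5))
    return (0.299 * r + 0.587 * g + 0.114 * b) / 255.0


class VisibleText(HTMLParser):
    """Collect only the text whose colour differs enough from the background."""

    def __init__(self, background="#ffffff", min_diff=0.2):
        super().__init__()
        self.bg = luminance(background)
        self.min_diff = min_diff      # assumed threshold, tune to taste
        self.stack = []               # (tag, hidden) for each open element
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attrs = dict(attrs)
        color = None
        match = STYLE_COLOR.search(attrs.get("style") or "")
        if match:
            color = match.group(1)
        elif tag == "font" and HEX_COLOR.fullmatch(attrs.get("color") or ""):
            color = attrs["color"]
        # Children inherit the hidden state unless they set their own colour.
        hidden = self.stack[-1][1] if self.stack else False
        if color is not None:
            hidden = abs(luminance(color) - self.bg) < self.min_diff
        if tag in ("script", "style"):
            hidden = True
        self.stack.append((tag, hidden))

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed elements.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        if not (self.stack and self.stack[-1][1]):
            self.parts.append(data)


def strip_low_contrast(html):
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)
```

The stripped text could then be written back into a copy of the message before learning; how that copy is fed to sa-learn is left open here.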

Re: Outlook, we do love to hate you....

2014-08-31 Thread Jason Haar
On 01/09/14 04:33, Dave Warren wrote: > > As I understand that, that's specifically for messages that originated > within Exchange itself and had no SMTP transmission or RFC5321 or 5322 > components in the first place. This dates back to Exchange's history, > at which point it wasn't primarily a SM

Re: sa-learn and find

2014-08-31 Thread Ian Zimmerman
On Sun, 31 Aug 2014 17:37:50 -0600, LuKreme wrote: Ian> xargs (the GNU one at least) has an option to not run the inferior Ian> when there are no args to give it. LuKreme> The interior is the find: _Inferior_ which is GNU speak for "subprocess". I should have tried to be less concise :-) > sa

Re: SA works great!

2014-08-31 Thread LuKreme
On 31 Aug 2014, at 08:08 , Ted Mittelstaedt wrote: > Google does it. It's not impossible. [snip] > My experience is that the commercial providers like Gmail are now > so aggressive that false positives are VERY common on their systems, > this leads to people nowadays quite commonly saying "che

Re: SA works great!

2014-08-31 Thread Bob Proulx
Ted Mittelstaedt wrote: > Reindl Harald wrote: > > i think it's impossible to improve that much "out-of-the-box" because > > that would make it to sensitive while the bayes has the ham side of > > your communication too for decisions > > Google does it. It's not impossible. But not "out of the b

Re: Give a penalty to messages with non latin UTF-8 characters?

2014-08-31 Thread LuKreme
On 31 Aug 2014, at 14:38 , Ian Zimmerman wrote: > Doesn't ok_languages and ok_locales do the job? It does for me. Not with UTF-8 encoding; that setting only seems to apply to old-style character declarations. -- showing snuffy is when Sesame Street jumped the shark

Re: sa-learn and find

2014-08-31 Thread LuKreme
On 31 Aug 2014, at 14:46 , Ian Zimmerman wrote: > On Sat, 30 Aug 2014 19:59:53 -0600, > LuKreme wrote: > > RW> This may run into shell argument limits if you have to learn a lot > RW> of spam. Consider piping the output of find to xargs, or using -exec > RW> ...{} + in find. > > LuKreme> Yes,

Re: SA works great!

2014-08-31 Thread Axb
On 08/31/2014 10:54 PM, Ian Zimmerman wrote: On Sun, 31 Aug 2014 16:55:50 +0200, Axb wrote: Axb> During the last +-4 years, scores have been set by the masscheck GA Axb> system. IF more ppl would contribute with masschecks and rules, Axb> detection could be better, but the lack of volunteers d

Re: bayes scroing too low

2014-08-31 Thread Reindl Harald
Am 31.08.2014 um 23:06 schrieb Ian Zimmerman: > On Sun, 31 Aug 2014 12:20:41 +0200, > Axb wrote: > > Axb> Bayes scores are *not* set to be a sole indicator of spam/ham. > Axb> They're supposed to be yet another indicator. > > FWIW, I use both Razor and Pyzor, and there are times when they seem

Re: SA works great!

2014-08-31 Thread Ian Zimmerman
On Sun, 31 Aug 2014 16:55:50 +0200, Axb wrote: Axb> During the last +-4 years, scores have been set by the masscheck GA Axb> system. IF more ppl would contribute with masschecks and rules, Axb> detection could be better, but the lack of volunteers doing this Axb> shows that apparently what SA do

Re: sa-learn and find

2014-08-31 Thread Ian Zimmerman
On Sat, 30 Aug 2014 19:59:53 -0600, LuKreme wrote: RW> This may run into shell argument limits if you have to learn a lot RW> of spam. Consider piping the output of find to xargs, or using -exec RW> ...{} + in find. LuKreme> Yes, I tried to do that, but as I said in my first post, if I LuKreme>
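The batching that xargs performs to stay under the shell argument limit can be sketched in Python as follows. This is a hedged illustration, not anything posted in the thread: `batches` and `learn_spam` are hypothetical names, the batch size of 500 is an arbitrary assumption, and the only external command used is `sa-learn --spam` with file paths as arguments. If the directory is empty, `sa-learn` is never invoked at all.

```python
import subprocess
from pathlib import Path


def batches(items, size):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def learn_spam(directory, batch_size=500, runner=subprocess.run):
    """Feed every regular file under `directory` to sa-learn in batches.

    `runner` is injectable so the batching can be exercised without
    sa-learn being installed. Returns the number of files submitted.
    """
    files = sorted(str(p) for p in Path(directory).rglob("*") if p.is_file())
    for chunk in batches(files, batch_size):
        # One sa-learn process per batch keeps each command line short.
        runner(["sa-learn", "--spam", *chunk], check=True)
    return len(files)
```

Starting one process per batch rather than one per message is the same trade-off that `find ... -exec ... {} +` makes.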

Re: bayes scroing too low

2014-08-31 Thread Ian Zimmerman
On Sun, 31 Aug 2014 12:20:41 +0200, Axb wrote: Axb> Bayes scores are *not* set to be a sole indicator of spam/ham. Axb> They're supposed to be yet another indicator. FWIW, I use both Razor and Pyzor, and there are times when they seem to be just asleep. Or maybe a particular kind of spam defeat

Re: Give a penalty to messages with non latin UTF-8 characters?

2014-08-31 Thread Ian Zimmerman
On Sat, 30 Aug 2014 06:44:39 -0600, LuKreme wrote: LuKreme> I would welcome rules that would reliably penalize messages LuKreme> that use chinese, japanese, korean, thai, or any other LuKreme> characters in the UTF-8 address space that I don’t read. I LuKreme> would put them in user_prefs. Doesn
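To make the request concrete, here is a small Python sketch of what "penalize messages that use non-Latin characters" could mean once the text has been decoded from UTF-8. It is not a SpamAssassin rule and does not use `ok_languages` or `ok_locales`; the names `non_latin_fraction` and `script_penalty`, the 0.5 threshold, and the 2.0 score are all hypothetical values chosen for illustration.

```python
import unicodedata


def non_latin_fraction(text):
    """Fraction of alphabetic characters that are not Latin-script letters."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    foreign = sum(
        1 for ch in letters
        # Unicode names of Latin letters start with "LATIN", accents included.
        if not unicodedata.name(ch, "").startswith("LATIN")
    )
    return foreign / len(letters)


def script_penalty(text, threshold=0.5, score=2.0):
    """Return `score` when more than `threshold` of the letters are non-Latin."""
    return score if non_latin_fraction(text) > threshold else 0.0
```

Working on a fraction rather than on the mere presence of such characters avoids penalizing a mostly Latin message that contains a single foreign name.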

Re: Outlook, we do love to hate you....

2014-08-31 Thread Dave Warren
On 2014-08-29 20:31, Jason Haar wrote: while we're having a grizzle... how about the Outlook/MAPI "feature" where if you copy/move an Exchange mail message onto an IMAP folder, what arrives can barely be described as a legitimate mail message: it has no "Received:" headers, and its To/From line

Re: SA works great!

2014-08-31 Thread Axb
On 08/31/2014 04:08 PM, Ted Mittelstaedt wrote: Out of the box the default decision point of 5 is too high anyway. SA is the framework - you can tune to your need as much as you want. I think the emphasis on avoiding false positives in the stock (non-Bayes) distribution is far too high. I su

Re: SA works great!

2014-08-31 Thread Reindl Harald
Am 31.08.2014 um 16:08 schrieb Ted Mittelstaedt: > On 8/31/2014 2:21 AM, Reindl Harald wrote: >> >> Am 31.08.2014 um 02:15 schrieb Ted Mittelstaedt: >>> Yes, it does work great when you have the bayes filter turned on and you >>> take the time to feed it. And that means >>> you have to feed the

Re: SA works great!

2014-08-31 Thread Ted Mittelstaedt
On 8/31/2014 2:21 AM, Reindl Harald wrote: Am 31.08.2014 um 02:15 schrieb Ted Mittelstaedt: Yes, it does work great when you have the bayes filter turned on and you take the time to feed it. And that means you have to feed the learner both ham and spam and setup reliable sources for those.

Re: bayes scroing too low

2014-08-31 Thread Reindl Harald
Am 31.08.2014 um 12:20 schrieb Axb: > On 08/31/2014 11:58 AM, Reindl Harald wrote: >>> Are you using RAZOR & PYZOR? >> >> https://bugzilla.redhat.com/show_bug.cgi?id=1127650 >> "perl-Razor-Agent - Only used for the not enabled by default Razor plugin" >> >> so i guess no > > get the source from h

Re: bayes scroing too low

2014-08-31 Thread Axb
On 08/31/2014 11:58 AM, Reindl Harald wrote: >> Are you using RAZOR & PYZOR? > > https://bugzilla.redhat.com/show_bug.cgi?id=1127650 > "perl-Razor-Agent - Only used for the not enabled by default Razor plugin" > > so i guess no get the source from http://razor.sourceforge.net/ I don't recommend installin

Re: bayes scroing too low

2014-08-31 Thread Axb
On 08/31/2014 11:41 AM, Reindl Harald wrote: i guess it needs to adjust them depending on block score was one of the typical "enhance your penis" mails score BAYES_95 0 0 3.2 3.0 score BAYES_99 0 0 3.8 3.5 you missed: + 0.2 BAYES_999 X-Spam-Status: No, score=4.4, tag-level=4.5,

bayes scroing too low

2014-08-31 Thread Reindl Harald
i guess it needs to adjust them depending on block score was one of the typical "enhance your penis" mails score BAYES_95 0 0 3.2 3.0 score BAYES_99 0 0 3.8 3.5 X-Spam-Status: No, score=4.4, tag-level=4.5, block-level=8.5 X-Spam-Report: * 0.5 CUST_DNSBL_8 RBL: ix.dnsbl.manitu.net

Re: SA works great!

2014-08-31 Thread Reindl Harald
Am 31.08.2014 um 02:15 schrieb Ted Mittelstaedt: > Yes, it does work great when you have the bayes filter turned on and you take > the time to feed it. And that means > you have to feed the > learner both ham and spam and setup reliable sources for those. > > Unfortunately if Bayes is not turne