On Sat, 18 Jul 2009, twofers wrote:

I am mainly using the rule to check the header subject, I haven't added it to a body check.
?
So, between the 3 choices:
1.  /(?:[^_]{1,30}_+){5}/
2. /\S+_+\S+_+\S+/
3. R02 /^\S{30,}$/m

?Which covers the most territory given the example I submitted? I'm
basically interested in identifying those garbage subject lines laced with characters like underscores, periods, hyphens, semi-colons, etc; so rather than use several rules to trap those individual characters, maybe there is a more effective way to resolve this.

Your original example only included underscores.

Try this:

  header XX Subject =~ /(?:[[:alnum:]]{1,30}[^[:alnum:]\s]{1,5}){5}/i

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79

Reply via email to