On Mon, 17 Oct 2011, Adam Katz wrote:

header      __SUBJ_OBFU_PUNCT      Subject =~ /(?:[-~`"!@\#$%^&*()_+={}|\\\/?<>,.:;][a-z][-~`"!@\#$%^&*()_+={}|\\\/?<>,.:;\s]|[a-z][~`"!@\#$%^&*()_+={}|\\\/?<>,.:;][a-z])/i

How does this differ from a negation, like:

/[^\[\]'\w\s][a-z][^\[\]'\w]|[a-z][^\[\]'\w\s-][a-z]/i

I suppose which you'd choose depends on how conservative you want to be: matching on specific types of obfuscation (as mine does), or being less selective (as yours does).
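
To make the difference concrete, here is a rough Python sketch that runs both patterns over a few subjects. It assumes Python's re module behaves like SpamAssassin's Perl regex engine for these two patterns, and that the subject is a decoded Unicode string. The sample subjects and the names LISTED, NEGATED and compare are made up for illustration.

```python
import re

# Explicit punctuation list from __SUBJ_OBFU_PUNCT, minus the leading hyphen.
PUNCT = r'~`"!@\#$%^&*()_+={}|\\\/?<>,.:;'

# Explicit-list version: only the listed ASCII punctuation counts.
LISTED = re.compile(
    r'(?:[-' + PUNCT + r'][a-z][-' + PUNCT + r'\s]'
    r'|[a-z][' + PUNCT + r'][a-z])',
    re.IGNORECASE,
)

# Negated version: any character outside the excluded classes counts,
# which includes symbols that the explicit list never mentions.
NEGATED = re.compile(
    r"[^\[\]'\w\s][a-z][^\[\]'\w]|[a-z][^\[\]'\w\s-][a-z]",
    re.IGNORECASE,
)

def compare(subject):
    """Return (listed_hit, negated_hit) as booleans for one subject."""
    return (LISTED.search(subject) is not None,
            NEGATED.search(subject) is not None)

if __name__ == "__main__":
    # The last sample uses U+2022 (bullet) between the letters.
    for s in ["D&D night", "don't panic", "v\u2022i\u2022a"]:
        print(repr(s), compare(s))
```

Under those assumptions both versions hit on "D&D" and both ignore the apostrophe in "don't", but only the negated version hits on the bullet-separated sample, since the bullet is not in the explicit list.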

and how does this not FP all over the place with subjects like:

Time for F-U-N
I like D&D and rock&roll
/var/spool/mail is full

It must hit more than a specified number of times. __SUBJ_OBFU_PUNCT itself isn't scored; SUBJ_OBFU_PUNCT_FEW and SUBJ_OBFU_PUNCT_MANY are.
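
As a rough illustration of the hit counting, the Python sketch below counts non-overlapping matches of the __SUBJ_OBFU_PUNCT pattern in each of the sample subjects. The threshold value (FEW_THRESHOLD = 2) is invented for the example, since the real cutoffs are not given in this thread. Python's re module stands in for the Perl engine, and counting in SpamAssassin itself (presumably via tflags multiple and a meta rule) may differ in detail.

```python
import re

# Explicit punctuation list from __SUBJ_OBFU_PUNCT, minus the leading hyphen.
PUNCT = r'~`"!@\#$%^&*()_+={}|\\\/?<>,.:;'

SUBJ_OBFU_PUNCT = re.compile(
    r'(?:[-' + PUNCT + r'][a-z][-' + PUNCT + r'\s]'
    r'|[a-z][' + PUNCT + r'][a-z])',
    re.IGNORECASE,
)

# Hypothetical cutoff: NOT the value used by the real rules.
FEW_THRESHOLD = 2

def count_hits(subject):
    """Count non-overlapping matches, scanning left to right."""
    return sum(1 for _ in SUBJ_OBFU_PUNCT.finditer(subject))

def would_fire(subject, threshold=FEW_THRESHOLD):
    """True when the subject hits more than `threshold` times."""
    return count_hits(subject) > threshold

if __name__ == "__main__":
    samples = [
        "Time for F-U-N",
        "I like D&D and rock&roll",
        "/var/spool/mail is full",
        "V.i.a.g.r.a f.o.r y.o.u",
    ]
    for s in samples:
        print(count_hits(s), would_fire(s), s)
```

With this made-up cutoff, the three subjects quoted above each match only once or twice and stay under the threshold, while a heavily dotted subject matches many times.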

I think this would satisfy the original request:

header   __SUBJ_LACKS_WORDS  Subject !~ /(?!^.{0,15}$)(?:^|\s)[a-z]{3,15}(?:\s|$)/

(I have not checked that in, feel free if you like it.)

When I get home tonight.
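
For anyone who wants to try the __SUBJ_LACKS_WORDS pattern before it is checked in, here is a small Python sketch. It assumes Python's re module matches Perl's behaviour for this pattern. The function name subj_lacks_words and the sample subjects are hypothetical, and the function returns True where the !~ rule would fire.

```python
import re

# Same pattern as the proposed rule; note there is no /i flag, so only
# lowercase words of 3 to 15 letters, bounded by whitespace or the ends
# of the subject, count as "words".
WORDLIKE = re.compile(r'(?!^.{0,15}$)(?:^|\s)[a-z]{3,15}(?:\s|$)')

def subj_lacks_words(subject):
    """True when the pattern finds no match, i.e. the `!~` test succeeds."""
    return WORDLIKE.search(subject) is None

if __name__ == "__main__":
    samples = [
        "Time for F-U-N",            # " for " matches
        "V1AGRA 4 U!!!",             # no lowercase word at all
        "/var/spool/mail is full",   # " full" at the end matches
        "hello",                     # short subject, see note below
    ]
    for s in samples:
        print(subj_lacks_words(s), repr(s))
```

One thing the sketch makes visible: the leading negative lookahead only rejects a match that starts at position 0 when the whole subject is at most 15 characters long, so a one-word subject like "hello" is reported as lacking words.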

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  Insofar as the police deter by their presence, they are very, very
  good. Criminals take great pains not to commit a crime in front of
  them.                                             -- Jeffrey Snyder
-----------------------------------------------------------------------
 312 days since the first successful private orbital launch (SpaceX)
