On Mon, 17 Oct 2011, Adam Katz wrote:
header __SUBJ_OBFU_PUNCT Subject =~
/(?:[-~`"!@\#$%^&*()_+={}|\\\/?<>,.:;][a-z][-~`"!@\#$%^&*()_+={}|\\\/?<>,.:;\s]|[a-z][~`"!@\#$%^&*()_+={}|\\\/?<>,.:;][a-z])/i
How does this differ from a negation, like:
/[^\[\]'\w\s][a-z][^\[\]'\w]|[a-z][^\[\]'\w\s-][a-z]/i
I suppose which you'd choose would be based on how conservative you want
to be. Matching on specific types of obfuscation (as mine does), or being
less selective (as yours does).
and how does this not FP all over the place with subjects like:
Time for F-U-N
I like D&D and rock&roll
/var/spool/mail is full
It must hit more than a specified number of times. __SUBJ_OBFU_PUNCT isn't
scored, SUBJ_OBFU_PUNCT_FEW and SUBJ_OBFU_PUNCT_MANY are.
I think this would satisfy the original request:
header __SUBJ_LACKS_WORDS
Subject !~ /(?!^.{0,15}$)(?:^|\s)[a-z]{3,15}(?:\s|$)/
(I have not checked that in, feel free if you like it.)
When I get home tonight.
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhar...@impsec.org FALaholic #11174 pgpk -a jhar...@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
Insofar as the police deter by their presence, they are very, very
good. Criminals take great pains not to commit a crime in front of
them. -- Jeffrey Snyder
-----------------------------------------------------------------------
312 days since the first successful private orbital launch (SpaceX)