On 10/15/2011 03:37 PM, John Hardin wrote:
> On Thu, 13 Oct 2011, Mynabbler wrote:
>
>> Typically the chickenpox rules do not get a lot of love abroad,
>> since they tend to trip over other languages than English. However,
>> does someone have an idea how to use the logic in chickenpox for
>> subjects like these:
>>
>> ... or does someone have a decent rule to tag this kind of crap?
>
> I've got something in local masscheck right now, should commit later
> today. Check my sandbox tomorrow.

header __SUBJ_OBFU_PUNCT Subject =~ /(?:[-~`"!@\#$%^&*()_+={}|\\\/?<>,.:;][a-z][-~`"!@\#$%^&*()_+={}|\\\/?<>,.:;\s]|[a-z][~`"!@\#$%^&*()_+={}|\\\/?<>,.:;][a-z])/i

How does this differ from a negation, like:

  /[^\[\]'\w\s][a-z][^\[\]'\w]|[a-z][^\[\]'\w\s-][a-z]/i

and how does this not FP all over the place with subjects like:

  Time for F-U-N
  I like D&D and rock&roll
  /var/spool/mail is full

I think this would satisfy the original request:

header __SUBJ_LACKS_WORDS Subject !~ /(?!^.{0,15}$)(?:^|\s)[a-z]{3,15}(?:\s|$)/

(I have not checked that in, feel free if you like it.)
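
For anyone wanting to try the three subjects above against both patterns, here is a rough sketch. It is in Python rather than Perl, on the assumption that Python's re module treats these particular constructs (character classes, lookahead, /i) the same way Perl does for plain ASCII input. The hits() helper and the dictionary labels are made up for illustration and are not part of SpamAssassin. Keep in mind that __SUBJ_LACKS_WORDS uses !~, so it fires when the pattern does not match.

```python
import re

# Patterns copied from the rules above; /i becomes re.I.
# Assumption: Python's re behaves like Perl here for ASCII subjects.
OBFU_PUNCT = re.compile(
    r'(?:[-~`"!@\#$%^&*()_+={}|\\\/?<>,.:;][a-z][-~`"!@\#$%^&*()_+={}|\\\/?<>,.:;\s]'
    r'|[a-z][~`"!@\#$%^&*()_+={}|\\\/?<>,.:;][a-z])',
    re.I,
)
NEGATED = re.compile(r"[^\[\]'\w\s][a-z][^\[\]'\w]|[a-z][^\[\]'\w\s-][a-z]", re.I)
# No /i on this one, so it stays case-sensitive.
HAS_WORD = re.compile(r'(?!^.{0,15}$)(?:^|\s)[a-z]{3,15}(?:\s|$)')

SUBJECTS = [
    "Time for F-U-N",
    "I like D&D and rock&roll",
    "/var/spool/mail is full",
]

def hits(subject):
    """Hypothetical helper: report which pattern would fire on a subject."""
    return {
        "__SUBJ_OBFU_PUNCT": bool(OBFU_PUNCT.search(subject)),
        "negated form": bool(NEGATED.search(subject)),
        # The rule is written with !~, so it fires on a non-match.
        "__SUBJ_LACKS_WORDS": not HAS_WORD.search(subject),
    }

if __name__ == "__main__":
    for s in SUBJECTS:
        print(repr(s), hits(s))
```

Run as-is, all three subjects come back True for both punctuation patterns and False for __SUBJ_LACKS_WORDS. That only shows what each pattern does in isolation; whatever meta rule ends up combining them is outside this sketch.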