Hello Richard, Monday, February 21, 2005, 6:09:49 AM, you wrote:
GR> Try these on for size: GR> header __PORN_WORD01 Subject =~/n(?:ex|xe)t door/i GR> header __PORN_WORD02 Subject =~/puss(?:y|ies)/i GR> ... header __PORN_WORD01 Subject =~ /n(?:ex|xe)t door/i header __PORN_WORD02 Subject =~ /puss(?:y|ies)/i header __PORN_WORD04 Subject =~ /(?:needs|for) m(?:one|oen|neo|noe|eno|eon)y/i header __PORN_WORD05 Subject =~ /h(?:orn|onr|nro|nor|ron|rno)y/i header __PORN_WORD06 Subject =~ /f(?:ucke|ucek|ukce|ukec|ueck|uekc|cuek|cuke|ckue|ckeu|ceku|ceuk|kuce|kuec|kcue|kceu|kecu|keuc|euck|eukc|ecuk|ecku|ekcu|ekuc)d/i header PORN_WORD08 Subject =~ /\bMILF\b/i header PORN_WORD09 Subject =~ /w(?:hor|hro|roh|rho|ohr|orh)e/i header PORN_WORD20 Subject =~ /w(?:hore|hoer|hroe|hreo|heor|hero|ohre|oher|orhe|oreh|oerh|oehr|rhoe|rhep|roeh|rohe|reho|reoh|ehro|ehor|eorh|eohr|erho|eroh)s/i header PORN_WORD10 Subject =~ /(?:hstoett|o(?:the|teh|het|hte|eht|eth)r|stpuid|stupid|disgusting|shy|married|brand new|dirty|average|amateur|amatuer|amtauer|real|beautiful|hot|sexy|sxey|n(?:ast|ats|tas|tsa|sta|sat)y|wet|cute).{1,3}(?:(?:step|grand)?[\-_]?(?:mo|om)ms?|house[\-_]?wi[fvr]es?|(?:cow)?girls?|moms?|w(?:om[ae]|o[ae]m|[ae]om|[ae]mo|m[ae]o|mo[ae])n|neigbhour|neighbour|neighbuor|(?:teen|tnee)(?:ager|agre|arge)?s?|s(?:lu|ul)ts?|bitehcs|bitches)/i header __PORN_WORD11 Subject =~ /\bcum(?:shot)?\b/i #error: header __PORN_WORD12 Subject =~ /(?:d(?:ic|ci)k|c(?:|oc|co)k/i header __PORN_WORD12 Subject =~ /(?:d(?:ic|ci)k|c(?:|oc|co)k)/i header __PORN_WORD13 Subject =~ /fucking/i header __PORN_WORD14 Subject =~ /up[\-_]c(?:los|lso|sol|slo|ols|osl)e/i header __PORN_WORD15 Subject =~ /snatch/i header __PORN_WORD16 Subject =~ /(?:pervert|peervrt|prevert|perevrt)/i No ham hits for these: #counts __PORN_WORD01 7s/0h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts __PORN_WORD05 57s/0h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts PORN_WORD08 19s/0h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts __PORN_WORD11 914s/0h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts __PORN_WORD16 2s/0h of 197615 corpus (96830s/100785h RM) 02/22/05 Adequate S/O for these on my system: #counts __PORN_WORD02 82s/1h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts __PORN_WORD06 53s/1h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts PORN_WORD10 139s/5h of 197615 corpus (96830s/100785h RM) 02/22/05 These don't work here: #counts __PORN_WORD04 0s/1h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts PORN_WORD09 26s/23h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts PORN_WORD20 13s/4h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts __PORN_WORD12 4543s/4626h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts __PORN_WORD13 18s/2h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts __PORN_WORD14 2s/1h of 197615 corpus (96830s/100785h RM) 02/22/05 #counts __PORN_WORD15 4s/1h of 197615 corpus (96830s/100785h RM) 02/22/05 There's a fair amount of overlap with current SARE rules, which I haven't tested yet, but some of these should be worth adding to the SARE rule set if we can have your permission to do so. Bob Menschel