Hi Jennifer,

Thanks for the reply.  I think I am seeing the light now.  With your help
and others from the list, I am really getting it now.  The FPs I was getting
are gone after changing the limit from \w{0,7} to
\w{1,7}?.  This really makes a huge difference.  Thanks again!

Don't feel awkward answering the questions.  Know one knows a pet better
than the owner!

I would like to move away from the previous discussion but still maintain
this topic so I removed the previous posts but am maintaining the subject to
keep the thread.  Hope this is good with those following this thread.

So I started experimenting again. Danger Will Robinson! ;)

I had the following HTML tag OBFU rule (variant of yours):
  /(\>|\s)\w{1,5}?\<\/?\s?[\w\s]{6,150}\/?\s?\>\w{1,7}?(\s|\W|\<)/

The thing I started to notice was that it missed OBFU like below (which will
wrap a little - the <br> is followed by an EOL).

<br>
O<f>UR U<RCEN>S Doct<X>ors wi<qk>ll Wri<vlL>te YO<NOsF>U a<WmL>
Prescr<yhE>iption<br>
Y<jHCe>ou wi<fm>ll ge<nT>t i<n>t NEXT<m>-DAY vi<ZvI>a Fed<YYam>-Ex<br>
<br>

So I changed the rule to the following:
  /(\>|\s)\w{1,5}?\<\/?\s?[\w\s]{1,150}\/?\s?\>\w{1,7}?(\s|\W|\<)/

But I started matching the following:
  <small>Top</small>

This came from a legit newsletter.  I assume that is why you used
[\w\s]{6,150}.  I have not found an answer yet, but I have an idea that may
work.  I think that if I create a rule like below, I might snag it.

  /\s\w{1,5}?\<\/?\s?[\w\s]{1,150}\/?\s?\>\w{1,7}?(\s|\W)/

It seems that most ham does not include the spaces which break up words.
I'll experiment and let you know the results.

Thanks again Jennifer.  These rules have really tagged a bunch of messages
that were at best borderline.  A lot of messages would have slipped through
without these!

--Larry



-------------------------------------------------------
This SF.net email is sponsored by: SF.net Giveback Program.
SourceForge.net hosts over 70,000 Open Source Projects.
See the people who have HELPED US provide better services:
Click here: http://sourceforge.net/supporters.php
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to