> -----Original Message-----
> From: jennifer [mailto:[EMAIL PROTECTED] 
> Sent: Monday, December 01, 2003 11:05 AM
> To: Dallas L. Engelken; [EMAIL PROTECTED]
> Subject: RE: [SAtalk] Evil rules, popcorn, etc too much?
> 
> 
> Hi Dallas
> Thanks for posting your tests.  Bob found the same type thing 
> on the weeds set when he ran the rules against his monster 
> corpus.  I realize those hit on ham, but for the most part, 
> they don't hit it hard. There were however some emails that 
> weeds did push over the threshold.  I tag at 7.0, so I hadn't 
> noticed before he sent me his test.  You could lower the 
> points on those rules.  -Or- There is a second set of weeds 
> that is more restrictive if you'd rather try those out.  
> They're on the page with the others.  
> (http://www.emtinc.net/spamhammers.htm )  I wrote those after 
> Bob's test.  Both sets are still up, I prefer the originals for us.
> 
> Also, I noticed you're using an older version of P&B, you 
> might want to check out the new sets as well.  They hit 
> better. (The spammers altered their tag content, and I had a 
> little more time to learn how to write them to be more 
> inclusive but remain reliable.)
> 
> Jennifer
> 

thanks...

i updated to popcorn, blackhair and weeds / weeds2 rules dated 11/13,
and re-ran the corpus tests (after removing 30 emails, see below)...

here they are..

http://engelken.net/masses/testrule.NEW_BLACKHAIR.txt.out
http://engelken.net/masses/testrule.NEW_POPCORN.txt.out
http://engelken.net/masses/testrule.NEW_WEEDS_TEST_2.txt.out
http://engelken.net/masses/testrule.NEW_WEEDS_2_TEST_2.txt.out

- 0 total hits on the ham corpus for the popcorn tests.
- 7 total hits on the ham corpus for the Blackhair tests.
- 0** total hits on the ham corpus for the weeds and weeds 2 tests.
 
  ** the previous weeds and weeds 2 tests had alot of hits 
     on the ham corpus, but *ALL* of the hits were all related 
     to the 30 emails on the <sprocket.lockergnome.com> mailling 
     list that you can find in the public hard_ham corpus.

[EMAIL PROTECTED] masses]# grep J_WEEDS_C ham.log | wc -l
     30
[EMAIL PROTECTED] masses]# grep J_WEEDS_C ham.log | grep lockergnome | wc -l
     30

I removed the following messages from the ham corpus and ran the weeds
tests again..
http://engelken.net/masses/offending_weeds_email.txt

The filenames you see there can be matched against what you find in hard
ham..
http://spamassassin.org/publiccorpus/20030228_hard_ham.tar.bz2

Dallas


-------------------------------------------------------
This SF.net email is sponsored by: SF.net Giveback Program.
Does SourceForge.net help you be more productive?  Does it
help you create better code?  SHARE THE LOVE, and help us help
YOU!  Click Here: http://sourceforge.net/donate/
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to