[SAtalk] Re: Proposed 'FROM_SPAMLAND' user response summary

2002-03-09 Thread Daniel Pittman
On Fri, 08 Mar 2002, Rob McMillin wrote: > Justin Mason wrote: > >> Mind you, I don't think this is a good idea; it will make SA even >> more westerner-oriented. :( Pretty much all the GA corpus is from >> western >>sources and in western charsets, so the GA will totally skew it. > > Further: the

Re: [SAtalk] Re: Proposed 'FROM_SPAMLAND' user response summary

2002-03-08 Thread Rob McMillin
Justin Mason wrote: >Mind you, I don't think this is a good idea; it will make SA even more >westerner-oriented. :( Pretty much all the GA corpus is from western >sources and in western charsets, so the GA will totally skew it. > Further: the spam tests, body and keyword match, are virtually 10

Re: [SAtalk] Re: Proposed 'FROM_SPAMLAND' user response summary

2002-03-08 Thread Justin Mason
(delurking in a net cafe somewhere in Oz ;) >> * Default score = 0 > I think that's probably a good idea for the test as it stands because > it's a fairly uncontrolled score applied equally to a /large/ > proportion of the world. I agree. If the test is added it should be 0 by default. >> * Se

Re: [SAtalk] Re: Proposed "FROM_SPAMLAND" user response summary

2002-03-08 Thread Matt Sergeant
On Fri, 8 Mar 2002, Daniel Pittman wrote: > > ...and should I mention that I regularly see non-SPAM from about half of > those domains in lists that I am on? I think that's the nub really - you're going to see false positives with this rule, and the corpus may or may not show that up depending o

[SAtalk] RE: Proposed "FROM_SPAMLAND" user response summary (was Re: [SAtalk] A better alternative to test ROUND_THE_WORLD

2002-03-07 Thread Hamilton, Kent
Hmmm, well if you do then here we will have to turn that test off first thing. We are an international company with distributors or offices in at least 4 of those domains. -- Kent Hamilton <[EMAIL PROTECTED]> Manager - Systems Admin & Networking Hunter Engineering Company > -Original Me

[SAtalk] Re: Proposed "FROM_SPAMLAND" user response summary

2002-03-07 Thread Daniel Pittman
On Thu, 7 Mar 2002, Scott Doty wrote: > On Fri, Mar 01, 2002 at 09:50:03PM -0800, Rob McMillin wrote regarding > the "FROM_SPAMLAND" test: > ] http://www.geocrawler.com/lists/3/SourceForge/11679/350/7984404/ > >> /\.(?:kr|cn|cl|ar|hk|il|th|tw|sg|za|tr|ma|ua|in|pe)(?:[\s\)\]]|$)/ >> Let the spear-
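
For reference, the pattern quoted in the message above can be tried outside SpamAssassin. What follows is a minimal Python sketch, not the rule as it would be implemented in SA. The function name `hits_from_spamland` and the sample strings are hypothetical, and since the excerpt does not show which header the test is applied to, plain host-name fragments are used here.

```python
import re

# The pattern as quoted in the thread, transcribed to Python syntax.
# It looks for a dot, one of the listed country codes, and then
# whitespace, ')' or ']', or the end of the string.
FROM_SPAMLAND = re.compile(
    r"\.(?:kr|cn|cl|ar|hk|il|th|tw|sg|za|tr|ma|ua|in|pe)(?:[\s\)\]]|$)"
)


def hits_from_spamland(text: str) -> bool:
    """Return True if the quoted pattern matches anywhere in `text`.

    Hypothetical helper for illustration only; it is not part of SA.
    """
    return FROM_SPAMLAND.search(text) is not None


# Hypothetical samples using reserved example domains and addresses.
samples = [
    "from mail.example.co.kr (mail.example.co.kr [192.0.2.1])",
    "from mx.example.com (mx.example.com [192.0.2.2])",
    "lists.example.org.in",
    "mail.info.example.net",
]

for sample in samples:
    print(hits_from_spamland(sample), sample)
```

As quoted, the pattern keys on the country-code suffix alone, so any mail from a host under those domains would hit, legitimate or not. That is the false-positive concern raised in the replies. The trailing group keeps it from firing on longer labels that merely begin with one of the codes, such as ".info".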