On Fri, 08 Mar 2002, Rob McMillin wrote:
> Justin Mason wrote:
>
>> Mind you, I don't think this is a good idea; it will make SA even
>> more westerner-oriented. :( Pretty much all the GA corpus is from
>> western
>>sources and in western charsets, so the GA will totally skew it.
>
> Further: the
Justin Mason wrote:
>Mind you, I don't think this is a good idea; it will make SA even more
>westerner-oriented. :( Pretty much all the GA corpus is from western
>sources and in western charsets, so the GA will totally skew it.
>
Further: the spam tests, body and keyword match, are virtually 10
(delurking in a net cafe somewhere in Oz ;)
>> * Default score = 0
> I think that's probably a good idea for the test as it stands because
> it's a fairly uncontrolled score applied equally to a /large/
> proportion of the world.
I agree. If the test is added it should be 0 by default.
>> * Se
On Fri, 8 Mar 2002, Daniel Pittman wrote:
>
> ...and should I mention that I regularly see non-SPAM from about half of
> those domains in lists that I am on?
I think that's the nub really - you're going to see false positives with
this rule, and the corpus may or may not show that up depending o
Hmmm, well if you do then here will will have to turn that test off
first thing. We are an international company with distributors or
offices in at least 4 of those domains.
--
Kent Hamilton <[EMAIL PROTECTED]>
Manager - Systems Admin & Networking
Hunter Engineering Company
> -Original Me
On Thu, 7 Mar 2002, Scott Doty wrote:
> On Fri, Mar 01, 2002 at 09:50:03PM -0800, Rob McMillin wrote regarding
> the "FROM_SPAMLAND" test:
> ] http://www.geocrawler.com/lists/3/SourceForge/11679/350/7984404/
>
>> /\.(?:kr|cn|cl|ar|hk|il|th|tw|sg|za|tr|ma|ua|in|pe)(?:[\s\)\]]|$)/
>> Let the spear-