Theo Van Dinter wrote:
> On Tue, Oct 14, 2008 at 09:32:42PM -0400, Matt Kettler wrote:
>   
>> Agreed. However, there are times where it's not worth fixing a FP case
>> because its a rare case caused by a small-scale broken tool that
>> violates the specs.
>>     
>
> FWIW, this case is specifically looked for by the rule.
>
>   
In general, I'm not advocating dropping this aspect of the rule, just
moving it to a separate test where it can stand alone and be scored on
its own merits.
>> Of course, I lack the time right now to write it, so unless Theo or
>> someone else wants to jump on it, I'm sure patches are welcome.
>>     
>
> I'm positive we've gone through this before, but I lack the time to go
> searching for the discussion/bugs.  One that came up in a quick bugzilla
> search:
>
> https://issues.apache.org/SpamAssassin/show_bug.cgi?id=4234
>
> The results at the time were that spam hits went down by so much, the rule
> became worse.  Score generation is expected to deal with the FPs.
>
>   


Well, that alone sounds like a good reason to split the rule. Those
results suggest that SINGLEPART_MULTIPART would be a better performing
rule, and probably deserves to be singled out so it can be scored higher.

Of course, the split would have the opposite affect of what the poster
desires, but it would be a good, conclusive point to stand on. This test
shows that this exact property occurs rarely in nonspam, and frequently
in spam. It also would allow the score to readily adapt to changes in
the FP ratio.







Reply via email to