On Wed, Aug 15, 2012 at 09:31:40AM +0300, Henrik K wrote: > On Tue, Aug 14, 2012 at 09:20:26PM -0700, John Evans wrote: > > On 2012-08-14 21:13, Kevin A. McGrail wrote: > > >>Here's the output of -D -t on the file. I let it run for about > > >>10 minutes before giving up and killing the process. > > > > > >Out of interest, can you let it run longer? Say an hour just to see > > >if does finish processing? > > > > > >regards, > > >KAM > > > > You bet! I'll fire it up in a screen session tonight (about to go to > > bed) and check on it in the morning when I get to work. That should > > give it PLENTY of time to finish up if it's going to finish. > > > > I'll be in touch tomorrow with (hopefully) more information. > > Nothing new about this problem, it's "well known". RTF incorrectly marked as > text/plain will be scanned as body and regexps go wild from all the > formatting. > > https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6582 > https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6584
Correction, anything marked text/* will be scanned.. anyways, if you want to fix this, you can try applying my patch which is found there.