On Tue, 1 Jun 2010, Chris Conn wrote:

John Hardin wrote:
 On Tue, 1 Jun 2010, Chris Conn wrote:

> I upgraded to SA 3.3.1 on a CentOS system using Perl 5.8.5 and I > occasionally get this error; > > Malformed UTF-8 character (unexpected non-continuation byte 0x00, > immediately after start byte 0xc3) in pattern match (m//) at > /var/lib/spamassassin/3.003001/updates_spamassassin_org/72_active.cf, > rule __HUSH_HUSH, line 1, <GEN272> line 528. > > What can I check to correct this?

 I'll fix that, thanks for mentioning it.

 SA is somewhat inconsistent about whether or not it complains about
 malformed UTF-8 characters, as illustrated by your only occasionally
 getting that error. I get no complaints about that rule here when testing
 my sandbox...

Hopefully its the regexp that can be modified and not that it will consistently error-out on my few RH4/CentOS4 boxes I run ;) RH maintains the same version for the entire life of the distro for dependancies so upgrading out of RedHat is most often painful.

Yes, it's a fairly simple modification to the regex that contains the UTF-8 multibyte character sequence. Perl is just getting confused handling it properly when the byte sequence is bare (e.g. \xc3\xa9) so making it a sequence of one-character character sets ([\xc3][\xa9]) fixes that problem without materially altering the RE.

I had to fix this for _some_ of the UTF-8 sequences here, but others were being handled properly so I was lazy and didn't change them all. For that I apologize.

I've committed the fix, it will go out with the next sa-update.

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  North Korea: the only country in the world where people would risk
  execution to flee to communist China.                  -- Ride Fast
-----------------------------------------------------------------------
 5 days until the 66th anniversary of D-Day

Reply via email to