On Tue, 1 Jun 2010, Chris Conn wrote:
John Hardin wrote:
On Tue, 1 Jun 2010, Chris Conn wrote:
> I upgraded to SA 3.3.1 on a CentOS system using Perl 5.8.5 and I
> occasionally get this error;
>
> Malformed UTF-8 character (unexpected non-continuation byte 0x00,
> immediately after start byte 0xc3) in pattern match (m//) at
> /var/lib/spamassassin/3.003001/updates_spamassassin_org/72_active.cf,
> rule __HUSH_HUSH, line 1, <GEN272> line 528.
>
> What can I check to correct this?
I'll fix that, thanks for mentioning it.
SA is somewhat inconsistent about whether or not it complains about
malformed UTF-8 characters, as illustrated by your only occasionally
getting that error. I get no complaints about that rule here when testing
my sandbox...
Hopefully its the regexp that can be modified and not that it will
consistently error-out on my few RH4/CentOS4 boxes I run ;) RH
maintains the same version for the entire life of the distro for
dependancies so upgrading out of RedHat is most often painful.
Yes, it's a fairly simple modification to the regex that contains the
UTF-8 multibyte character sequence. Perl is just getting confused handling
it properly when the byte sequence is bare (e.g. \xc3\xa9) so making it a
sequence of one-character character sets ([\xc3][\xa9]) fixes that problem
without materially altering the RE.
I had to fix this for _some_ of the UTF-8 sequences here, but others were
being handled properly so I was lazy and didn't change them all. For that
I apologize.
I've committed the fix, it will go out with the next sa-update.
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhar...@impsec.org FALaholic #11174 pgpk -a jhar...@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
North Korea: the only country in the world where people would risk
execution to flee to communist China. -- Ride Fast
-----------------------------------------------------------------------
5 days until the 66th anniversary of D-Day