Hi Jason, thanks for your test cases. However, I don't think that binmode provides an acceptable solution, at least not alone. While it ensures that the strings are valid utf-8 strings, it will convert any valid utf-8 character to two "garbage" characters. Try
$ ./utf8_test.pl testlog
(see attached files)
I'm not really sure what a proper solution is. But I'm actually not yet
fully convinced that there is a problem logwatch should solve. I will
ask Debian's security team for advice.
WM
Am 2016-12-30 um 20:26 schrieb Jason Pyeron:
> A very rudimentary test:
>
> /projects/logwatch
> $ perl -e 'for ($i=0; $i<256; ++$i) {print chr($i);}' | hexdump.exe -C
> 00000000 00 01 02 03 04 05 06 07 08 09 0a 0b 0c 0d 0e 0f |................|
> 00000010 10 11 12 13 14 15 16 17 18 19 1a 1b 1c 1d 1e 1f |................|
> 00000020 20 21 22 23 24 25 26 27 28 29 2a 2b 2c 2d 2e 2f | !"#$%&'()*+,-./|
> 00000030 30 31 32 33 34 35 36 37 38 39 3a 3b 3c 3d 3e 3f |0123456789:;<=>?|
> 00000040 40 41 42 43 44 45 46 47 48 49 4a 4b 4c 4d 4e 4f |@ABCDEFGHIJKLMNO|
> 00000050 50 51 52 53 54 55 56 57 58 59 5a 5b 5c 5d 5e 5f |PQRSTUVWXYZ[\]^_|
> 00000060 60 61 62 63 64 65 66 67 68 69 6a 6b 6c 6d 6e 6f |`abcdefghijklmno|
> 00000070 70 71 72 73 74 75 76 77 78 79 7a 7b 7c 7d 7e 7f |pqrstuvwxyz{|}~.|
> 00000080 80 81 82 83 84 85 86 87 88 89 8a 8b 8c 8d 8e 8f |................|
> 00000090 90 91 92 93 94 95 96 97 98 99 9a 9b 9c 9d 9e 9f |................|
> 000000a0 a0 a1 a2 a3 a4 a5 a6 a7 a8 a9 aa ab ac ad ae af |................|
> 000000b0 b0 b1 b2 b3 b4 b5 b6 b7 b8 b9 ba bb bc bd be bf |................|
> 000000c0 c0 c1 c2 c3 c4 c5 c6 c7 c8 c9 ca cb cc cd ce cf |................|
> 000000d0 d0 d1 d2 d3 d4 d5 d6 d7 d8 d9 da db dc dd de df |................|
> 000000e0 e0 e1 e2 e3 e4 e5 e6 e7 e8 e9 ea eb ec ed ee ef |................|
> 000000f0 f0 f1 f2 f3 f4 f5 f6 f7 f8 f9 fa fb fc fd fe ff |................|
> 00000100
>
> /projects/logwatch
> $ perl -e 'binmode(STDOUT, ":utf8"); for ($i=0; $i<256; ++$i) {print STDOUT
> chr($i);}' | hexdump.exe -C
> 00000000 00 01 02 03 04 05 06 07 08 09 0a 0b 0c 0d 0e 0f |................|
> 00000010 10 11 12 13 14 15 16 17 18 19 1a 1b 1c 1d 1e 1f |................|
> 00000020 20 21 22 23 24 25 26 27 28 29 2a 2b 2c 2d 2e 2f | !"#$%&'()*+,-./|
> 00000030 30 31 32 33 34 35 36 37 38 39 3a 3b 3c 3d 3e 3f |0123456789:;<=>?|
> 00000040 40 41 42 43 44 45 46 47 48 49 4a 4b 4c 4d 4e 4f |@ABCDEFGHIJKLMNO|
> 00000050 50 51 52 53 54 55 56 57 58 59 5a 5b 5c 5d 5e 5f |PQRSTUVWXYZ[\]^_|
> 00000060 60 61 62 63 64 65 66 67 68 69 6a 6b 6c 6d 6e 6f |`abcdefghijklmno|
> 00000070 70 71 72 73 74 75 76 77 78 79 7a 7b 7c 7d 7e 7f |pqrstuvwxyz{|}~.|
> 00000080 c2 80 c2 81 c2 82 c2 83 c2 84 c2 85 c2 86 c2 87 |................|
> 00000090 c2 88 c2 89 c2 8a c2 8b c2 8c c2 8d c2 8e c2 8f |................|
> 000000a0 c2 90 c2 91 c2 92 c2 93 c2 94 c2 95 c2 96 c2 97 |................|
> 000000b0 c2 98 c2 99 c2 9a c2 9b c2 9c c2 9d c2 9e c2 9f |................|
> 000000c0 c2 a0 c2 a1 c2 a2 c2 a3 c2 a4 c2 a5 c2 a6 c2 a7 |................|
> 000000d0 c2 a8 c2 a9 c2 aa c2 ab c2 ac c2 ad c2 ae c2 af |................|
> 000000e0 c2 b0 c2 b1 c2 b2 c2 b3 c2 b4 c2 b5 c2 b6 c2 b7 |................|
> 000000f0 c2 b8 c2 b9 c2 ba c2 bb c2 bc c2 bd c2 be c2 bf |................|
> 00000100 c3 80 c3 81 c3 82 c3 83 c3 84 c3 85 c3 86 c3 87 |................|
> 00000110 c3 88 c3 89 c3 8a c3 8b c3 8c c3 8d c3 8e c3 8f |................|
> 00000120 c3 90 c3 91 c3 92 c3 93 c3 94 c3 95 c3 96 c3 97 |................|
> 00000130 c3 98 c3 99 c3 9a c3 9b c3 9c c3 9d c3 9e c3 9f |................|
> 00000140 c3 a0 c3 a1 c3 a2 c3 a3 c3 a4 c3 a5 c3 a6 c3 a7 |................|
> 00000150 c3 a8 c3 a9 c3 aa c3 ab c3 ac c3 ad c3 ae c3 af |................|
> 00000160 c3 b0 c3 b1 c3 b2 c3 b3 c3 b4 c3 b5 c3 b6 c3 b7 |................|
> 00000170 c3 b8 c3 b9 c3 ba c3 bb c3 bc c3 bd c3 be c3 bf |................|
> 00000180
>
> This confirms that binmode utf8 is needed to print out the full ASCII range.
>
>> -----Original Message-----
>> From: Jason Pyeron [mailto:[email protected]]
>> Sent: Friday, December 30, 2016 14:03
>> To: Jason Pyeron; 'Willi Mann'; [email protected]
>> Cc: [email protected]; [email protected];
>> 'Klaus Ethgen'
>> Subject: RE: [Logwatch-devel] Bug#849531: Possible security
>> problem,new logwatch sends mails with charset UTF-8
>>
>> I have opened https://sourceforge.net/p/logwatch/bugs/56/ .
>>
>> I am working a test case for this right now.
>>
>> As I see it, there are 3 paths to test.
>>
>> Output as STDOUT, file, and email. In each case does an 8bit
>> value (0x00..0xff unsigned) result in a valid UTF-8 character.
>>
>> Is binmode(STDOUT, ":utf8") needed? Does it fix the issue if
>> it was needed?
>>
>>>> -----Original Message-----
>>>> From: Willi Mann
>>>> Sent: Friday, December 30, 2016 12:18
>>>> To: logwatch-devel
>>>> Cc: [email protected];
>> [email protected]; Klaus Ethgen
>>>> What would be your suggested fix?
>>
>>
>> $ git show f9db5949c58321175bda66310156f43ae607109f
>> commit f9db5949c58321175bda66310156f43ae607109f
>> Author: bjorn <[email protected]>
>> Date: Sat Oct 15 17:38:40 2016 -0700
>>
>> Changed encoding to UTF-8, as suggested by Goran Uddeborg.
>>
>> diff --git a/scripts/logwatch.pl b/scripts/logwatch.pl
>> index 0f863dc..0167755 100755
>> --- a/scripts/logwatch.pl
>> +++ b/scripts/logwatch.pl
>> @@ -1162,9 +1162,9 @@ sub initprint {
>> }
>> #Config{output} html
>> if ( $Config{'format'} eq "html" ) {
>> - $out_mime .= "Content-Type: text/html;
>> charset=\"iso-8859-1\"\n\n";
>> + $out_mime .= "Content-Type: text/html;
>> charset=\"UTF-8\"\n\n";
>> } else {
>> - $out_mime .= "Content-Type: text/plain;
>> charset=\"iso-8859-1\"\n\n";
>> + $out_mime .= "Content-Type: text/plain;
>> charset=\"UTF-8\"\n\n";
>> }
>>
>> if ($Config{'hostformat'} eq "split") { #8.0 check
>> hostlimit also? or ne none?
>>
>>
übersät
utf8_test.pl
Description: Perl program

