# New Ticket Created by  Helmut Wollmersdorfer 
# Please include the string:  [perl #64918]
# in the subject line of all future correspondence about this issue. 
# <URL: http://rt.perl.org/rt3/Ticket/Display.html?id=64918 >


Hi,

rakudo$ ./perl6 -e '"\c[LATIN SMALL LETTER A WITH DIAERESIS,COMBINING 
CEDILLA]";'
Malformed UTF-8 string

current instr.: 'parrot;PAST;Compiler;escape' pc 9067 (src/POST/Node.pir:90)
called from Sub 'parrot;PAST;Compiler;escape' pc 1731 
(src/PAST/Compiler.pir:188)
called from Sub 'parrot;PAST;Compiler;as_post' pc 8758 
(src/PAST/Compiler.pir:2313)
called from Sub 'parrot;PAST;Compiler;post_children' pc 2185 
(src/PAST/Compiler.pir:415)
called from Sub 'parrot;PAST;Compiler;as_post' pc 2600 
(src/PAST/Compiler.pir:602)
called from Sub 'parrot;PAST;Compiler;post_children' pc 2185 
(src/PAST/Compiler.pir:415)
called from Sub 'parrot;PAST;Compiler;as_post' pc 3633 
(src/PAST/Compiler.pir:866)
called from Sub 'parrot;PAST;Compiler;post_children' pc 2185 
(src/PAST/Compiler.pir:415)
called from Sub 'parrot;PAST;Compiler;pirop' pc 4256 
(src/PAST/Compiler.pir:1044)
called from Sub 'parrot;PAST;Compiler;post_children' pc 2185 
(src/PAST/Compiler.pir:415)
called from Sub 'parrot;PAST;Compiler;as_post' pc 3633 
(src/PAST/Compiler.pir:866)
called from Sub 'parrot;PCT;HLLCompiler;compile' pc 428 
(src/PCT/HLLCompiler.pir:301)
called from Sub 'parrot;PCT;HLLCompiler;eval' pc 920 
(src/PCT/HLLCompiler.pir:519)
called from Sub 'parrot;PCT;HLLCompiler;command_line' pc 1510 
(src/PCT/HLLCompiler.pir:798)
called from Sub 'parrot;Perl6;Compiler;main' pc 23985 (perl6.pir:164)

This works:
rakudo$ ./perl6 -e '"\c[LATIN CAPITAL LETTER D,COMBINING DOT BELOW,COMBINING 
DOT ABOVE,COMBINING HORN]";'

My versions:
rakudo$ ./perl6 -v
This is Rakudo Perl 6, revision 37980 built on parrot 1.0.0-devel
for i486-linux-gnu-thread-multi

rakudo$ icu-config --unicode-version
5.1

In comparison the following Perl 8.10 script does not croak:

use strict; 
use warnings; 

use charnames qw(:full); 

my $s= "\N{LATIN SMALL LETTER A WITH DIAERESIS}\N{COMBINING CEDILLA}";

use Encode;
decode("utf8",encode("utf8",$s,1),1);

Helmut Wollmersdorfer

Reply via email to