On Mon, 2010-09-06 at 15:10 +0200, Pierre Nugues wrote: > > I wrote a simple tokenizer for texts containing Latin9 characters. It > does not behave as expected with the Swedish text below and I would > like to find a workaround.
Add these lines to top of your program: use strict; use warnings; binmode STDIN, 'encoding(utf8)'; binmode STDOUT, 'encoding(utf8)'; -- Just my 0.00000002 million dollars worth, Shawn Programming is as much about organization and communication as it is about coding. The secret to great software: Fail early & often. Eliminate software piracy: use only FLOSS. -- To unsubscribe, e-mail: beginners-unsubscr...@perl.org For additional commands, e-mail: beginners-h...@perl.org http://learn.perl.org/