having problem finding the correct regexp using with split()

2004-12-17 Thread tim23456
Hi all, again I am sending a message to this list, hoping you will bear with me and my novice question. In my script I would like to split a single scalar containing a random passage of text into a list that holds the words of the text. What I found in the Perl documentation is the following example

Re: combining getc() and unicode strings problem?

2004-12-17 Thread tim23456
Hello Jonathan, all, thank you for your kind response. [snip]
> I never use getc...
>
> In C, where getc originates, the getc function returns a
> char type. The C char type is almost always 8 bits long.
> By definition it doesn't support unicode, so neither does
> Perl. It would be nice if they

Re: combining getc() and unicode strings problem?

2004-12-16 Thread tim23456
Hello Jonathan, all
> Not had the misfortune to need to play with this stuff, but I guess
> the documentation for perl is a good place to start:
> [snip]
Yes, I have read these man pages more than once now (at different times), so I think I should not have missed anything. The perl-manpages

combining getc() and unicode strings problem?

2004-12-16 Thread tim23456
Hello, I have searched the web intensively for a solution to the following problem, but could not find any pointers. The following code does basically nothing other than read in a file one character at a time and write it to a file again. The input file is encoded as UTF-8 as well as