On Tue, 2005-04-12 at 07:42, David Cantrell wrote:

> > You might argue that Language::Dutch should never ship with the core...
> > that's a valid opinion, but SOMEONE is going to write it. It'd be a kind
> > of strange form of censorship for CPAN not to accept it. After all,
> > there's more than one way to say it... isn't there?
> 
> While it may be possible to do it, and while it may be an interesting
> exercise to implement it, that doesn't mean that anyone *using* it for
> anything other than a joke isn't a blithering idiot.

We're all programmers (I assume), and we've all come across environments
where programming tools that seemed useless or silly in one environment
were critical in another. The education example I gave is an ideal one,
but there are dozens of others from any number of fields. Perl 6 offers
us the opportunity to create true dialects, and dialects are valuable
things.

I don't recommend that everyone rush in and use such a module, but it
has its value.

> > > I'm not even sure I like the *possibility* of using non-ascii letters in
> > > identifiers, even.
> > I think we already have Latin-1 in identifiers...
> 
> more's the pity.

This is just ASCII prejudice (which seems strange, as your domain
implies you're not here in the US where such prejudice is usually
assumed). There's nothing fundamentally wrong with having $Ãther as a
variable name, other than the fact that most Americans are confused by
it because they think "Ã" and "ae" are identical. They're wrong. We've
already deeply embraced the Latin-1 character set for many purposes in
Perl 6.

I *would* however, have a "use warning <unicode>" type of thing which
would warn on the use of any Unicode character in an identifier whose
standard display form is difficult to distinguish from some more
commonly used character's display form. For example, "Æ" is just a bit
too close to "b", so you might want to warn the user unless their
default locale was one where it's commonly used, or at least provide a
facility for requesting such warnings.

> > Let's see about UTF-8....
> >         pugs> my $??? = 1;
> >         undef
> >         pugs> $???;
> >         1
> 
> I see a sequence of three question marks, there's no funny foreign
> characters there.  I have to confess to being surprised that $??? is
> legal.

That's either a bug in your mailer (or possibly something that tried to
re-encode your mail for you) or the fact that you're using some ancient
output device that doesn't grok Unicode character sequences. It's
refusing to display perfectly valid "Content-Type: text/plain;
charset=UTF-8" mail in "Content-Transfer-Encoding: quoted-printable"
correctly. It should render "pugs> my $=E0=BA=81 =3D 1;" as the
appropriate Unicode sequence.

Certainly we've already stated that the default encoding of Perl 6
programs will be UTF-8, and that means that such sequences are valid,
and all editors I know of handle these sequences correctly (Emacs, vim,
eclipse, etc.)

I don't see why stopping someone in Russia from writing code in their
native syllabary is wrong... after all, they're probably not going to
use ASCII to represent words that I can read anyway... they're going to
use ASCII to write a romanized form of their own language. Restrictions
on style should be performed at a higher level. For example the new CPAN
for Perl 6 might have some qualifiers that you can attach to your
programs that describe character set used and other stylistic
conventions. That way, I might choose to look for only modules that are
written in a language that I feel comfortable maintaining. Then again, I
might not care, and just install whatever works.

-- 
Aaron Sherman <[EMAIL PROTECTED]>
Senior Systems Engineer and Toolsmith
"It's the sound of a satellite saying, 'get me down!'" -Shriekback


Reply via email to