On Aug 29, 2009, at 5:25 PM, kcrisman wrote:

> On Aug 29, 6:45 pm, Robert Dodier <robert.dod...@gmail.com> wrote:
>> Minh Nguyen wrote:
>>> Usually "an" comes before a word that starts with a vowel, i.e a, e,
>>> i, o, u. So one would say "an eight o'clock meeting" or "an 8  
>>> o'clock
>>> meeting". More examples: an amphibian, an egg, an igloo, an octopus,
>>> an umbrella. However, there are situations when this rule doesn't
>>> apply. In software engineering, one uses UML diagrams as part of the
>>> design process. Although this acronym starts with a capital "u",  
>>> it's
>>> pronounced and written as "a UML diagram" not "an UML diagram", just
>>> as in "a ewe" not "an ewe".
>>
>> In American English at least (I just don't know about other  
>> varieties)
>> it is typical to change some vowels into diphthongs, in particular to
>> change initial u into iu instead. In iu, i acts as a semivowel,
>> and it's typical to use the article "a" in front of a word beginning
>> with iu, e.g. a unicorn. But e.g. "urn" doesn't have the initial
>> semivowel,
>> so the article is "an", so "an urn".
>>
>> The other semivowel (there might be still others, but I can't think
>> of them at the moment) is w as in "one". As with semivocalic i,
>> the article for semivocalic w is "a", e.g. a one-time deal.
>>
>
> Yes.  And don't forget things like "an historical novel" versus "a
> historical novel", depending on what part of the US you are from (no
> idea for other parts).  But at any rate, a versus an is purely
> phonetic.  At least in theory, there are also two pronunciations of
> "the" (thee and thuh), depending on the same input.  Thee angel, thuh
> time.
>
> Well, whatever; definitely no good algorithm! Especially in the land
> of abbreviations and letters that mathematics is.  Should we maybe use
> other articles for other alphabets, ό α or א ה or something?

The generic problem is hard (and in practice typically solved via  
heuristics + an exception lookup table), but for numbers, an integer - 
 > English words algorithm is very easy to write, and there are very  
few words that can come up first, so this particular case isn't too  
difficult.

>> In all varieties of English, there is a pretty broad gulf between
>> orthography and pronunciation. (I don't know if the orthography was
>> fixed ages ago and pronunciation continued to evolved, or if they
>> were never really aligned to begin with.) In a fantasy world, you
>> would see that urn has a different initial letter than unicorn, but
>> for now you just have to listen to the pronunciation to figure out
>> the appropriate article.
>
> Orthography is horrible because so much of the vocabulary is from
> Anglo-Saxon or old French, but we barely pronounce anything like
> Icelandic.  For a great example, see http://www.etymonline.com/ 
> index.php?term=enough
> - a lot of initial "g"s became "y"s or disappeared, and I can only
> assume the final g was originally pronounced that way since it still
> is in German.

Even worse, most of our words come from latin/old French, but most of  
our common words and grammar come from the Celts and the Goths.  
(Interestingly, this was originally a class distinction between the  
nobles and the peasants, which is why we have beef (for those who ate  
them)/cows (for those who raised them), etc.) Language evolution can  
create strange (but usually algorithmic) rules, but this hybrid  
history is what causes so many irregularities. English historically  
has been more accepting of importing foreign words as well, which  
doesn't help things.

> Also, Isn't French another language whose pronunciation
> is only related to orthography by very complicated or non-algorithmic
> rules?  Because it's still written like it was centures ago?  I feel
> like I've heard that somewhere.

French has rather complicated rules, in which many letters become  
silent much of the time, but it tends to be very regular at least.

> So English isn't unique like that, though it's probably the only  
> language you can really have a spelling bee in :)

In France the equivalent of a spelling bee is a "dictation" in which  
the challenger reads a paragraph or two (typically from a famous  
work) and you copy it down. The challenge is filling in all these  
silent letters, especially as many of them can only be deduced in  
context and by agreement with other parts of the text. (For example,  
as in English, the generic way to make things plural in French is to  
add an 's' to the end, but this 's' is usually silent, so the only  
way you'd know it's there is by context.)

> Anyway, interesting thread.  I assume that there are no algorithms for
> Chomskian transformational grammars in Sage yet, but maybe someone
> should volunteer.

Some day I'm sure it'll happen. The transformations themselves are  
usually pretty simple, it's coming up with a minimal set of  
transformations that describe a huge set of phenomena that's the  
challenge.

- Robert


--~--~---------~--~----~------------~-------~--~----~
To post to this group, send email to sage-support@googlegroups.com
To unsubscribe from this group, send email to 
sage-support-unsubscr...@googlegroups.com
For more options, visit this group at 
http://groups.google.com/group/sage-support
URLs: http://www.sagemath.org
-~----------~----~----~----~------~----~------~--~---

Reply via email to