Hi

Could U+E0001 LANGUAGE TAG become undeprecated please? There is a good reason why I ask

There is a German song, Lorelei, and I searched to find an English translation.

I found the following video.

https://www.youtube.com/watch?v=lJ3JhxOUbw0

The video is an instrumental version and is particularly interesting is that there are lyrics displayed in four languages, with two versions of the translation in English.

Being a native speaker of English and living in England I first watched the video viewing just the version labelled British:. Later I played the video again and I just viewed the version labelled U.S..

Remembering that I had some time ago heard a version in Esperanto, I searched nd found the two following videos.

https://www.youtube.com/watch?v=reUpdGgdBsA

https://www.youtube.com/watch?v=7dHhTXDmP0k

They may be of the same recording. This first has in its notes the text of the lyrics.

The song in Esperanto has the rather expressive Esperanto word belega in it. This single word, an adjective, is composed from the Esperanto word bela which means beautiful augmented with the Esperanto word-building component -eg- that modifies the word to which it is an augmentation to indicate greatness. So the word belega expresses in one three-syllable Esperanto word the concept that is in English "greatly beautiful".

http://esperanto.davidgsimpson.com/eo-affixes.html

Thinking of the first video to which I linked, it occurred to me that if a plain text message were sent containing each of two or more versions of the same text, for whatever text, probably a short message in practice, each in a different language from the other or others, with the language of a particular version preceded by a tag sequence: then software at the receiving end could be set to a chosen language and only text in that language would be displayed.

Thinking around this idea I thought that this could be very useful in The Internet of Things for machine to human communication, whereby, if, say, an end user (human) is wanting to dialogue with a device (thing) then the technique could be used to send the message

Please enter the password

from the thing in a number of languages. The decoding software in the end user's computer could use the first message in the list as the default if the sequence sent by the thing does not have a version for the particular language set by the end user in his or her computer.

The list of languages supported by a particular thing would not be specified by a universal standard, but could perhaps have English, French, German and one or more others depending up the location and application of the thing. Any language expressible in Unicode could be included in the list.

Support for Unicode characters beyond plane 0 is much more obtainable in software these days.

I know that people have been urged to use a higher level protocol for indicating in language documents, but please consider if one is wanting to assemble automatically a status report by combing reports from each of a number of mutually independent sensors on the Internet of Things, each of relatively small size, located in a variety of physical locations perhaps miles apart. In such a case the concatenation of such plain text sequences would be straightforward.

Such an undeprecating of U+E0001 LANGUAGE TAG would, in my opinion, contribute to the development of The Internet of Things.

William Overington

Monday 10 February 2020

Reply via email to