Re: [9fans] utf-8 handling oddities

2023-10-14 Thread LdBeth
> In <1597a7b3-09d5-443f-b372-8b28f5f2b...@aaoth.xyz> > la-ninpre wrote: la-ninpre> if i understand it correctly, unicode extended past the BMP la-ninpre> in 1996 with the release of unicode 2.0. plan 9 had two la-ninpre> editions released after that, but, of course assuming that la-ni

[9fans] utf-8 handling oddities

2023-10-13 Thread la-ninpre
greetings, 9fans. recently i have been studying utf-8 encoding and decided to look at how it is handled in plan 9. i thought that since plan 9 was the first application of this encoding, it makes sense to look at its implementation. the fact that mentioned implementation was done by designers o

Re: [9fans] UTF-8 characters in acme

2021-09-09 Thread revcomninos
Thanks so much! -- 9fans: 9fans Permalink: https://9fans.topicbox.com/groups/9fans/T0d54d4b593281935-M872a6411e5772d69166bf414 Delivery options: https://9fans.topicbox.com/groups/9fans/subscription

Re: [9fans] UTF-8 characters in acme

2021-09-09 Thread Steve Simon
pretty much the only major change to sam since 87 was (i believe) the migration from ASCII (on Unix v10 and p9 Ed1) to utf-8. if you are not seeing unicode characters it is most probably because you don’t have a font with the appropriate glyphs. -Steve ---

Re: [9fans] UTF-8 characters in acme

2021-09-09 Thread revcomninos
Thanks for clarifying. Now why does sam not display utf-8 characters on my setup?  -- 9fans: 9fans Permalink: https://9fans.topicbox.com/groups/9fans/T0d54d4b593281935-M73d5efdb47296d7869a25279 Delivery options: https://9fans.topicbox.com/groups/9fans/subsc

Re: [9fans] UTF-8 characters in acme

2021-09-09 Thread Ethan Gardener
On Thu, Sep 9, 2021, at 1:46 PM, revcomni...@gmail.com wrote: > I came to this impression after reading: > "Sam edits uninterpreted ASCII text." > http://doc.cat-v.org/plan_9/4th_edition/papers/sam/ Understandable, but you missed the last paragraph of the abstract, which reads, "This paper is rep

Re: [9fans] UTF-8 characters in acme

2021-09-09 Thread revcomninos
I came to this impression after reading: "Sam edits uninterpreted ASCII text." http://doc.cat-v.org/plan_9/4th_edition/papers/sam/ -- 9fans: 9fans Permalink: https://9fans.topicbox.com/groups/9fans/T0d54d4b593281935-Mf25e9d16a3b91bb3c67546e2 Delivery options

Re: [9fans] UTF-8 characters in acme

2021-09-09 Thread Maurizio Boriani
umbrati...@prosimetrum.com writes: >> How to write UTF-8 caracters > > keyboard(6) works very well! What pleasantly! Compose key and mnemonics are fantastic idea! -- Maurizio Boriani PGP key: 0xCC0FBF8F -- 9fans: 9fans Permalink: https://9fans.topi

Re: [9fans] UTF-8 characters in acme

2021-09-07 Thread Frank D. Engel, Jr.
Seems rather unlikely considering that UTF-8 was originally invented for Plan 9? Turns out it is the same as anywhere else in Plan 9 - tap ALT (not hold) then the letter x then the hex code for the rune you are trying to enter and a semicolon (if fewer than 6 hexadecimal nybbles). Example:

Re: [9fans] UTF-8 characters in acme

2021-09-07 Thread revcomninos
I was under the impression you could only use ASCII. -- 9fans: 9fans Permalink: https://9fans.topicbox.com/groups/9fans/T0d54d4b593281935-Med338a73860dc960cac21090 Delivery options: https://9fans.topicbox.com/groups/9fans/subscription

Re: [9fans] UTF-8 characters in acme

2021-09-07 Thread umbraticus
> How to write UTF-8 caracters keyboard(6) -- 9fans: 9fans Permalink: https://9fans.topicbox.com/groups/9fans/T0d54d4b593281935-Mb9d3caaf9c167393db658f43 Delivery options: https://9fans.topicbox.com/groups/9fans/subscription

[9fans] UTF-8 characters in acme

2021-09-07 Thread Maurizio Boriani
Hi all, may be a dummy question: How to write UTF-8 caracters in acme? Thanks in advance, -- Maurizio Boriani PGP key: 0xCC0FBF8F -- 9fans: 9fans Permalink: https://9fans.topicbox.com/groups/9fans/T0d54d4b593281935-M9ca70d5498411227cb9608b2 Delive

Re: [9fans] utf-8

2016-02-08 Thread Anthony Sorace
It isn't. MIDI variable-length quantities don't include an indication of length in a header, and the last byte in any multi-byte sequence is a valid single-byte value. It is more bit efficient than UTF-8, but I believe it lacks other properties around synchronization, possibly others. I'm less

Re: [9fans] utf-8

2016-02-07 Thread Bruce Ellis
Yes. But the midi version is utf-8. brucee On 7 February 2016 at 16:42, Skip Tavakkolian wrote: > but following that line of reasoning, aren't they all specialized versions > of Huffman encoding? > > > On Sat, Feb 6, 2016 at 6:04 PM Bruce Ellis wrote: > >> Not to burst a balloon but check out

Re: [9fans] utf-8

2016-02-06 Thread Skip Tavakkolian
but following that line of reasoning, aren't they all specialized versions of Huffman encoding? On Sat, Feb 6, 2016 at 6:04 PM Bruce Ellis wrote: > Not to burst a balloon but check out variable length ints in the Midi File > Format for utf-8 in the early 80s. > > brucee >

[9fans] utf-8

2016-02-06 Thread Bruce Ellis
Not to burst a balloon but check out variable length ints in the Midi File Format for utf-8 in the early 80s. brucee

Re: [9fans] UTF-8 font

2014-09-03 Thread erik quanstrom
On Tue Sep 2 02:47:03 EDT 2014, b...@mails.tsinghua.edu.cn wrote: > Hello 9fans, > > I am using 9front, a fork of plan9 in my ThinkPad. I know that plan9 > supports unicode but I still find some of Chinese characters display > funny. Is it a font issue or something? i would recommend cyberbit

Re: [9fans] UTF-8 font

2014-09-02 Thread Skip Tavakkolian
couldn't you use ttf2subf to get some truetype fonts for the code set that you need? On Tue, Sep 2, 2014 at 12:00 AM, Steve Simon wrote: > > Is it a font issue or something? > > exactly. > > It is rare to find a unicode font that has _very_ wide coverage, > though some plan9 fonts are better t

Re: [9fans] UTF-8 font

2014-09-02 Thread Steve Simon
> Is it a font issue or something? exactly. It is rare to find a unicode font that has _very_ wide coverage, though some plan9 fonts are better than others in this respect. This is not somthing I have used much so hopefully someone will chime in with font suggestions. -Steve

[9fans] UTF-8 font

2014-09-01 Thread Bao Cheng
Hello 9fans, I am using 9front, a fork of plan9 in my ThinkPad. I know that plan9 supports unicode but I still find some of Chinese characters display funny. Is it a font issue or something? Best wishes, Cheng

Re: [9fans] utf-8 on 9fans.net/archive

2010-06-22 Thread Joel C. Salomon
On Tue, Jun 22, 2010 at 3:33 PM, LiteStar numnums wrote: > The haiku is short a syllable on the first line, unless you pronounce it > "con-fu-sed" No, it's alright as it stands; see . (We discuss *everything* on this list, don't we?) —Joel Salomon

Re: [9fans] utf-8 on 9fans.net/archive

2010-06-22 Thread LiteStar numnums
On Tue, Jun 22, 2010 at 3:00 PM, EBo wrote: > > confused boxes > > emdashes turned into mud > > double conversion? > > > > http://9fans.net/archive/2010/06/187 > > should I read this as poetry or a question? > > EBo -- > > > The haiku is short a syllable on the first line, unless you prono

Re: [9fans] utf-8 on 9fans.net/archive

2010-06-22 Thread EBo
> confused boxes > emdashes turned into mud > double conversion? > > http://9fans.net/archive/2010/06/187 should I read this as poetry or a question? EBo --

[9fans] utf-8 on 9fans.net/archive

2010-06-22 Thread erik quanstrom
confused boxes emdashes turned into mud double conversion? http://9fans.net/archive/2010/06/187 - erik

Re: [9fans] utf-8 text files from httpd

2009-10-19 Thread lucio
> 2009/10/19 erik quanstrom : >> why try that hard?  just call it utf-8.  i can't think of >> any browsers that would have a problem with that today. > > the instance of the problem that i had was when > adding an attachment to a upas mail. > file -m is useful when the attachment might be > binary

Re: [9fans] utf-8 text files from httpd

2009-10-19 Thread erik quanstrom
On Mon Oct 19 10:36:51 EDT 2009, rogpe...@gmail.com wrote: > 2009/10/19 erik quanstrom : > > why try that hard?  just call it utf-8.  i can't think of > > any browsers that would have a problem with that today. > > the instance of the problem that i had was when > adding an attachment to a upas ma

Re: [9fans] utf-8 text files from httpd

2009-10-19 Thread roger peppe
2009/10/19 erik quanstrom : > why try that hard?  just call it utf-8.  i can't think of > any browsers that would have a problem with that today. the instance of the problem that i had was when adding an attachment to a upas mail. file -m is useful when the attachment might be binary.

Re: [9fans] utf-8 text files from httpd

2009-10-19 Thread erik quanstrom
On Mon Oct 19 09:51:33 EDT 2009, rogpe...@gmail.com wrote: > there's another problem with file -m that > i've been bitten by before: it ignores any > stuff after the first 6000 bytes. > > so if you've got a mostly-ascii file with some > utf-8 characters 8K in, then it won't be picked up. > > i th

Re: [9fans] utf-8 text files from httpd

2009-10-19 Thread roger peppe
there's another problem with file -m that i've been bitten by before: it ignores any stuff after the first 6000 bytes. so if you've got a mostly-ascii file with some utf-8 characters 8K in, then it won't be picked up. i think file -m should read the whole file, but that's just IMHO.

Re: [9fans] utf-8 text files from httpd

2009-10-19 Thread erik quanstrom
> Is the output of file(1) appropriate for this purpose? > Shouldn't your sample file also be sent as UTF-8? it should be. for example since ; echo ☺ | file stdin: short UTF text # sic one would expect that echo ☺ | file -m would yield text/plain; charset=utf-8. > file(1) speak

Re: [9fans] utf-8 text files from httpd

2009-10-19 Thread Kenji Arisawa
I think it is difficult to make web server to work correctly in case we have variety of charset text files on the server. Although we can manually select charset in browser menu, the selection is useless in case the page is written in Javascript that fills some portion of a page reading a tex

Re: [9fans] utf-8 text files from httpd

2009-10-19 Thread Akshat Kumar
new/sendfd.c:243 c old/sendfd.c:243 < --- > /* new/sendfd.c:246 c old/sendfd.c:246 < --- > */ (context: text/plain -> text/plain; charset=utf-8) Now my text files can be read in the proper encoding by default, and are not interpreted by browsers (as well as certain applications) to be whack ASCII

Re: [9fans] utf-8 text files from httpd

2009-10-19 Thread Eris Discordia
The decision whether to open in place or save to disk based on MIME type is up to the browser. For example, I set my browsers to ask to save to disk application/pdf documents (rather than opening them with Adobe Acrobat's problem plugin). A MIME type of text/plain (without any specification of

Re: [9fans] utf-8 text files from httpd

2009-10-18 Thread erik quanstrom
> Thus, hard coding "charset=utf-8" in http header will bring other > problem > because that coding disables a line in html header such as: > that should not be a problem on a plan 9 system; plan 9's character set is utf-8. - erik

Re: [9fans] utf-8 text files from httpd

2009-10-18 Thread Kenji Arisawa
we should note also http://www.w3.org/TR/html4/charset.html#h-5.2.2. the document says: To sum up, conforming user agents must observe the following priorities when determining a document's character encoding (from highest priority to lowest): 1. An HTTP "charset" parameter i

Re: [9fans] utf-8 text files from httpd

2009-10-18 Thread Kenji Arisawa
according to rfc2616, default charset in sending text file is ascii: The "charset" parameter is used with some media types to define the character set (section 3.4) of the data. When no explicit charset parameter is provided by the sender, media subtypes of the "text" type are defined

Re: [9fans] utf-8 text files from httpd

2009-10-18 Thread andrey mirtchovski
your mimetypes are probably maim-typed (heh). see /sys/lib/mimetype for a fix, or put this in your page's section: On Sun, Oct 18, 2009 at 6:34 PM, Akshat Kumar wrote: > I'm trying to put up a plain text file containing UTF-8 > characters from httpd, but when viewing it from any > br

Re: [9fans] utf-8 text files from httpd

2009-10-18 Thread erik quanstrom
On Sun Oct 18 20:37:23 EDT 2009, aku...@mail.nanosouffle.net wrote: > I'm trying to put up a plain text file containing UTF-8 > characters from httpd, but when viewing it from any > browser, it comes off as an ASCII file that needs to > be downloaded (so, those characters are garbled). > Is this du

[9fans] utf-8 text files from httpd

2009-10-18 Thread Akshat Kumar
I'm trying to put up a plain text file containing UTF-8 characters from httpd, but when viewing it from any browser, it comes off as an ASCII file that needs to be downloaded (so, those characters are garbled). Is this due to some behaviour of httpd? ak