LC_CTYPE=en_US.UTF-8 should be set for UTF-8. It has some bugs and
sometimes doesn't recognise a valid UTF-8 string.
http://img51.imageshack.us/img51/4591/201011072157261024x768s.png
diff -r 94c886b859a1 config.def.h
--- a/config.def.h Sun Oct 31 20:29:22 2010 +0100
+++ b/config.def.h Sun N
I removed the wchar_t completely, added some UTF-8 parsing functions.
No support for combining, bidi, doublecolumn etc. Markus Kuhn's UTF-8
stress test file is not working 100% correctly (the decoder works
however, even when reading bytes one by one).
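To illustrate what decoding "bytes one by one" can look like, here is a minimal sketch of a stateful decoder that is fed a single byte per call. It is not taken from the patch: the names Utf8State, utf8_feed, UTF8_MORE and UTF8_INVALID are hypothetical, and the sketch does not reject overlong encodings or surrogates.

```c
#include <stdint.h>

/* Hypothetical names, for illustration only; not from the patch. */
typedef struct {
    uint32_t cp;   /* code point bits accumulated so far */
    int need;      /* continuation bytes still expected */
} Utf8State;

#define UTF8_MORE    (-1)  /* sequence incomplete, feed another byte */
#define UTF8_INVALID (-2)  /* malformed input, state has been reset */

/* Feed one byte. Returns a code point (>= 0) once a sequence completes,
 * UTF8_MORE while a sequence is still open, UTF8_INVALID on bad input.
 * The offending byte is dropped, not re-examined. */
static long utf8_feed(Utf8State *st, unsigned char c)
{
    if (st->need == 0) {
        if (c < 0x80)
            return c;                                  /* plain ASCII */
        if ((c & 0xE0) == 0xC0) {
            st->cp = c & 0x1F; st->need = 1;           /* 110xxxxx */
        } else if ((c & 0xF0) == 0xE0) {
            st->cp = c & 0x0F; st->need = 2;           /* 1110xxxx */
        } else if ((c & 0xF8) == 0xF0) {
            st->cp = c & 0x07; st->need = 3;           /* 11110xxx */
        } else {
            return UTF8_INVALID;   /* stray continuation or bad lead byte */
        }
        return UTF8_MORE;
    }
    if ((c & 0xC0) != 0x80) {      /* expected a 10xxxxxx continuation */
        st->need = 0;
        return UTF8_INVALID;
    }
    st->cp = (st->cp << 6) | (c & 0x3F);
    if (--st->need > 0)
        return UTF8_MORE;
    return (long)st->cp;
}
```

A caller would keep one zero-initialised Utf8State per tty and pass each byte returned by read() to utf8_feed, drawing a glyph only when the return value is non-negative.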
diff -r 288747c60bc1 config.def.h
--- a/config
2010/11/14, cryptix :
> Hi,
>
> On 13.11.2010, at 22:53, Damian Okrasa wrote:
>> I removed the wchar_t completely, ...
>
> great! After a small adaption it seems to work flawlessly.
> In the form you posted, my compiler doesn't like the first constant B0.
>
> st
2010/11/15, Aurélien Aptel :
> On Sat, Nov 13, 2010 at 10:53 PM, Damian Okrasa wrote:
>> I removed the wchar_t completely, added some UTF-8 parsing functions.
>> No support for combining, bidi, doublecolumn etc. Markus Kuhn's UTF-8
>> stress test file is not working 1
Changed FONT and BOLDFONT, more fonts are selected.
Fixed a bug in ttyread; do ncurses apps work correctly?
st-utf8.diff
Description: Binary data
2010/11/18, Stefan Mark :
> On 17.11.2010 14:54, Damian Okrasa wrote:
>> Changed FONT and BOLDFONT, more fonts are selected.
>> Fixed a bug in ttyread; do ncurses apps work correctly?
>
> The cursor keys did not work in midnight commander (they did last
> version), and i
2010/11/19, Hiltjo Posthuma :
> I noticed in canstou():
>
> /* use this if your buffer is less than UTF_SIZ, it returns 1
>    if you can decode UTF-8 otherwise return 0 */
> static int canstou(char *s, int b) {
>     unsigned char c = *s;
>     int n;
>
I forgot to hg diff, here's the correct one.
st-fix.diff
Description: Binary data
Hey,
this patch replaces the current UTF-8 decoder with a new one, which is ~50
lines shorter and should be easier to understand. Parsing 5- and 6-byte
sequences, if necessary, requires only a trivial modification of the UTF_SIZ
constant and the utfbyte, utfmask, utfmin and utfmax arrays.
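
A rough sketch of how a table-driven decoder built around those names might look. Only UTF_SIZ, utfbyte, utfmask, utfmin, utfmax and utfdecode are named in this thread; the table contents, the decodebyte helper, the UTF_INVALID value and the function signature are assumptions made for illustration, not the patch itself.

```c
#include <stddef.h>

#define UTF_SIZ     4
#define UTF_INVALID 0xFFFD  /* assumption: replacement character on bad input */

/* Index 0 describes continuation bytes; index n (1..UTF_SIZ) describes the
 * lead byte of an n-byte sequence. Contents are assumed, not from the patch. */
static const unsigned char utfbyte[UTF_SIZ + 1] = {0x80, 0x00, 0xC0, 0xE0, 0xF0};
static const unsigned char utfmask[UTF_SIZ + 1] = {0xC0, 0x80, 0xE0, 0xF0, 0xF8};
static const long utfmin[UTF_SIZ + 1] = {0, 0, 0x80, 0x800, 0x10000};
static const long utfmax[UTF_SIZ + 1] = {0x10FFFF, 0x7F, 0x7FF, 0xFFFF, 0x10FFFF};

/* Classify one byte: store its table index in *type (UTF_SIZ + 1 if nothing
 * matches) and return its payload bits. Hypothetical helper. */
static long decodebyte(unsigned char c, int *type)
{
    for (*type = 0; *type < UTF_SIZ + 1; ++(*type))
        if ((c & utfmask[*type]) == utfbyte[*type])
            return c & (unsigned char)~utfmask[*type];
    return 0;
}

/* Decode one code point from s[0..len). Returns the number of bytes consumed,
 * or 0 if the buffer ends mid-sequence. *u is UTF_INVALID on any error. */
static size_t utfdecode(const char *s, long *u, size_t len)
{
    size_t i, n;
    int type;
    long cp;

    *u = UTF_INVALID;
    if (len == 0)
        return 0;
    cp = decodebyte((unsigned char)s[0], &type);
    if (type < 1 || type > UTF_SIZ)
        return 1;          /* stray continuation byte or invalid lead byte */
    n = (size_t)type;
    for (i = 1; i < n && i < len; ++i) {
        long bits = decodebyte((unsigned char)s[i], &type);
        if (type != 0)
            return i;      /* not a continuation byte: resynchronise here */
        cp = (cp << 6) | bits;
    }
    if (i < n)
        return 0;          /* incomplete sequence, wait for more input */
    if (cp < utfmin[n] || cp > utfmax[n] || (cp >= 0xD800 && cp <= 0xDFFF))
        return n;          /* overlong, out of range, or surrogate */
    *u = cp;
    return n;
}
```

In a layout like this, supporting longer sequences would mean raising UTF_SIZ and appending one entry per extra length to each of the four arrays, while the decoding loop stays unchanged.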
Regards,
Damian Okrasa
st.c | 196
utflen 181
utfencode 35837
utfdecode 405641
Sorry for the previous mail; the send button had focus when I tried to
insert a new line...
Those are numbers of function calls, for cat dwm
cat UTF-8-demo yields:
utflen 113
utfencode 8152
utfdecode 198346
So I think only utfdecode needs to be optimised, if necessary.
2014-03-21 19:00 GMT+01:00, Damian Okrasa
2014-03-24 9:43 GMT+01:00, Christoph Lohmann <2...@r-36.net>:
> The naming is wrong. It is just decoding »utf8« and can’t decode
> »utf16«. So: s,utf,utf8,g
Hey,
I have stripped all '8's, because there is UTF_SIZ instead of UTF8_SIZ.
Regards.
Damian Okrasa