Re: r994 UTF8 addition broke latin-1 characters (ISO-8859-1) - WAS: Re: Unicode, accented characters

2009-03-13 Thread Christophe Grand
here is some background info on the change: http://groups.google.com/group/clojure/browse_thread/thread/123ef17d7c650018/e1da76a4a273aa5a max3000 a écrit : > The default character set on WinXP (which I use) is windows-1252 > (cp1252). Check out http://www.rgagnon.com/javadetails/java-0505.html. >

Re: r994 UTF8 addition broke latin-1 characters (ISO-8859-1) - WAS: Re: Unicode, accented characters

2009-03-13 Thread max3000
I paste below what I got. It's doing something for sure, but just for *out* I think. Thanks. user=> (println "abcd") ����abcd nil user=> (binding [*in* (new LineNumberingPushbackReader (new InputStreamReader System/in)) *out* (new OutputStreamWriter System/out)] (println

Re: r994 UTF8 addition broke latin-1 characters (ISO-8859-1) - WAS: Re: Unicode, accented characters

2009-03-13 Thread Michael Wood
On Fri, Mar 13, 2009 at 8:46 AM, max3000 wrote: > > Any news on this item? Does what I'm saying make sense? > > I understand most people who use clojure are probably English-speaking > and couldn't care less about internationalization, but this has to be > addressed if clojure is to get any sembl

Re: r994 UTF8 addition broke latin-1 characters (ISO-8859-1) - WAS: Re: Unicode, accented characters

2009-03-12 Thread max3000
Any news on this item? Does what I'm saying make sense? I understand most people who use clojure are probably English-speaking and couldn't care less about internationalization, but this has to be addressed if clojure is to get any semblance of semi-mainstream adoption. In fact, one of the reason

Re: r994 UTF8 addition broke latin-1 characters (ISO-8859-1) - WAS: Re: Unicode, accented characters

2009-03-07 Thread max3000
The default character set on WinXP (which I use) is windows-1252 (cp1252). Check out http://www.rgagnon.com/javadetails/java-0505.html. If I were to change my source file encodings to UTF-8 that would probably get me some mileage. Of course, I would have to use an editor that supports it and not

Re: r994 UTF8 addition broke latin-1 characters (ISO-8859-1) - WAS: Re: Unicode, accented characters

2009-03-07 Thread Toralf Wittner
On Sat, 2009-03-07 at 05:43 -0800, max3000 wrote: > Ok, so I ended up doing this in my code: > > String resource = "/exmentis/rules_main.clj"; > InputStream is = getClass().getResourceAsStream(resource); > String script = ... read in is as a String (like slurp) ... > Strin

Re: r994 UTF8 addition broke latin-1 characters (ISO-8859-1) - WAS: Re: Unicode, accented characters

2009-03-07 Thread max3000
Ok, so I ended up doing this in my code: String resource = "/exmentis/rules_main.clj"; InputStream is = getClass().getResourceAsStream(resource); String script = ... read in is as a String (like slurp) ... StringReader r = new StringReader(script); clojure.lang.Compi

Re: r994 UTF8 addition broke latin-1 characters (ISO-8859-1) - WAS: Re: Unicode, accented characters

2009-03-06 Thread max3000
Some more information: In REPL, everything seems fine: exmentis=> "ààà" "ààà" exmentis=> (def a "") #'exmentis/a exmentis=> a "" exmentis=> (println a) nil exmentis=> (. System/out println a) nil However, when I import a Java class: public class Application { public static

r994 UTF8 addition broke latin-1 characters (ISO-8859-1) - WAS: Re: Unicode, accented characters

2009-03-06 Thread max3000
There is definitely a bug. In r994 (Aug 07, 2008) UTF8 encoding was added to *in*, *out* and *err*. This messes up the Repl (and the Reader in general) as discussed above. Case in point, everything works fine when I go in the code and modify RT.java as follows: final static public Var OUT =

Re: Unicode, accented characters

2009-03-06 Thread rzeze...@gmail.com
On Mar 6, 5:58 pm, max3000 wrote: > > I don't really want to use the SVN version because I'm developing an > application and can really do without the (normal) instabilities that > come with development builds. > FYI, you may want to consider using SVN for now because there have been breaking ch

Re: Unicode, accented characters

2009-03-06 Thread Kevin Downey
Jline is known to have issues with unicode. On Fri, Mar 6, 2009 at 2:58 PM, max3000 wrote: > > I'm getting similar results without jline.ConsoleRunner. Also as I > mentioned I use RT.loadResourceScript and get the same results. > > However, I'm using the clojure release from 2008-12-17 (the only

Re: Unicode, accented characters

2009-03-06 Thread max3000
I'm getting similar results without jline.ConsoleRunner. Also as I mentioned I use RT.loadResourceScript and get the same results. However, I'm using the clojure release from 2008-12-17 (the only official version I see at http://code.google.com/p/clojure/downloads/list.) Could that make a differe

Re: Unicode, accented characters

2009-03-06 Thread Meikel Brandmeyer
Hi, Am 06.03.2009 um 23:31 schrieb max3000: I'm trying to output accented characters from clojure. Actually, I'm trying to call setToolTipText on a JComponent with some unicode string. No problems doing so from Java, but with clojure I'm hitting a wall. In REPL: exmentis=> "àéôö" "&→∟↔" exment

Unicode, accented characters

2009-03-06 Thread max3000
Hi, I'm trying to output accented characters from clojure. Actually, I'm trying to call setToolTipText on a JComponent with some unicode string. No problems doing so from Java, but with clojure I'm hitting a wall. In REPL: exmentis=> "àéôö" "&→∟↔" exmentis=> \u00f4 \├┤ exmentis=> \u00c0 \├Ç Ok,