Re: appearance of text different in riak different than original xml data

2012-04-07 Thread Wes James
Ok - that's good to know. Thanks, Wes On Sat, Apr 7, 2012 at 9:32 AM, Sean Cribbs wrote: > Wes, > > Also, if you're using curl to load things into Riak, be sure to use > --data-binary with your payload, which will not try to convert multibyte > characters or line-terminators. > > On Sat, Apr 7,

Re: appearance of text different in riak different than original xml data

2012-04-07 Thread Sean Cribbs
Wes, Also, if you're using curl to load things into Riak, be sure to use --data-binary with your payload, which will not try to convert multibyte characters or line-terminators. On Sat, Apr 7, 2012 at 11:21 AM, Wes James wrote: > I found it. I thought if any web site might be able to handle uni

Re: appearance of text different in riak different than original xml data

2012-04-07 Thread Wes James
I found it. I thought if any web site might be able to handle unicode, it would be erlang.org, so I went and grabbed some of the header text: http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd'> test and it works correctly now. thanks On Fri, Apr 6, 2012 at 3:18 PM, Kresten Krab Tho

Re: appearance of text different in riak different than original xml data

2012-04-06 Thread Kresten Krab Thorup
It looks like you may have missed specifying the charset when importing your data; could that be the case? You need to specify the charset when importing 8-bit text. It looks like your xml is utf-8 encoded, so it should be imported using something like this: curl -H 'Content-Type: text/html;ch

appearance of text different in riak different than original xml data

2012-04-06 Thread Wes James
I imported many records, one of which looks like this: 0 Ekologie lučních porostů (A) Rychnovská, Milena, Emilie Balátová-Tuláčková, Blanka Úlehlová, Jaroslav Pelikán 1985 Academia - amazon 5/22/09 Category: Ecology (Y) 8.00 10.00 with bin/search-cmd solr books books.xml Notice the characte