It appears that this is due to the browser not knowing that the downloaded file is UTF-8; adding <meta charset="utf-8"> to the page fixes the problem.
So this is not a Go / "net/http" issue. On Sun, 2018-12-30 at 11:05 +1030, Dan Kortschak wrote: > I am putting together a tiny tool to archive the go.science G+ group > for future archeology. However, I have run into an issue that I don't > understand and so don't know how to fix. > > The code is here: https://play.golang.org/p/1vVm5dvVueS > > This takes in a the Takeout JSON for the community's posts and then > just gets the page. The intention is to extract the post text and > comments into JSON since Takeout does not give you all the post > content, just the metadata. > > What I am seeing that I don't understand is the following. Using my > browser at this page (https://plus.google.com/+DanKortschak/posts/Gqc > ss > cVQuoW) I see properly encoded text... > > ``` > Raúl Mera expressed concerns about inter-convertibility between > matrices in go.matrix and biogo/matrix. This, in conjunction with > some > restrictions I found on writing matrix code with biogo/matrix that > prevented me from using blas effectively, prompted me to create an > unsafe matrix type for Dense matrices that greatly facilitates > interconversion between the two packages. > > type UnsafeDense struct { > Rows, Cols int > Data []float64 > Stride int > } > > <snip> > ``` > > but in the page that is retrieved, some spaces and diacritic- > containing > runes are rendered incorrectly. > > ``` > Raúl Mera expressed concerns about inter-convertibility between > matrices in go.matrix and biogo/matrix. This, in conjunction with > some > restrictions I found on writing matrix code with biogo/matrix that > prevented me from using blas effectively, prompted me to create an > unsafe matrix type for Dense matrices that greatly facilitates > interconversion between the two packages. > >        type UnsafeDense struct { >                Rows, Cols int >                Data      []float64 >                Stride    int >        } > > <snip> > ``` > > > What I think is happening is that I am being sent non-UTF-8 encoded > data when performing the GET in my code above. Is this likely to be > the > case and if so, how do I communicate to the server that I want UTF-8? > > thanks > Dan > -- You received this message because you are subscribed to the Google Groups "golang-nuts" group. To unsubscribe from this group and stop receiving emails from it, send an email to golang-nuts+unsubscr...@googlegroups.com. For more options, visit https://groups.google.com/d/optout.