[sphinx-users] Re: SphinxError: Can't decode unicode within a doc

Guenter Milde Thu, 18 Apr 2013 00:07:03 -0700

On 2013-04-17, Conway M wrote:


> I am trying to compile the docs of Pandas
> <https://github.com/pydata/pandas>but I am unable to get Sphinx to
> compile a document with some unicode.  Is there some flag I need to
> specify to let Sphinx correctly build documents with unicode in them?

The default input encoding is 'utf8', so if your rst document is
utf8-encoded, it should be OK.

If not, please post more details (used encoding, docutils settings).
A minimal example (the part of the input file that coused the error) may
help further.

> In this case, I don't want Sphinx to decode the text.

Docutils/Sphinx will always decode the input into an "unicode" instance
and encode the output. All inner processing is done on "unicode" (or
derived) objects.

...

>> *  File "/usr/local/lib/python2.7/dist-packages/sphinx/environment.py", 
>> line 609, in read_doc
>>     raise SphinxError(str(err))
>> *SphinxError: 'utf8' codec can't decode byte 0xe4 in position 36: invalid 
>> continuation byte
>> *> 
>> /usr/local/lib/python2.7/dist-packages/sphinx/environment.py(609)read_doc()
>> -> raise SphinxError(str(err))
>> (Pdb)

It looks like the input file is either broken or not in utf8 encoding (which
then?).

It looks like the input decoding is not done by docutils.io, but by the
Sphinx "wrapper" - this means you must tell Sphinx about the correct
"source_encoding"
http://sphinx-doc.org/config.html#confval-source_encoding.  
Setting the Docutils config setting "input-encoding"
http://docutils.sourceforge.net/docs/user/config.html#input-encoding will
not help.

Günter

-- 
You received this message because you are subscribed to the Google Groups 
"sphinx-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/sphinx-users?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.

[sphinx-users] Re: SphinxError: Can't decode unicode within a doc

Reply via email to