On 10/18/2017 09:02 PM, Suresh Babu Mallampati wrote:
> cqlsh: Connection error: ('Unable to connect to any servers',
> {'127.0.0.1': UnicodeDecodeError('utf8', '\x00\x00\x00\xf0\x8f\x0e', 3,
> 5, 'unexpected end of data')})

This does appear to be invalid UTF-8, so the error is valid. Perhaps you
could provide a method to reproduce, or you could just clean up that
data if you know the primary key(s)?

Python agrees:
>>> print('\x00\x00\x00\xf0\x8f\x0e'.decode('utf-8'))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib/python2.7/encodings/utf_8.py", line 16, in decode
    return codecs.utf_8_decode(input, errors, True)
UnicodeDecodeError: 'utf8' codec can't decode bytes in position 3-4:
unexpected end of data

From:
https://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder

----
Decoder output:

Byte number 1 is decimal 0, hex 0x00, octal \000, binary 00000000

U+0000  <control>
        = NULL


Byte number 2 is decimal 0, hex 0x00, octal \000, binary 00000000

U+0000  <control>
        = NULL


Byte number 3 is decimal 0, hex 0x00, octal \000, binary 00000000

U+0000  <control>
        = NULL


Byte number 4 is decimal 240, hex 0xF0, octal \360, binary 11110000
This is the first byte of a 4 byte sequence.

Byte number 5 is decimal 143, hex 0x8F, octal \217, binary 10001111
This is continuation byte 1, expecting 2 more.

Byte number 6 is decimal 14, hex 0x0E, octal \016, binary 00001110
Previous UTF-8 multibyte sequence incomplete, earlier bytes dropped.

U+000E  <control>
        = SHIFT OUT
        * known as LOCKING-SHIFT ONE in 8-bit environments
----

-- 
Kind regards,
Michael

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org
For additional commands, e-mail: user-h...@cassandra.apache.org

Reply via email to