Re: Data corruption, invalid UTF-8 bytes

2018-01-03 Thread Stefano Ortolani
Little update. I've managed to compute the token, and I can indeed SELECT the row from CQLSH. Interestingly enough, if I use CQLSH I do not get the exception (even if the string is printed out). I am now wondering whether, instead of a data corruption, the error is related to the reading path use

Data corruption, invalid UTF-8 bytes

2018-01-02 Thread Stefano Ortolani
Hi all, apparently the year started with a node (version 3.0.15) exhibiting some data corruption (discovered by a spark job enumerating all keys). The exception is attached below. The invalid string is a partition key, and it is supposed to be a file name. If I manually decode the bytes I get so