Thanks, I can reproduce the issue with that, and I should be able to look
into it tomorrow.  FWIW, I believe the issue is server-side, not in the
driver.  I may be able to suggest a workaround once I figure out what's
going on.

On Mon, Nov 17, 2014 at 6:02 AM, Erik Forsberg <forsb...@opera.com> wrote:

> On 2014-11-17 09:56, Erik Forsberg wrote:
> > On 2014-11-15 01:24, Tyler Hobbs wrote:
> >> What version of cassandra did you originally create the column family
> >> in?  Have you made any schema changes to it through cql or
> >> cassandra-cli, or has it always been exactly the same?
> >
> > Oh that's a tough question given that the cluster has been around since
> > 2011. So CF was probably created in Cassandra 0.7 or 0.8 via thrift
> > calls from pycassa, and I don't think there has been any schema changes
> > to it since.
>
> Actually, I don't think it matters. I created a minimal repeatable set
> of python code (see below). Running that against a 2.0.11 server,
> creating fresh keyspace and CF, then insert some data with
> thrift/pycassa, then trying to extract the data that has a different
> validation class, the python-driver and cqlsh bails out.
>
> cqlsh example after running the below script:
>
> cqlsh:badcql> select * from "Users" where column1 = 'default_account_id'
> ALLOW FILTERING;
>
> value "\xf9\x8bu}!\xe9C\xbb\xa7=\xd0\x8a\xff';\xe5" (in col 'value')
> can't be deserialized as text: 'utf8' codec can't decode byte 0xf9 in
> position 0: invalid start byte
>
> cqlsh:badcql> select * from "Users" where column1 = 'date_created' ALLOW
> FILTERING;
>
> value '\x00\x00\x00\x00Ti\xe0\xbe' (in col 'value') can't be
> deserialized as text: 'utf8' codec can't decode bytes in position 6-7:
> unexpected end of data
>
>
> So the question remains - how do I work with this data from cqlsh and /
> or the python driver?
>
> Thanks,
> \EF
>
> --repeatable example--
> #!/usr/bin/env python
>
> # Run this in virtualenv with pycassa and cassandra-driver installed via
> pip
> import pycassa
> import cassandra
> import calendar
> import traceback
> import time
> from uuid import uuid4
>
> keyspace = "badcql"
>
> sysmanager = pycassa.system_manager.SystemManager("localhost")
> sysmanager.create_keyspace(keyspace,
> strategy_options={'replication_factor':'1'})
> sysmanager.create_column_family(keyspace, "Users",
> key_validation_class=pycassa.system_manager.LEXICAL_UUID_TYPE,
>
> comparator_type=pycassa.system_manager.ASCII_TYPE,
>
> default_validation_class=pycassa.system_manager.UTF8_TYPE)
> sysmanager.create_index(keyspace, "Users", "username",
> pycassa.system_manager.UTF8_TYPE)
> sysmanager.create_index(keyspace, "Users", "email",
> pycassa.system_manager.UTF8_TYPE)
> sysmanager.alter_column(keyspace, "Users", "default_account_id",
> pycassa.system_manager.LEXICAL_UUID_TYPE)
> sysmanager.create_index(keyspace, "Users", "active",
> pycassa.system_manager.INT_TYPE)
> sysmanager.alter_column(keyspace, "Users", "date_created",
> pycassa.system_manager.LONG_TYPE)
>
> pool = pycassa.pool.ConnectionPool(keyspace, ['localhost:9160'])
> cf = pycassa.ColumnFamily(pool, "Users")
>
> user_uuid = uuid4()
>
> cf.insert(user_uuid, {'username':'test_username', 'auth_method':'ldap',
> 'email':'t...@example.com', 'active':1,
>
> 'date_created':long(calendar.timegm(time.gmtime())),
> 'default_account_id':uuid4()})
>
> from cassandra.cluster import Cluster
> cassandra_cluster = Cluster(["localhost"])
> cassandra_session = cassandra_cluster.connect(keyspace)
> print "username", cassandra_session.execute('SELECT value from "Users"
> where key = %s and column1 = %s', (user_uuid, 'username',))
> print "email", cassandra_session.execute('SELECT value from "Users"
> where key = %s and column1 = %s', (user_uuid, 'email',))
> try:
>     print "default_account_id", cassandra_session.execute('SELECT value
> from "Users" where key = %s and column1 = %s', (user_uuid,
> 'default_account_id',))
> except Exception as e:
>     print "Exception trying to get default_account_id",
> traceback.format_exc()
>     cassandra_session = cassandra_cluster.connect(keyspace)
>
> try:
>     print "active", cassandra_session.execute('SELECT value from "Users"
> where key = %s and column1 = %s', (user_uuid, 'active',))
> except Exception as e:
>     print "Exception trying to get active", traceback.format_exc()
>     cassandra_session = cassandra_cluster.connect(keyspace)
>
> try:
>     print "date_created", cassandra_session.execute('SELECT value from
> "Users" where key = %s and column1 = %s', (user_uuid, 'date_created',))
> except Exception as e:
>     print "Exception trying to get date_created", traceback.format_exc()
> -- end of example --
>



-- 
Tyler Hobbs
DataStax <http://datastax.com/>

Reply via email to