Hello, We are trying to read data from cassandra via pig. The version of cassandra is 1.0.7 and pig is 0.9.0.
We get the following error when we try to load the data from the cassandra keyspace and columnfamily. [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2997: Unable to recreate exception from backed error: Error: org.apache.cassandra.utils.FBUtilities.hexToBytes(Ljava/lang/String;)[B Following is the pig script: register /var/pig-0.9.0/lib/pygmalion-1.0.0.jar; define FromCassandraBag org.pygmalion.udf.FromCassandraBag(); rows = LOAD 'cassandra://Keyspace/ColumnFamily' USING org.apache.cassandra.hadoop.pig.CassandraStorage() AS (key, columns: bag {T: tuple(columnname, value)}) ; DUMP rows; The Cassandra Columnfamily looks like: Key Validation Class: org.apache.cassandra.db.marshal.UTF8Type Default column value validator: org.apache.cassandra.db.marshal.BytesType Columns sorted by: org.apache.cassandra.db.marshal.UTF8Type Row cache size / save period in seconds / keys to save : 0.0/0/all Row Cache Provider: org.apache.cassandra.cache.ConcurrentLinkedHashCacheProvider Key cache size / save period in seconds: 200000.0/14400 GC grace seconds: 864000 Compaction min/max thresholds: 4/32 Read repair chance: 1.0 Replicate on write: false Bloom Filter FP chance: default Built indexes: [] Compaction Strategy: org.apache.cassandra.db.compaction.SizeTieredCompactionStrategy Has anyone come across this error and have a solution? Any help is appreciated. Thanks