Re: Storing values of mixed types in a list

2014-06-24 Thread Tuukka Mustonen
Unfortunately, I need to query by list items. That's why I'm running Cassandra 2.1rc1 (which offers secondary indexes for collections). I'm also studying Dynamo; it seems to be somewhat more dynamic by nature and allows mixed-type lists. As I understand it, Cassandra also supports dynamic schemas, but

Re: EC2 cassandra cluster node address problem

2014-06-24 Thread Andrey Ilinykh
You can set rpc_address to 0.0.0.0; then it will listen on all interfaces. You also have to modify the security group settings to allow incoming connections on port 9160. But it is a really bad idea: this way you open your cluster to the whole world. An ssh tunnel is the best way. On Tue, Jun 24, 2014 at

Re: Use Cassandra thrift API with collection type

2014-06-24 Thread Huiliang Zhang
Yes, I realized the way to use CQL. I checked how map data is represented by using cassandra-cli. For each element in the map, it uses the key as part of the column name and the value as the column value. I just cannot insert this by using the thrift API because I already defined a CompositeType co

Re: EC2 cassandra cluster node address problem

2014-06-24 Thread Huiliang Zhang
Thanks. Is there a way to configure Cassandra to use elastic ip instead of private ip? On Tue, Jun 24, 2014 at 9:29 PM, Andrey Ilinykh wrote: > Cassandra knows nothing about elastic ip. You have to use ssh tunnel or > run your client on ec2 instance. > > Thank you, > Andrey > > > On Tue, Jun

Re: EC2 cassandra cluster node address problem

2014-06-24 Thread Andrey Ilinykh
Cassandra knows nothing about elastic ip. You have to use ssh tunnel or run your client on ec2 instance. Thank you, Andrey On Tue, Jun 24, 2014 at 8:55 PM, Huiliang Zhang wrote: > Hi, > > I am using Cassandra on EC2 instances. My cassandra always returns private > ips of the instances to the

EC2 cassandra cluster node address problem

2014-06-24 Thread Huiliang Zhang
Hi, I am using Cassandra on EC2 instances. My cassandra always returns private ips of the instances to the thrift program. Then the program cannot connect to the private ips. I already changed the rpc_address: elastic ip rpc_address: elastic ip Then I restarted the cassandra cluster. But the sys

Re: Does the default LIMIT applies to automatic paging?

2014-06-24 Thread DuyHai Doan
Yes. And I advise setting fetchSize to a smaller value than 10 000. 1000 is a good start. As long as there are still results, the iterator will fetch data for you in batches of "fetchSize". On Tue, Jun 24, 2014 at 9:03 PM, ziju feng wrote: > Does that mean the iterator will give me all the data ins
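The batching behaviour described above can be sketched in plain Python. This is only an in-memory simulation: `fetch_page` and `iterate_all` are hypothetical names standing in for a server round trip and a driver-side iterator, not driver API. It assumes the server hands back a paging state while more rows remain.

```python
from typing import Iterator, List, Optional, Tuple

# (rows of this page, paging state for the next page or None when done)
Page = Tuple[List[int], Optional[int]]


def fetch_page(data: List[int], fetch_size: int, state: Optional[int]) -> Page:
    """Hypothetical stand-in for one round trip that returns at most fetch_size rows."""
    start = state or 0
    end = start + fetch_size
    next_state = end if end < len(data) else None
    return data[start:end], next_state


def iterate_all(data: List[int], fetch_size: int) -> Iterator[int]:
    """Yield every row, fetching a new page only when the previous one is consumed."""
    state: Optional[int] = None
    while True:
        rows, state = fetch_page(data, fetch_size, state)
        yield from rows
        if state is None:
            break


# 2500 rows with a fetch size of 1000 take three round trips,
# but the caller simply sees one continuous stream of rows.
all_rows = list(iterate_all(list(range(2500)), fetch_size=1000))
```

The point of the sketch is that the fetch size bounds how many rows travel per round trip, not how many rows the iterator eventually returns.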

Re: Does the default LIMIT applies to automatic paging?

2014-06-24 Thread ziju feng
Does that mean the iterator will give me all the data instead of 1 rows? On Mon, Jun 23, 2014 at 10:20 PM, DuyHai Doan wrote: > With the Java Driver, set the fetchSize and use ResultSet.iterator > On 24 June 2014 01:04, "ziju feng" wrote: > > Hi All, >> >> I have a wide row table that

Re: Storing values of mixed types in a list

2014-06-24 Thread DuyHai Doan
"Jeremy, with blob field (ByteBuffer), I can query exact matches (just encode the value in query), but greater/less than queries would not work. Any sort of serialization kills "native" ways to query data" --> Not necessarily. You still use "normal" types (uuid, string, timestamp,...) for clusterin

Re: Storing values of mixed types in a list

2014-06-24 Thread Tuukka Mustonen
What if I need to query by list items? 1. Jeremy, with blob field (ByteBuffer), I can query exact matches (just encode the value in query), but greater/less than queries would not work. Any sort of serialization kills "native" ways to query data 2. Even with user defined types, I would need to def
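Point 1 can be illustrated with a small Python sketch. The encodings here (big-endian 64-bit integers and JSON text) are assumptions chosen for illustration, not anything the thread prescribes. An exact match still works because the encoding is deterministic, but comparing the serialized forms no longer follows numeric order.

```python
import json
import struct


def encode_int(value: int) -> bytes:
    """Illustrative encoding: signed 64-bit big-endian integer."""
    return struct.pack(">q", value)


# Exact match survives serialization: equal values give equal bytes.
exact_match_works = encode_int(42) == encode_int(42)

# Range comparison does not: -1 encodes as ff..ff, which sorts after 00..01.
numeric_order = -1 < 1
byte_order = encode_int(-1) < encode_int(1)

# The same happens with JSON text: "9" sorts after "10" lexicographically.
json_order = json.dumps(9) < json.dumps(10)
```

Here `numeric_order` is true while `byte_order` and `json_order` are both false, which is why greater-than and less-than queries over the serialized value give wrong answers.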

Re: Can I call getBytes on a text column to get the raw (already encoded UTF8)

2014-06-24 Thread Kevin Burton
Yes… I confirmed that getBytesUnsafe works… I also have a unit test for it so if cassandra ever changes anything we'll pick it up. One point in your above code. I still think charsets are behind a synchronized code block. So your above code wouldn't be super fast on multi-core machines. I usua

Consistency level used when applying atomic batches

2014-06-24 Thread John Sumsion
The atomic batches feature is all about moving the multi-statement application to the server side to avoid having to worry about retry logic on the client side. I'm glad that the client doesn't have to worry about it. An earlier thread about consistency level and atomic batches broke the execu

Re: Storing values of mixed types in a list

2014-06-24 Thread Pavel Kogan
1) You can use a list of strings that are serialized JSON, or use ByteBuffer with your own serialization as Jeremy suggested. 2) Use Cassandra 2.1 (not officially released yet), where there is a new feature: user-defined types. Pavel On Tue, Jun 24, 2014 at 9:18 AM, Jeremy Jongsma wrote: > Use
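A minimal sketch of option 1, assuming each list element is stored as its own JSON string (for example in a list of text values). The helper names are hypothetical, and only the standard `json` module is used.

```python
import json
from typing import List, Union

Item = Union[int, float, str, bool]


def to_json_strings(items: List[Item]) -> List[str]:
    """Serialize each item separately so the list can be stored as strings."""
    return [json.dumps(item) for item in items]


def from_json_strings(strings: List[str]) -> List[Item]:
    """Restore the original values, including their types."""
    return [json.loads(s) for s in strings]


mixed = [42, "hello", True, 3.5]
stored = to_json_strings(mixed)
restored = from_json_strings(stored)
```

Because JSON keeps numbers, strings and booleans distinct, the round trip preserves the type of every element without a separate type column.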

Re: How to perform Range Queries in Cassandra

2014-06-24 Thread Jeremy Jongsma
You'd be better off using external indexing (ElasticSearch or Solr); Cassandra isn't really designed for this sort of querying. On Jun 24, 2014 3:09 AM, "Mike Carter" wrote: > Hello! > > > I'm a beginner in C* and I'm quite struggling with it. > > I’d like to measure the performance of some Cassa

Re: Storing values of mixed types in a list

2014-06-24 Thread Jeremy Jongsma
Use a ByteBuffer value type with your own serialization (we use protobuf for complex value structures) On Jun 24, 2014 5:30 AM, "Tuukka Mustonen" wrote: > Hello, > > I need to store a list of mixed types in Cassandra. The list may contain > numbers, strings and booleans. So I would need something
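As a rough sketch of "your own serialization", the following uses a hand-rolled one-byte type tag in place of protobuf. The tag values and function names are assumptions made for illustration. Each list element becomes one blob, so no length prefix is needed.

```python
import struct
from typing import Union

Item = Union[int, float, str, bool]

# Hypothetical type tags, one byte each.
TAG_BOOL, TAG_INT, TAG_FLOAT, TAG_STR = 0, 1, 2, 3


def encode_item(item: Item) -> bytes:
    """Encode one value as a type tag followed by its payload."""
    if isinstance(item, bool):  # must come before int: bool is a subclass of int
        return bytes([TAG_BOOL, 1 if item else 0])
    if isinstance(item, int):
        return bytes([TAG_INT]) + struct.pack(">q", item)
    if isinstance(item, float):
        return bytes([TAG_FLOAT]) + struct.pack(">d", item)
    if isinstance(item, str):
        return bytes([TAG_STR]) + item.encode("utf-8")
    raise TypeError(f"unsupported type: {type(item).__name__}")


def decode_item(blob: bytes) -> Item:
    """Invert encode_item by dispatching on the leading tag byte."""
    tag, payload = blob[0], blob[1:]
    if tag == TAG_BOOL:
        return payload[0] == 1
    if tag == TAG_INT:
        return struct.unpack(">q", payload)[0]
    if tag == TAG_FLOAT:
        return struct.unpack(">d", payload)[0]
    if tag == TAG_STR:
        return payload.decode("utf-8")
    raise ValueError(f"unknown tag: {tag}")


blobs = [encode_item(v) for v in [7, "seven", False, 7.5]]
values = [decode_item(b) for b in blobs]
```

A schema-based format such as protobuf is the sturdier choice for complex structures; the tag byte here just shows the minimum needed to recover the type on read.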

Re: Adding large text blob causes read timeout...

2014-06-24 Thread Jonathan Haddad
Can you run your query in the cli after setting "tracing on"? On Mon, Jun 23, 2014 at 11:32 PM, DuyHai Doan wrote: > Yes but adding the extra one ends up by * 1000. The limit in CQL3 > specifies the number of logical rows, not the number of physical columns in > the storage engine > On 24 June 20

Re: Does the default LIMIT applies to automatic paging?

2014-06-24 Thread Laing, Michael
And with Python, use future.has_more_pages and future.start_fetching_next_page(). On Tue, Jun 24, 2014 at 1:20 AM, DuyHai Doan wrote: > With the Java Driver, set the fetchSize and use ResultSet.iterator > On 24 June 2014 01:04, "ziju feng" wrote: > > Hi All, >> >> I have a wide row table th

Storing values of mixed types in a list

2014-06-24 Thread Tuukka Mustonen
Hello, I need to store a list of mixed types in Cassandra. The list may contain numbers, strings and booleans. So I would need something like list. Is this possible in Cassandra and if not, what workaround would you suggest for storing a list of mixed type items? I sketched a few (using a list pe

How to perform Range Queries in Cassandra

2014-06-24 Thread Mike Carter
Hello! I'm a beginner in C* and I'm struggling with it quite a bit. I’d like to measure the performance of some Cassandra range queries. The idea is to execute multidimensional range queries on Cassandra. E.g. there is a given table of 1 million rows with 10 columns and I'd like to execute some queries l

Re: Can I call getBytes on a text column to get the raw (already encoded UTF8)

2014-06-24 Thread Robert Stupp
You can use getBytesUnsafe on the UTF8 column -- Sent from my iPhone > On 24.06.2014 at 09:13, Olivier Michallat > wrote: > > Assuming we're talking about the DataStax Java driver: > > getBytes will throw an exception, because it validates that the column is of > type BLOB. But you can use

Re: Can I call getBytes on a text column to get the raw (already encoded UTF8)

2014-06-24 Thread Olivier Michallat
Assuming we're talking about the DataStax Java driver: getBytes will throw an exception, because it validates that the column is of type BLOB. But you can use getBytesUnsafe: ByteBuffer b = row.getBytesUnsafe("aTextColumn"); // if you want to check it: Charset.forName("UTF-8").decode(