Re: Bad Request: Type error: cannot assign result of function token (type bigint) to id (type int)

2014-06-05 Thread Jonathan Haddad
Sorry, the datastax docs are actually a bit better: http://www.datastax.com/documentation/cql/3.0/cql/cql_using/paging_c.html Jon On Thu, Jun 5, 2014 at 10:46 PM, Jonathan Haddad wrote: > You should read through the token docs, it has examples and > specifications: http://cassandra.apache.org/

Re: Bad Request: Type error: cannot assign result of function token (type bigint) to id (type int)

2014-06-05 Thread Jonathan Haddad
You should read through the token docs, it has examples and specifications: http://cassandra.apache.org/doc/cql3/CQL.html#tokenFun On Thu, Jun 5, 2014 at 10:22 PM, Kevin Burton wrote: > I'm building a new schema which I need to read externally by paging > through the result set. > > My understa

Bad Request: Type error: cannot assign result of function token (type bigint) to id (type int)

2014-06-05 Thread Kevin Burton
I'm building a new schema which I need to read externally by paging through the result set. My understanding from reading the documentation , and this list, is that I can do that but I need to use the token() function. Only it doesn't work. Here's a reduction: create table test_paging ( id int

Re: CQLSSTableWriter memory leak

2014-06-05 Thread Xu Zhongxing
Is writing too many rows to a single partition the cause of memory consumption? What I want to achieve is this: say I have 5 partition ID. Each corresponds to 50 million IDs. Given a partition ID, I need to get its corresponding 50 million IDs. Is there another way to design the schema to avoi

Sizing a new Cassandra cluster

2014-06-05 Thread Koppel, Jeremy
I have been able to find lots of general information about sizing each node in a new Cassandra cluster, but have not come across any specific recommendations about the total size and configuration of the cluster (the number of nodes required per data center, the number of data centers, throughpu

Re: Consolidating records and TTL

2014-06-05 Thread Charlie Mason
Hi All, Thanks for the replies. These detailed explanations are extremely useful. The are much appreciated! One other question that came to mind with this. Is there any way I can use a TTLs to keep the old account entries around for a fixed length of time rather than deleting them immediately? Th

Re: High latency on 5 node Cassandra Cluster

2014-06-05 Thread Arup Chakrabarti
Yes, I meant RF: 2,2,1. We have the following in our network topo config: # cass08: us-west-1c private.ip=DC-A:1 public.ip=DC-A:1 # cass09: us-west-1b private.ip=DC-A:2 public.ip=DC-A:2 # cass10: Linode-Fremont-CA public.ip=DC-C:1 # cass11: us-west-2c private.ip=DC-B:1 public.ip=DC-B:1

Re: VPC AWS

2014-06-05 Thread Aiman Parvaiz
Cool, thanks again for this. On Thu, Jun 5, 2014 at 11:51 AM, Michael Theroux wrote: > You can have a ring spread across EC2 and the public subnet of a VPC. > That is how we did our migration. In our case, we simply replaced the > existing EC2 node with a new instance in the public VPC, resto

Re: VPC AWS

2014-06-05 Thread Michael Theroux
You can have a ring spread across EC2 and the public subnet of a VPC.  That is how we did our migration.  In our case, we simply replaced the existing EC2 node with a new instance in the public VPC, restored from a backup taken right before the switch. -Mike

Re: VPC AWS

2014-06-05 Thread Aiman Parvaiz
Thanks for this info Michael. As far as restoring node in public VPC is concerned I was thinking ( and I might be wrong here) if we can have a ring spread across EC2 and public subnet of a VPC, this way I can simply decommission nodes in Ec2 as I gradually introduce new nodes in public subnet of VP

Re: VPC AWS

2014-06-05 Thread Michael Theroux
The implementation of moving from EC2 to a VPC was a bit of a juggling act.   Our motivation was two fold: 1) We were running out of static IP addresses, and it was becoming increasingly difficult in EC2 to design around limiting the number of static IP addresses to the number of public IP addr

Re: Cassandra 2.0 unbalanced ring with vnodes after adding new node

2014-06-05 Thread Marcelo Elias Del Valle
Actually, I have the same doubt. The same happens to me, but I guess it's because of lack of knowledge in Cassandra vnodes, somehow... I just added 3 nodes to my old 2 nodes cluster, now I have a 5 nodes cluster. As rows should be in a node calculated by HASH / number of nodes, adding a new node

Re: nodetool move seems slow

2014-06-05 Thread Jason Tyler
Hi Rob, THX for you response and link to the issue. The move did complete after a restart! Cheers, ~Jason *** From: Robert Coli mailto:rc...@eventbrite.com>> Reply-To: "user@cassandra.apache.org" mailto:user@cassandra.apache.org>> Date: Wednesday, June 4,

Re: migration to a new model

2014-06-05 Thread Marcelo Elias Del Valle
Michael, I will try to test it up to tomorrow and I will let you know all the results. Thanks a lot! Best regards, Marcelo. 2014-06-04 22:28 GMT-03:00 Laing, Michael : > BTW you might want to put a LIMIT clause on your SELECT for testing. -ml > > > On Wed, Jun 4, 2014 at 6:04 PM, Laing, Micha

RE: Consolidating records and TTL

2014-06-05 Thread James Campbell
Thanks for creating and opening the discussion on this use case. I have been evaluating Cassandra for a very similar problem, but with the small twist that I'd like to roll up the ledger entries into the aggregated 'account' information on a regular basis to ensure that even rarely-read (but of

Re: CQLSSTableWriter memory leak

2014-06-05 Thread Jack Krupansky
How many rows (primary key values) are you writing for each partition of the primary key? I mean, are there relatively few, or are these very wide partitions? Oh, I see! You’re writing 50,000,000 rows to a single partition! My, that IS ambitious. -- Jack Krupansky From: Xu Zhongxing Sent: Th

Migration 1.2.14 to 2.0.8 causes "Tried to create duplicate hard link" at startup

2014-06-05 Thread Tom van den Berge
Hi, I'm trying to migrate a development cluster from 1.2.14 to 2.0.8. When starting up 2.0.8, I'm seeing the following error in the logs: INFO 17:40:25,405 Snapshotting drillster, Account to pre-sstablemetamigration ERROR 17:40:25,407 Exception encountered during startup java.lang.RuntimeExcept

Re: VPC AWS

2014-06-05 Thread Michael Theroux
We personally use the EC2Snitch, however, we don't have the multi-region requirements you do, -Mike From: Alain RODRIGUEZ To: user@cassandra.apache.org Sent: Thursday, June 5, 2014 9:14 AM Subject: Re: VPC AWS I think you can define VPC subnet to be publi

Re: memtable mem usage off by 10?

2014-06-05 Thread Benedict Elliott Smith
What does /usr/java/latest/bin/java -version print? On 5 June 2014 08:15, Idrén, Johan wrote: > I’m using the datastax rpms, using the bundled launch scripts. > > grep -i jamm * > cassandra-env.sh:# add the jamm javaagent > cassandra-env.sh:JVM_OPTS="$JVM_OPTS > -javaagent:$CASSANDRA_HO

Re: VPC AWS

2014-06-05 Thread Alain RODRIGUEZ
Well if you are talking about the transition I think we are going to do it this way - Existing cluster out of any VPC -> DC1 - New VPC cluster, with subnet private + public -> DC2 - Add / Rebuild DC2 - Switch clients from DC1 to DC2 - Remove DC1 - Switch client inside the VPC - Remove public VPC s

Re: VPC AWS

2014-06-05 Thread David McNelis
A general thought, if you're using AWS for this, I'd strongly recommend you consider using OpsWorks and custom chef recipes for your node deployment if its an option for you. The easy of provisioning new nodes without the need for snapshotting is certainly worth the hassle, and there are already s

Re: VPC AWS

2014-06-05 Thread William Oberman
I was making assumptions (sorry!), namely the goal to switching to vpc was getting "backend resources" (like cassandra) off the public internet. Will On Thursday, June 5, 2014, Alain RODRIGUEZ wrote: > I think you can define VPC subnet to be public (to have public + private > IPs) or private on

Re: VPC AWS

2014-06-05 Thread Alain RODRIGUEZ
I think you can define VPC subnet to be public (to have public + private IPs) or private only. Any insight regarding snitches ? What snitch do you guys use ? 2014-06-05 15:06 GMT+02:00 William Oberman : > I don't think traffic will flow between "classic" ec2 and vpc directly. > There is some ki

Re: VPC AWS

2014-06-05 Thread William Oberman
I don't think traffic will flow between "classic" ec2 and vpc directly. There is some kind of gateway bridge instance that sits between, acting as a NAT. I would think that would cause new challenges for: -transitions -clients Sorry this response isn't heavy on content! I'm curious how this thr

Re: VPC AWS

2014-06-05 Thread Michael Theroux
Hello Alain, We switched from EC2 to VPC a couple of years ago.  The process for us was long, slow and multi step. In our case, we don't need to consider multi-DC.  However, in our infrastructure we were rapidly running out of IP addresses, and wished to move to VPC to give us a nearly inexhau

Re: VPC AWS

2014-06-05 Thread Michael Theroux
Hello Alain, We switched from EC2 to VPC a couple of years ago.  The process for us was long, slow, and multi step for our (at the time) 6 node cluster. In our case, we don't need to consider multi-DC.  However, in our infrastructure we were rapidly running out of IP addresses, and wished to mo

Re: VPC AWS

2014-06-05 Thread Idrén , Johan
Using 2.0.x, we hit https://issues.apache.org/jira/browse/CASSANDRA-2380 when deploying in a VPC. We worked around this by adding a hostname in /etc/hosts. Apart from that, nothing specific. We use only internal ip's, no public ip's and no broadcast etc. It works very well. /Johan From: Alain

VPC AWS

2014-06-05 Thread Alain RODRIGUEZ
Hi guys, We are going to move from a cluster made of simple Amazon EC2 servers to a VPC cluster. We are using Cassandra 1.2.11 and I have some questions regarding this switch and the Cassandra configuration inside a VPC. Actually I found no documentation on this topic, but I am quite sure that so

Re: Consolidating records and TTL

2014-06-05 Thread Aaron Morton
As Tyler says, with atomic batches which are enabled by default the cluster will keep trying to replay the insert / deletes. Nodes check their local batch log for failed batches, ones where the coordinator did not acknowledge it had successfully completed, every 60 seconds. So there is a windo

CQLSSTableWriter memory leak

2014-06-05 Thread Xu Zhongxing
I am using Cassandra's CQLSSTableWriter to import a large amount of data into Cassandra. When I use CQLSSTableWriter to write to a table with compound primary key, the memory consumption keeps growing. The GC of JVM cannot collect any used memory. When writing to tables with no compound primary

Re: memtable mem usage off by 10?

2014-06-05 Thread Idrén , Johan
I’m using the datastax rpms, using the bundled launch scripts. grep -i jamm * cassandra-env.sh:# add the jamm javaagent cassandra-env.sh:JVM_OPTS="$JVM_OPTS -javaagent:$CASSANDRA_HOME/lib/jamm-0.2.5.jar” And it’s part of the commandline used to start cassandra: /usr/java/latest/bin/java -ea