Re: Large system.Migration CF after upgrade to 1.1

2013-11-26 Thread Aaron Morton
> We have noticed that a cluster we upgraded to 1.1.6 (from 1.0.*) still has a > single large (~4GB) row in system.Migrations on each cluster node. There is some code in there to drop that CF at startup, but I’m not sure on the requirements for it to run. if the time stamps have not been updated

Re: 1.1.11: system keyspace is filling up

2013-11-26 Thread Aaron Morton
> What happens if they are not being successfully delivered ? Will they > eventually TTL-out ? They have a TTL set to the gc_grace_seconds on the CF at the time of the write. I’ve also seen hints build up in multi DC systems due to timeouts on the coordinator. i.e. the remote nodes are up, co-

Re: Intermittent connection error

2013-11-26 Thread Aaron Morton
> The inability to truncate is actually my bigger problem. If I could truncate > tables, then I wouldn't have to create so many sessions, and the frequency of > this error would be at tolerable levels. Can you truncate through cqlsh ? > Running this program occasionally produces the following o

MiscStage Backup

2013-11-26 Thread Chris Burroughs
I'm trying to debug a node that has a backup in MiscStage. Starting a bit under 24 hours ago the number of Pending tasks jumped to a bit under 400 and hovered around there. It looks like repair requests from other nodes (tpstats on this node shows AntiEntropySessions: 0, 0, 0, which I think

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
Thanks Jonathan for the help. On Tue, Nov 26, 2013 at 6:14 PM, Jonathan Haddad wrote: > No, 2.7 only. > > > On Tue, Nov 26, 2013 at 3:04 PM, Kumar Ranjan wrote: > >> Hi Jonathan - Does cqlengine have support for python 2.6 ? >> >> >> On Tue, Nov 26, 2013 at 4:17 PM, Jonathan Haddad wrote: >> >>

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Jonathan Haddad
No, 2.7 only. On Tue, Nov 26, 2013 at 3:04 PM, Kumar Ranjan wrote: > Hi Jonathan - Does cqlengine have support for python 2.6 ? > > > On Tue, Nov 26, 2013 at 4:17 PM, Jonathan Haddad wrote: > >> cqlengine supports batch queries, see the docs here: >> http://cqlengine.readthedocs.org/en/latest/t

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
Hi Jonathan - Does cqlengine have support for python 2.6 ? On Tue, Nov 26, 2013 at 4:17 PM, Jonathan Haddad wrote: > cqlengine supports batch queries, see the docs here: > http://cqlengine.readthedocs.org/en/latest/topics/queryset.html#batch-queries > > > On Tue, Nov 26, 2013 at 11:53 AM, Kumar

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Jonathan Haddad
cqlengine supports batch queries, see the docs here: http://cqlengine.readthedocs.org/en/latest/topics/queryset.html#batch-queries On Tue, Nov 26, 2013 at 11:53 AM, Kumar Ranjan wrote: > Jon - Any comment on batching? > — > Sent from Mailbox for iPhone > > > On

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Robert Coli
On Tue, Nov 26, 2013 at 11:46 AM, Kumar Ranjan wrote: > How do you insert huge amount of data? > http://www.palominodb.com/blog/2012/09/25/bulk-loading-options-cassandra ? =Rob

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
Jon - Any comment on batching? — Sent from Mailbox for iPhone On Tue, Nov 26, 2013 at 2:52 PM, Laing, Michael wrote: > That's not a problem we have faced yet. > On Tue, Nov 26, 2013 at 2:46 PM, Kumar Ranjan wrote: >> How do you insert huge amount of data? >> — >> Sent from Mailbox

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Laing, Michael
That's not a problem we have faced yet. On Tue, Nov 26, 2013 at 2:46 PM, Kumar Ranjan wrote: > How do you insert huge amount of data? > — > Sent from Mailbox for iPhone > > > On Tue, Nov 26, 2013 at 2:31 PM, Laing, Michael > wrote: > >> I think thread pooling

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
How do you insert huge amount of data? — Sent from Mailbox for iPhone On Tue, Nov 26, 2013 at 2:31 PM, Laing, Michael wrote: > I think thread pooling is always in operation - and we haven't seen any > problems in that regard going to the 6 local nodes each client connects to. > We haven't tried

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
Hi Jon - you are right. Its that I understand other ORM like python sqlalchemy or perl DBIX by heart. So i can cql faster than use cqlengine. I will give a shot at python-driver based on Michael's recommendation. — Sent from Mailbox for iPhone On Tue, Nov 26, 2013 at 2:21 PM, Jonathan Haddad w

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Laing, Michael
I think thread pooling is always in operation - and we haven't seen any problems in that regard going to the 6 local nodes each client connects to. We haven't tried batching yet. On Tue, Nov 26, 2013 at 2:05 PM, Kumar Ranjan wrote: > Michael - thanks. Have you tried batching and thread pooling

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Robert Coli
On Tue, Nov 26, 2013 at 9:46 AM, Kumar Ranjan wrote: > I have worked with Pycassa before and wrote a wrapper to use batch > mutation & connection pooling etc. But > http://wiki.apache.org/cassandra/ClientOptions recommends now to use CQL > 3 based api because Thrift based api (Pycassa) will be su

Re: Schema disagreement under normal conditions, ALTER TABLE hangs

2013-11-26 Thread Robert Coli
On Mon, Nov 25, 2013 at 6:42 PM, Josh Dzielak wrote: > Recently we had a strange thing happen. Altering schema (gc_grace_seconds) > for a column family resulted in a schema disagreement. 3/4 of nodes got it, > 1/4 didn't. There was no partition at the time, nor was there multiple > schema updates

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Jonathan Haddad
We're currently using the "cql" package, which is really a wrapper around thrift. To your concern about deadlines, I'm not sure how writing raw CQL is going to be any faster than using a mapper library for anything other than the most trivial of project. On Tue, Nov 26, 2013 at 11:09 AM, Kumar

Re: Data loss when swapping out cluster

2013-11-26 Thread Robert Coli
On Tue, Nov 26, 2013 at 9:48 AM, Christopher J. Bottaro < cjbott...@academicworks.com> wrote: > One thing that I didn't mention, and I think may be the culprit after > doing a lot or mailing list reading, is that when we brought the 4 new > nodes into the cluster, they had themselves listed in the

PHP CQL binary protocol client

2013-11-26 Thread Robert McFrazier
I have just released the initial commit for a PHP CQL binary protocol client. It is not completed yet, I'm currently working on implementing compression, collections, and prepared queries. There is an example of how to connect to a Cassandra server and issue a query, in the examples folder. Blo

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
Jon - Thanks. As I understand, cqlengine is an object mapper and must be using for cql prepare statements. What are you wrapping it with, in alternative to python-driver?  — Sent from Mailbox for iPhone On Tue, Nov 26, 2013 at 1:19 PM, Jonathan Haddad wrote: > So, for cqlengine (https://githu

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
Michael - thanks. Have you tried batching and thread pooling in python-driver? For now, i would avoid object mapper cqlengine, just because of my deadlines. — Sent from Mailbox for iPhone On Tue, Nov 26, 2013 at 1:52 PM, Laing, Michael wrote: > We use the python-driver and have contributed som

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Laing, Michael
We use the python-driver and have contributed some to its development. I have been careful to not push too fast on features until we need them. For example, we have just started using prepared statements - working well BTW. Next we will employ futures and start to exploit the async nature of new

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Jonathan Haddad
So, for cqlengine (https://github.com/cqlengine/cqlengine), we're currently using the thrift api to execute CQL until the native driver is out of beta. I'm a little biased in recommending it, since I'm one of the primary authors. If you've got cqlengine specific questions, head to the mailing lis

Re: nodetool repair seems to increase linearly with number of keyspaces

2013-11-26 Thread Christopher J. Bottaro
We only have a single CF per keyspace. Actually we have 2, but one is tiny (only has 2 rows in it and is queried once a month or less). Yup, using vnodes with 256 tokens. Cassandra 1.2.10. -- C On Mon, Nov 25, 2013 at 2:28 PM, John Pyeatt wrote: > Mr. Bottaro, > > About how many column famil

Re: Data loss when swapping out cluster

2013-11-26 Thread Christopher J. Bottaro
We ran repair -pr on each node after we realized there was data loss and we added the 4 original nodes back in the cluster. I.e. we ran repair on the 8 node cluster that consisted of the 4 old and 4 new nodes, once we realized there was a problem. We are using quorum reads and writes. One thing

Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
I have worked with Pycassa before and wrote a wrapper to use batch mutation & connection pooling etc. But http://wiki.apache.org/cassandra/ClientOptions recommends now to use CQL 3 based api because Thrift based api (Pycassa) will be supported for backward compatibility only. Apache site recommends

Re: Data loss when swapping out cluster

2013-11-26 Thread Jeremiah D Jordan
TL;DR you need to run repair in between doing those two things. Full explanation: https://issues.apache.org/jira/browse/CASSANDRA-2434 https://issues.apache.org/jira/browse/CASSANDRA-5901 Thanks, -Jeremiah Jordan On Nov 25, 2013, at 11:00 AM, Christopher J. Bottaro wrote: > Hello, > > We rec

Re: Data loss when swapping out cluster

2013-11-26 Thread Janne Jalkanen
That sounds bad! Did you run repair at any stage? Which CL are you reading with? /Janne On 25 Nov 2013, at 19:00, Christopher J. Bottaro wrote: > Hello, > > We recently experienced (pretty severe) data loss after moving our 4 node > Cassandra cluster from one EC2 availability zone to an

io bound model

2013-11-26 Thread Marcelo Elias Del Valle
Hi everyone, I currently have a column family InputCf in production which has 1 data input per row. Everytime I receive new data from web, I insert a row in this CF. Besides that, I have another CF InputCfIndex in which the year/month/day is my row id (MMdd) and I insert the id of InputCf

What is listEndpointsPendingHints?

2013-11-26 Thread Tom van den Berge
When I run the operation "listEndpointsPendingHints" on the mbean org.apache.cassandra.db:type=HintedHandoffManager, I'm getting ( 126879603237190600081737151857243914981 ) It suggests that there are pending hints, but the org.apache.cassandra.internal:type=HintedHandoff mbean provides these figu

Unable to load dependent classes of a trigger

2013-11-26 Thread J Ramesh Kumar
Hi, I wrote a trigger and it will call internally some other classes. I added all the dependent classes into a jar and put it into the "*conf/tiggers*". But cassandra does not load the dependent classes which available in the jar. How can Solve this issue ? Thanks, Ramesh Traces which found in t

WRITETIME() sometimes returns 0

2013-11-26 Thread Robert Wille
I've got a Cassandra 2.0.2 server with a single node. I've written a test harness that populates the database, looks at the write times for each column, runs the test cases, and then checks to see what columns have been updated. However, when I query WRITETIME() (prior to running the test cases), o