Nodes Flapping in the Ring

2011-11-10 Thread Jake Maizel
56 0.3 60 12 32 periodic 1 864000 true Any thoughts on this would be really interesting. -- Jake Maizel Head of Network Operations Soundcloud Mail & GTalk: j...@soundcloud.com Skype: jakecloud Rosenthaler strasse 13, 101 19, Berlin, DE

Re: Upgrading to 1.0

2011-11-05 Thread Jake Maizel
2011 at 2:57 PM, Jonathan Ellis wrote: > 1.0 can read 0.6 data files but is not network-compatible, so you need > to do an all-at-once upgrade. Additionally, the Thrift API changed > starting with 0.7; see NEWS.txt for details. > > On Wed, Nov 2, 2011 at 6:46 AM, Jake Maizel wrot

Large Increase in SSTable count after Upgrade to 0.6.13

2011-11-04 Thread Jake Maizel
3 GCInspector.java:168 SoundCloud.IncomingTracks 95470,1604563 0/0 20/20 We have tried to run manual compactions but these don't seem to ever happen, likely due to the high pending count. I am wondering what the best way is to figure out what is blocking on these nodes, in order to get compaction back in the game. I have considered isolating one node via the network to see if it can catch up once there is no load on it. Not sure of the negative side effects of that. Any suggestions on resolving this? Regards, Jake -- Jake Maizel Head of Network Operations Soundcloud Mail & GTalk: j...@soundcloud.com Skype: jakecloud Rosenthaler strasse 13, 101 19, Berlin, DE

Upgrading to 1.0

2011-11-02 Thread Jake Maizel
Hello, We run a medium-sized cluster of 12 nodes on 0.6.13 and would like to move to 1.0. What are the best practices for this? Can we do a rolling upgrade or does the entire cluster need to be upgraded at once? Regards, Jake -- Jake Maizel Head of Network Operations Soundcloud Mail &

Streaming stuck on one node during Repair

2011-09-02 Thread Jake Maizel
needing it. Thoughts? (0.6.6 on a 7-node cluster) -- Jake Maizel Head of Network Operations Soundcloud Mail & GTalk: j...@soundcloud.com Skype: jakecloud Rosenthaler strasse 13, 101 19, Berlin, DE

Repairing lost data

2011-08-27 Thread Jake Maizel
and Filter files are still the originals. Should I keep these or do anything else with them? My alternative is to delete all the data and run repair again which I have done in the past and it works but takes a while with a large data set. I am open to ideas and any suggestions are welcome. --

Upgrade to a different version?

2011-03-16 Thread Jake Maizel
, jake -- Jake Maizel Soundcloud Mail & GTalk: j...@soundcloud.com Skype: jakecloud Rosenthaler strasse 13, 101 19, Berlin, DE

Help with Error on reading sstable

2011-02-22 Thread Jake Maizel
r caused a huge amount of disk to be used and therefore ran out. Currently, is there a way to clear space in this situation? Would running a cleanup help? Running ver 0.6.6. Thanks, -- Jake Maizel

Re: Ran out of space during cleanup.. HELP

2010-12-08 Thread Jake Maizel
tually leads me to an important question. Should I always make sure > that Cassandra doesn't get past 50% of the drive's free space, otherwise an > anticompaction like this can just destroy the machine? > > On 12/8/10 1:12 AM, Jake Maizel wrote: >> >> Also, look for any sna

Re: How to Tell if Decommission has Completed

2010-12-08 Thread Jake Maizel
d have completed. > > On Wed, Dec 8, 2010 at 3:10 AM, Jake Maizel wrote: >> >> Hello, >> >> Is there a definitive way to tell if a Decommission operation has >> completed, such as a log message similar to what happens with a Drain >> command? >> >

Re: Ran out of space during cleanup.. HELP

2010-12-08 Thread Jake Maizel
isk space to flush >>      at >> > >> On 12/7/10 8:44 PM, Mark wrote: >> > 3 Node cluster and I just ran a nodetool cleanup on node #3. 1 and 2 >> > are now at 100% disk space. What should I do? >> >> > > > Are there files with -tmp in their names

How to Tell if Decommission has Completed

2010-12-08 Thread Jake Maizel
Hello, Is there a definitive way to tell if a Decommission operation has completed, such as a log message similar to what happens with a Drain command? Thanks. -- Jake Maizel Network Operations Soundcloud Mail & GTalk: j...@soundcloud.com Skype: jakecloud Rosenthaler strasse 13, 10

Re: Best Practice for Data Center Migration

2010-12-03 Thread Jake Maizel
e also been considering increasing the replication factor to 4 >> and then just shutting down all the old nodes.  Would that work as far >> as data availability would go? > > Not sure what you are thinking of there, but probably not. :) > > -- > Jonathan Ellis > Project C

Best Practice for Data Center Migration

2010-12-02 Thread Jake Maizel
work as far as data availability would go? Any other suggestions? Thanks. -- Jake Maizel Network Operations Soundcloud Mail & GTalk: j...@soundcloud.com Skype: jakecloud Rosenthaler strasse 13, 101 19, Berlin, DE

Disk Full Error on Cleanup

2010-11-26 Thread Jake Maizel
that doesn't have enough space for the anti-compaction. This keeps nodetool cleanup from completing since this occurs at the beginning of the run. I also tried running compact on this node and then cleanup but the same error results. Any ideas or pointers? -- Jake Maizel Network Opera

Questions about RackAwareStrategy and Multiple Data Centers

2010-11-19 Thread Jake Maizel
ntally did this and am curious about the effect. 3. Is large GC activity normal during a repair outside the documented "GC Storm" cases? By the way, really great work on Cassandra from an operations POV. I've enjoyed working with it. Regards and thanks for any help. Jake -- Jake Maizel