Cassandra Storage Sizing

2011-07-20 Thread Todd Burruss
I put together a blog post on Cassandra Storage Sizing so I don't need to keep figuring it out again and again. Hope everyone finds it useful, and give feedback if you find errors. http://btoddb-cass-storage.blogspot.com/ ... enjoy

selective replication

2011-09-14 Thread Todd Burruss
Has anyone done any work on what I'll call "selective replication" between DCs? I want to use Cassandra to replicate data to another virtual DC (for analytical purposes), but only "inserts", not "deletes". Picture having two data centers, DC1 for OLTP of short lived data (say 90 day window) an

Re: CMS GC initial-mark taking 6 seconds , bad?

2011-10-20 Thread Todd Burruss
Are you using cassandra's caching? If you are then you will need to play around with the RAM setting to find a sweet spot. A low hit rate on the cache (which is counter productive anyway) will cause more GC. A high hit rate, less GC. If you are not caching, no need to use a large heap as the OS

Re: Cassandra cluster HW spec (commit log directory vs data file directory)

2011-10-25 Thread Todd Burruss
This may help determining your data storage requirements ... http://btoddb-cass-storage.blogspot.com/ On 10/25/11 11:22 AM, "Mohit Anchlia" wrote: >On Tue, Oct 25, 2011 at 11:18 AM, Dan Hendry >wrote: >>> 2. ... So I am going to use rotational disk for the commit log and an >>>SSD >>> for da

Re: Second Cassandra users survey

2011-11-03 Thread Todd Burruss
- Better performance when access random columns in a wide row - caching subsets of wide rows - possibly on the same boundaries as the index - some sort of notification architecture when data is inserted. This could be co-processors, triggers, plugins, etc - auto load balance when adding new nodes

Re: Second Cassandra users survey

2011-11-08 Thread Todd Burruss
I believe I heard someone talk at Cassandra SF conference about creating a partitioner that was a derivation of RandomPartitioner. It essentially would look for keys that adhere to a certain pattern, like :. The portion would be used for determining the location on the ring, but : for actually s

Re: Second Cassandra users survey

2011-11-08 Thread Todd Burruss
A use case that could use this (but isn't in my top requests) is usage history for a given user. I use a single row to save history per user, each column is a user action with name a TimeUUID and value is a blob. I use the TimeUUID to sort the actions, but I don't really care about exact time. a

Re: Second Cassandra users survey

2011-11-09 Thread Todd Burruss
Entity Groups : https://issues.apache.org/jira/browse/CASSANDRA-1684 -Jake On Wed, Nov 9, 2011 at 6:44 AM, Todd Burruss mailto:bburr...@expedia.com>> wrote: I believe I heard someone talk at Cassandra SF conference about creating a partitioner that was a derivation of RandomPartitioner.

Row Groups

2011-11-09 Thread Todd Burruss
ob/solandra/src/lucandra/dht/RandomPartitioner.java But Row Groups is going to be the "official" way. -Jake On Wed, Nov 9, 2011 at 5:53 PM, Todd Burruss mailto:bburr...@expedia.com>> wrote: Thx jake for the JIRA, but there was someone at the conference that had already implemented

Re: Network traffic patterns

2011-11-16 Thread Todd Burruss
Are all of your machines equal hardware? Since those machines are sending data somewhere, maybe they are behind in replicating and are continuously catching up? Use a tool like tcpdump to find out where the data is going From: Philippe mailto:watche...@gmail.com>> Reply-To: "user@cassandra.apa

ParNew and caching

2011-11-17 Thread Todd Burruss
I'm using cassandra 1.0. Been doing some testing on using cass's cache. When I turn it on (using the CLI) I see ParNew jump from 3-4ms to 200-300ms. This really screws with response times, which jump from ~25-30ms to 1300+ms. I've increase new gen and that helps, but still this is suprising

Re: ParNew and caching

2011-11-18 Thread Todd Burruss
on my use case is that I am picking 50 columns from the 70k. Since the whole row is in the cache, and no copying from off-heap nor disk buffers, seems like it should be faster than non-cache mode. More thoughts :) On 11/18/11 6:39 AM, "Sylvain Lebresne" wrote: >On Fri, Nov 18, 201

Re: ParNew and caching

2011-11-18 Thread Todd Burruss
Concurrent provider : 500ms 100% cache hit rate. ParNew is the only stat that I see out of line, so seems like still a lot of copying On 11/18/11 2:40 PM, "Mohit Anchlia" wrote: >On Fri, Nov 18, 2011 at 1:46 PM, Todd Burruss >wrote: >> Ok, I figured something li

deleting rows and tombstones

2012-02-14 Thread Todd Burruss
my design calls for deleting a row (by key, not individual columns) and re-inserting it a lot and I'm concerned about tombstone build up slowing down reads. I know if I delete a lot of individual columns the tombstones will build up and slow down reads until they are cleaned up, but not sure if

Re: deleting rows and tombstones

2012-02-14 Thread Todd Burruss
It that's not, then you will get tombstones that might slow down the reads because they have to be skipped until the next compaction. 2012/2/14 Todd Burruss mailto:bburr...@expedia.com>> my design calls for deleting a row (by key, not individual columns) and re-inserting it a lot and I&

Re: deleting rows and tombstones

2012-02-14 Thread Todd Burruss
y. I proposed an alternative approach here https://issues.apache.org/jira/browse/CASSANDRA-3620 so vote it up if you share problems! Dominic On 14 February 2012 19:54, Todd Burruss mailto:bburr...@expedia.com>> wrote: my design calls for deleting a row (by key, not individual columns) and re-inser

Re: deleting rows and tombstones

2012-02-14 Thread Todd Burruss
ce badly. I'd like to see the end of the whole idea of GCGradeSeconds. On 14 February 2012 21:29, Todd Burruss mailto:bburr...@expedia.com>> wrote: do you find that repair is still as necessary now since hinted handoffs are stored anytime a node does not ACK successfully? From: Domini

Re: timed-out retrieving a giant row.

2012-02-14 Thread Todd Burruss
lookup ColumnSliceIterator in hector. this does what you want a lot easier than managing the "paging" From: Yuhan Zhang mailto:yzh...@onescreen.com>> Reply-To: "user@cassandra.apache.org" mailto:user@cassandra.apache.org>> Date: Tue, 14 Feb 2012 14:46:08 -0800

Re: Key cache hit rate issue

2012-02-16 Thread Todd Burruss
there is a setting in the yaml file that helps relieve memory pressure by reducing the row cache. it is based on the percent of memory used by the JVM the setting are, reduce_cache_sizes_at and reduce_cache_capacity_to. see how much free memory you have and if the numbers suggest that you have

Re: Key cache hit rate issue

2012-02-16 Thread Todd Burruss
jonathan, you said the key to the cache is key + sstable? looking at the code it looks like a DecoratedKey is the "row key". how does sstable come into play? On 2/16/12 1:20 PM, "Jonathan Ellis" wrote: >So, you have roughly 1/6 of your (physical) row keys cached and about >1/4 cache hit rate,

Re: Key cache hit rate issue

2012-02-17 Thread Todd Burruss
cachedPosition = getCachedPosition(unifiedKey, true); > >On Thu, Feb 16, 2012 at 4:23 PM, Todd Burruss >wrote: >> jonathan, you said the key to the cache is key + sstable? looking at >>the >> code it looks like a DecoratedKey is the "row key". how does sstabl

Re: Wide Row Performance & Index Question

2012-02-20 Thread Todd Burruss
I believe you will see a slight "unbalance" regardless of your RF with very wide rows, if they are of varying sizes. one node may get a very wide row and another node may get a not so wide row. it's all based on the key. From: aaron morton mailto:aa...@thelastpickle.com>> Reply-To: "user@cassa

Re: hinted handoff 16 s delay

2012-02-23 Thread Todd Burruss
if I remember correctly, cassandra has a random delay in it so hint deliver is staggered and does not overwhelm the just restarted node. On 2/23/12 1:46 PM, "Hontvári József Levente" wrote: >I have played with a test cluster, stopping cassandra on one node and >updating a row on another. I notic

batch isolation

2012-06-02 Thread Todd Burruss
reading the 1.1 "what's new" here, http://www.datastax.com/docs/1.1/getting_started/new_features, I'm wondering about row level isolation. two questions: 1 – does this mean that a batch_mutate that first sends a "row delete" mutation on key X, then subsequent insert mutations for key X is isol

Re: batch isolation

2012-06-03 Thread Todd Burruss
Re: batch isolation On Sun, Jun 3, 2012 at 2:53 AM, Todd Burruss wrote:> > 1 – does this mean that a batch_mutate that first sends a "row delete" > mutation on key X, then subsequent insert mutations for key X is isolated? I'm not sure what you mean by having "a

Re: batch isolation

2012-06-04 Thread Todd Burruss
I don't think I'm being clear. I just was wondering if a "row delete" is isolated with all the other inserts or deletes to a specific column family and key in the same batch. On 6/4/12 1:58 AM, "Sylvain Lebresne" wrote: >On Sun, Jun 3, 2012 at 6:05 PM, Tod

Re: repair cause large number of SSTABLEs

2011-01-27 Thread Todd Burruss
/CASSANDRA-1959 Aaron On 27 Jan 2011, at 19:22, B. Todd Burruss wrote: > i ran out of file handles on the "repairing node" after doing nodetool repair > - strange as i have never had this issue until using 0.7.0 (but i should say > that i have not truly tested 0.7.0 until now.)

RE: repair cause large number of SSTABLEs

2011-01-27 Thread Todd Burruss
ndra.apache.org Subject: Re: repair cause large number of SSTABLEs Maybe related to https://issues.apache.org/jira/browse/CASSANDRA-1992 ? On Jan 27, 2011, at Thu Jan 27, 1:22 AM, B. Todd Burruss wrote: i ran out of file handles on the "repairing node" after doing nodetool repair - strange

Re: Cassandra training on May 21 in Palo Alto

2010-05-07 Thread Todd Burruss
+1 -Original Message- From: S Ahmed [sahmed1...@gmail.com] Received: 5/7/10 7:09 AM To: user@cassandra.apache.org [u...@cassandra.apache.org] Subject: Re: Cassandra training on May 21 in Palo Alto It would be great if you could make a video of this event. Yes it won't like being there

RE: Cassandra vs. Voldemort benchmark

2010-05-07 Thread Todd Burruss
i did a lot of comparisons between voldemort and cassandra and in the end i decided to go with cassandra. the main reason was recovery and balancing operations. on the surface voldemort is s*** hot fast, until you need to restore a node or add nodes. BDB (the default persistence solution) isn

RE: 0.7, 0.8 roadmaps

2010-07-06 Thread Todd Burruss
Along this line, is it known if upgrading from 0.6 to 0.7 will require cluster downtime? -Original Message- From: tsuraan [tsur...@gmail.com] Received: 7/6/10 9:16 AM To: user@cassandra.apache.org [u...@cassandra.apache.org] Subject: 0.7, 0.8 roadmaps Is there a document anywhere with

RE: Question about CL.ZERO

2010-07-12 Thread Todd Burruss
tched to the replicas and success immediately returned to the client. Wouldn't be hard to add. On Mon, Jul 12, 2010 at 10:51 AM, B. Todd Burruss wrote: > why is there no good reason? if i would like to record informational > events, possibly for debugging or something, i don'

GCGraceSeconds per ColumnFamily/Keyspace

2010-07-12 Thread Todd Burruss
I have two CFs in my keyspace. one i care about allowing a good amount of time for tombstones to propagate (GCGraceSeconds large) ... but the other i couldn't care and in fact i want them gone ASAP so i don't iterate over them. has any thought been given to making this setting per Keyspace or

Re: GCGraceSeconds per ColumnFamily/Keyspace

2010-07-12 Thread Todd Burruss
. On Mon, Jul 12, 2010 at 6:33 PM, Todd Burruss wrote: > I have two CFs in my keyspace. one i care about allowing a good amount of > time for tombstones to propagate (GCGraceSeconds large) ... but the other i > couldn't care and in fact i want them gone ASAP so i don't iterate

Re: GCGraceSeconds per ColumnFamily/Keyspace

2010-07-13 Thread Todd Burruss
Yes -Original Message- From: Jonathan Ellis [jbel...@gmail.com] Received: 7/12/10 9:15 PM To: user@cassandra.apache.org [u...@cassandra.apache.org] Subject: Re: GCGraceSeconds per ColumnFamily/Keyspace Probably. Can you open a ticket? On Mon, Jul 12, 2010 at 10:41 PM, Todd Burruss

Re: RowMutationVerbHandler.java (line 78) Error in row mutation

2010-08-28 Thread Todd Burruss
at 3:58 PM, B. Todd Burruss wrote: > i got the latest code this morning. i'm testing with 0.7 > > > ERROR [ROW-MUTATION-STAGE:388] 2010-08-27 15:54:58,053 > RowMutationVerbHandler.java (line 78) Error in row mutation > org.apache.cassandra.db.UnserializableColumnFamilyE

node repair

2010-03-20 Thread Todd Burruss
i had a node fail, lost all data. so i brought it back up fresh, but assigned it the same token in storage-conf.xml. then ran nodetool repair. all compactions have finished, no streams are happening. nothing. so i did it again. same thing. i don't think its working. is there a log message

RE: node repair

2010-03-20 Thread Todd Burruss
erent ip but the same token, it will confuse things. http://wiki.apache.org/cassandra/Operations "handling failure" section covers best practices here. On Sat, Mar 20, 2010 at 11:51 AM, Todd Burruss wrote: > i had a node fail, lost all data. so i brought it back up fresh, but >

RE: node repair

2010-03-20 Thread Todd Burruss
fyi ... i just compacted and node 105 is definitely not being repaired From: Todd Burruss Sent: Saturday, March 20, 2010 12:34 PM To: user@cassandra.apache.org Subject: RE: node repair same IP, same token. i'm trying Handling Failure, #3. it is ru

RE: node repair

2010-03-21 Thread Todd Burruss
random partitioner and assigned a token to each node. From: Todd Burruss Sent: Saturday, March 20, 2010 6:48 PM To: Todd Burruss; user@cassandra.apache.org Subject: RE: node repair fyi ... i just compacted and node 105 is definitely not being repaired

RE: node repair

2010-03-21 Thread Todd Burruss
debug logs from the run, would you mind opening a JIRA describing the problem? -Original Message- From: "Todd Burruss" Sent: Sunday, March 21, 2010 1:30pm To: "Todd Burruss" , "user@cassandra.apache.org" Subject: RE: node repair one last comment about thest

RE: node repair

2010-03-22 Thread Todd Burruss
is logged about the compactions and repair when you are using DEBUG. Do you think you might have restarted the node being repaired during the streaming process? I'm not sure we have good handling for that case. Thanks, Stu -Original Message- From: "Todd Burruss" Sent: Sund

memory question

2010-03-22 Thread Todd Burruss
after running my cluster for a while performance has become unacceptable, 200+ ms for reads. if running well, i see reads <10ms. when i run iostat the disk is being hammered by reads. seems like i/o caching isn't even being used avg-cpu: %user %nice %system %iowait %steal %idle

RE: node repair

2010-03-22 Thread Todd Burruss
ndra.apache.org Subject: Re: node repair On Mon, Mar 22, 2010 at 11:53 AM, Todd Burruss wrote: > it's very possible if i thought it wasn't working. is there a delay between > compation and streaming? yes, it can be a significant one if you have a lot of data. you can look at th

RE: memory question

2010-03-26 Thread Todd Burruss
need to tell Cassandra to cache more rows (or keys, depending). If you are maxing out your caches and still seeing this then you just need to add more capacity, there's no magic wand. On Mon, Mar 22, 2010 at 5:14 PM, Todd Burruss wrote: > after running my cluster for a while perform

Re: Will cassandra block client ?

2010-04-23 Thread Todd Burruss
Ran, Under very heavy load using more than 50 threads with 20k payload size, I have seen Hector close connections then reopen so such that time_wait builds up and can no longer connect. -Original Message- From: Ran Tavory [ran...@gmail.com] Received: 4/22/10 1:29 AM To: user@cassandra.

Re: Remove node from cluster and have it run as a single node cluster by itself

2012-10-05 Thread B. Todd Burruss
i believe the system keyspace keeps track of the cluster topology. even though you changed info in yaml, the system keyspace still knows about the other nodes. remove the system keyspace files from data dir and try again On Fri, Oct 5, 2012 at 4:47 AM, Fredrik wrote: > I guess that the other nod

Re: cassandra 1.2 beta in production

2012-10-10 Thread B. Todd Burruss
https://issues.apache.org/jira/browse/CASSANDRA/fixforversion/12323284 On Wed, Oct 10, 2012 at 1:41 AM, Alexey Zotov wrote: > Hi Guys, > > What known critical bugs are there that couldn't allow to use 1.2 beta 1 in > production? > We don't use cql and secondary indexes. > > > -- > > Best regards

Re: unbalanced ring

2012-10-10 Thread B. Todd Burruss
major compaction in production is fine, however it is a heavy operation on the node and will take I/O and some CPU. the only time i have seen this happen is when i have changed the tokens in the ring, like "nodetool movetoken". cassandra does not auto-delete data that it doesn't use anymore just

Re: Upgrading hardware on a node in a cluster

2012-10-10 Thread B. Todd Burruss
if you have N nodes in your cluster, add N new nodes using the new hardware, then decommision the old N nodes. (and migrate to VPC like dean said) On Wed, Oct 10, 2012 at 5:23 AM, Hiller, Dean wrote: > Well, you could use amazon VPC in which case you DO pick the IP yourself > ;)….it makes life

Re: unbalanced ring

2012-10-10 Thread B. Todd Burruss
+972 54 8356490 > Fax: +972 2 5612956 > > > > > > On Wed, Oct 10, 2012 at 6:12 PM, B. Todd Burruss wrote: > >> major compaction in production is fine, however it is a heavy operation >> on the node and will take I/O and some CPU. >> >> the only time i

Re: 1.1.1 is "repair" still needed ?

2012-10-11 Thread B. Todd Burruss
as of 1.0 (CASSANDRA-2034) hints are generated for nodes that timeout. On Thu, Oct 11, 2012 at 3:55 AM, Watanabe Maki wrote: > Even if HH works fine, HH will not be created until the failure detector > marks the node is dead. > HH will not be created for partially timeouted mutation request ( b

Re: Cassandra nodes loaded unequally

2012-10-12 Thread B. Todd Burruss
are you connecting to the same node every time? if so, spread out your connections across the ring On Fri, Oct 12, 2012 at 1:22 AM, Alexey Zotov wrote: > Hi Ben, > > I suggest you to compare amount of queries for each node. May be the problem > is on the client side. > Yoy can do that using JMX:

Re: what is more important (RAM vs Cores)

2012-10-12 Thread B. Todd Burruss
i would not worry as much about the single machine specs. find the sweet spot on price for CPU and RAM and by that, then scale horizontal to meet your demand. but .. if i was pressed for a general statement -choose RAM over CPU On Fri, Oct 12, 2012 at 4:34 AM, Serge Fonville wrote: > It seems y

Re: read performance plumetted

2012-10-12 Thread B. Todd Burruss
did the amount of data finally exceed your per machine RAM capacity? is it the same 20% each time you read? or do your periodic reads eventually work through the entire dataset? if you are essentially table scanning your data set, and the size exceeds available RAM, then a degradation like that i

Re: Option for ordering columns by timestamp in CF

2012-10-12 Thread B. Todd Burruss
trying to think of a use case where you would want to order by timestamp, and also have unique column names for direct access. not really trying to challenge the use case, but you can get ordering by timestamp and still maintain a "name" for the column using composites. if the first component of t

Re: Issue removing rows

2012-10-13 Thread B. Todd Burruss
i have used StorageProxy and was forgetting to rewind (or otherwise setup my ByteBuffer properly) and was getting, i believe, the same error. check your ByteBuffers On Sat, Oct 13, 2012 at 8:49 AM, Nick Morizio wrote: > I'm wondering if anyone has seen this issue before: > > We are running Cassa

tombstones and their data

2012-10-22 Thread B. Todd Burruss
if a node, X, has a tombstone marking deleted data, when can node X remove the data - not the tombstone, but the data? i understand the tombstone cannot be removed until GCGraceSeconds has passed, but it seems the data could be compacted away at any time.

Re: tombstones and their data

2012-10-22 Thread B. Todd Burruss
u get >> confirmation. >> >> Dean >> >> On 10/22/12 10:43 AM, "B. Todd Burruss" wrote: >> >>>if a node, X, has a tombstone marking deleted data, when can node X >>>remove the data - not the tombstone, but the data? i understand the >

nodetool cleanup

2012-10-22 Thread B. Todd Burruss
does "nodetool cleanup" perform a major compaction in the process of removing unwanted data? i seem to remember this to be the case, but can't find anything definitive

Re: nodetool cleanup

2012-10-23 Thread B. Todd Burruss
It is typically used after the token > assignments have been changed. > > Cheers > > - > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 23/10/2012, at 6:42 PM, Will @ SOHO wrote: > > On 10/23/2012 01:

Re: constant CMS GC using CPU time

2012-10-23 Thread B. Todd Burruss
Regarding memory usage after a repair ... Are the merkle trees kept around? On Oct 23, 2012 3:00 PM, "Bryan Talbot" wrote: > On Mon, Oct 22, 2012 at 6:05 PM, aaron morton wrote: > >> The GC was on-going even when the nodes were not compacting or running a >> heavy application load -- even when th

Re: High bandwidth usage between datacenters for cluster

2012-11-01 Thread B. Todd Burruss
bryce, did you resolve this? i'm interested in the outcome. when you write does it help to use CL = LOCAL_QUORUM? On Mon, Oct 29, 2012 at 12:52 AM, aaron morton wrote: > Outbound messages for other DC's are grouped and a single instance is sent > to a single node in the remote DC. The remote no

leveled compaction and tombstoned data

2012-11-08 Thread B. Todd Burruss
we are having the problem where we have huge SSTABLEs with tombstoned data in them that is not being compacted soon enough (because size tiered compaction requires, by default, 4 like sized SSTABLEs). this is using more disk space than we anticipated. we are very write heavy compared to reads, an

Re: leveled compaction and tombstoned data

2012-11-08 Thread B. Todd Burruss
we are running Datastax enterprise and cannot patch it. how bad is "kill performance"? if it is so bad, why is it an option? On Thu, Nov 8, 2012 at 10:17 AM, Radim Kolar wrote: > Dne 8.11.2012 19:12, B. Todd Burruss napsal(a): > >> my question is would leveled compact

Re: leveled compaction and tombstoned data

2012-11-08 Thread B. Todd Burruss
thanks for the links! i had forgotten about live sampling On Thu, Nov 8, 2012 at 11:41 AM, Brandon Williams wrote: > On Thu, Nov 8, 2012 at 1:33 PM, Aaron Turner wrote: >> There are also ways to bring up a test node and just run Level Compaction on >> that. Wish I had a URL handy, but hopefull

Re: leveled compaction and tombstoned data

2012-11-08 Thread B. Todd Burruss
, Nov 8, 2012 at 11:53 AM, B. Todd Burruss wrote: > thanks for the links! i had forgotten about live sampling > > On Thu, Nov 8, 2012 at 11:41 AM, Brandon Williams wrote: >> On Thu, Nov 8, 2012 at 1:33 PM, Aaron Turner wrote: >>> There are also ways to bring up a test

Re: Replication factor and performance questions

2012-11-10 Thread B. Todd Burruss
@oleg, to answer your last question a cassandra node should never ask another node for information it doesn't have. it uses the key and the partitioner to determine where the data is located before ever contacting another node. On Mon, Nov 5, 2012 at 9:45 AM, Andrey Ilinykh wrote: > You will hav

Re: Multiple Clusters Keyspacse to one core cluster

2012-11-11 Thread B. Todd Burruss
with NetworkTopologyStrategy it theoretically should work http://www.datastax.com/docs/1.0/cluster_architecture/replication On Thu, Nov 8, 2012 at 5:11 PM, ws wrote: > If I have multiple clusters can I replicate a keyspace from each of those > cluster to separate cluster? > >

Re: removing SSTABLEs

2012-11-12 Thread B. Todd Burruss
tion. >> >> >> On Mon, Nov 12, 2012 at 12:09 PM, B. Todd Burruss wrote: >>> >>> if i stop a node and remove an SSTABLE, let's call it X, is that safe? >>> >>> ok, more info. i know that the data in SSTABLE X has been tombstoned >>>

CQL timestamps and timezones

2012-12-07 Thread B. Todd Burruss
trying to figure out if i'm doing something wrong or a bug. i am creating a simple schema, inserting a timestamp using ISO8601 format, but when retrieving the timestamp, the timezone is displayed incorrectly. i'm inserting using GMT, the result is shown with "+", but the time is for my local

Re: Query regarding SSTable timestamps and counts

2012-12-10 Thread B. Todd Burruss
my two cents ... i know this thread is a bit old, but the fact that odd-sized SSTABLEs (usually large ones) will hang around for a while can be very troublesome on disk space and planning. our data is temporal in cassandra, being deleted constantly. we have seen space usage in the 1+ TB range whe

Re: entire range of node out of sync -- out of the blue

2012-12-18 Thread B. Todd Burruss
in your data directory, for each keyspace there is a solr.json. cassandra stores the SSTABLEs it knows about when using leveled compaction. take a look at that file and see if it looks accurate. if not, this is a bug with cassandra that we are checking into as well On Thu, Dec 6, 2012 at 7:38

Re: Does a scrub remove deleted/expired columns?

2012-12-19 Thread B. Todd Burruss
i will add that we have had a good experience with leveled compaction cleaning out tombstoned data faster than size tiered, therefore keeping our total disk usage much more reasonable than size tiered. it is at the cost of I/O ... maybe 2X the I/O?? but that is not bothering us. what is bothering

Re: Moving data from one datacenter to another

2012-12-19 Thread B. Todd Burruss
to get it "correct", meaning consistent, it seems you will need to do a repair no matter what since the source cluster is taking writes during this time and writing to commit log. so to avoid filename issues just do the first copy and then repair. i am not sure if they can have any filename. to

Re: entire range of node out of sync -- out of the blue

2012-12-19 Thread B. Todd Burruss
che.org/jira/browse/CASSANDRA-5041 > TBH i think this was a repair without -pr > > thanks, > Andras > > Andras Szerdahelyi* > *Solutions Architect, IgnitionOne | 1831 Diegem E.Mommaertslaan 20A > M: +32 493 05 50 88 | Skype: sandrew84 > > > > > > On 18 Dec 2

Re: TTL on SecondaryIndex Columns. A bug?

2012-12-19 Thread B. Todd Burruss
i believe we have hit this as well. if you use nodetool to rebuild_index, does it work? On Wed, Dec 19, 2012 at 8:10 PM, aaron morton wrote: > Well that was fun https://issues.apache.org/jira/browse/CASSANDRA-5079 > > Just testing my idea of a fix now. > > Cheers > - > Aaron Mort

Re: when are keyspace dirs removed?

2013-01-04 Thread B. Todd Burruss
Their should be shapshots in there > https://github.com/apache/cassandra/blob/trunk/conf/cassandra.yaml#L402 > > Cheers > > - > Aaron Morton > Freelance Cassandra Developer > New Zealand > > @aaronmorton > http://www.thelastpickle.com > > On 4/01/

Re: help turning compaction..hours of run to get 0% compaction....

2013-01-08 Thread B. Todd Burruss
i'll second edward's comment. cassandra is designed to scale horizontally, so if disk I/O is slowing you down then you must scale On Tue, Jan 8, 2013 at 7:10 AM, Jim Cistaro wrote: > One metric to watch is pending compactions (via nodetool > compactionstats). This count will give you some id

hazelcast

2010-12-10 Thread B. Todd Burruss
http://www.hazelcast.com/product.jsp has anyone tested hazelcast as a distributed locking mechanism for java clients? seems very attractive on the surface.

Re: hazelcast

2010-12-10 Thread B. Todd Burruss
.@gmail.com // sites http://twitter.com/germanklf http://ar.linkedin.com/in/germankondolf On Fri, Dec 10, 2010 at 2:50 PM, B. Todd Burruss wrote: http://www.hazelcast.com/product.jsp has anyone tested hazelcast as a distributed locking mechanism for java clients? seems very attractive on the surface.

Exceptions in RowMutationVerbHandler

2010-12-15 Thread B. Todd Burruss
i am seeing several different exceptions across my 8 node cluster. running 0.7 RC2. the following are all from one node. is this a known issue? ERROR [MutationStage:35] 2010-12-15 09:25:06,466 RowMutationVerbHandler.java (line 83) Error in row mutation org.apache.cassandra.db.Unserializable

cassandra.yaml customization per node

2010-12-30 Thread B. Todd Burruss
how are folks customizing the cassandra.yaml for each node in the cluster. specifically the token and IP address. with XML i used entities, but i'm not familiar with YAML. does yaml support the same concept? or any sort of textual substitution? thx

maven cassandra plugin

2011-01-06 Thread B. Todd Burruss
has anyone created a maven plugin, like cargo for tomcat, for automating starting/stopping a cassandra instance?

Re: maven cassandra plugin

2011-01-06 Thread B. Todd Burruss
nonsense words and other nonsense are a direct result of using swype to type on the screen On 6 Jan 2011 19:38, "B. Todd Burruss" <mailto:bburr...@real.com>> wrote: > has anyone created a maven plugin, like cargo for tomcat, for automating > starting/stopping a cassandra instance?

Re: maven cassandra plugin

2011-01-06 Thread B. Todd Burruss
very useful for automated tasks that needs to run on multiple machines Shiy On 2011 1 6 21:38, "B. Todd Burruss" <mailto:bburr...@real.com>> wrote: has anyone created a maven plugin, like cargo for tomcat, for automating starting/stopping a cassandra instance?

Re: monitoring with Zabbix

2011-01-10 Thread B. Todd Burruss
we use zabbix. we run the agent on our linux boxes and also start zapcat using the class that follows. essentially you go into the zabbix console and setup "hosts" for the zapcat port, and "hosts" for the zabbix agent. then setup items for the "zapcat host" that are JMX metrics. info on zap

repair cause large number of SSTABLEs

2011-01-26 Thread B. Todd Burruss
i ran out of file handles on the "repairing node" after doing nodetool repair - strange as i have never had this issue until using 0.7.0 (but i should say that i have not truly tested 0.7.0 until now.) up'ed the number of file handles, removed data, restarted nodes, then restarted my test. wa

Re: repair cause large number of SSTABLEs

2011-01-27 Thread B. Todd Burruss
ok thx. what about the repair creating hundreds of new sstables and lsof showing cassandra using currently over 800 Data.db files? is this normal? On 01/27/2011 08:40 AM, Brandon Williams wrote: On Thu, Jan 27, 2011 at 10:21 AM, Todd Burruss <mailto:bburr...@real.com>> wrote: t

Re: repair cause large number of SSTABLEs

2011-01-27 Thread B. Todd Burruss
files are marked as -tmp-? On Jan 27, 2011 9:00 AM, "B. Todd Burruss" <mailto:bburr...@real.com>> wrote: > ok thx. what about the repair creating hundreds of new sstables and > lsof showing cassandra using currently over 800 Data.db files? is this > normal? > > On

Re: Secondary Index information

2011-01-28 Thread B. Todd Burruss
batch_mutate doesn't guarantee consistency. each mutation in the batch is guaranteed to be consistent based on your CL, but if it returns an error it means that it couldn't complete all mutations ... but the converse isn't true. it may have successfully completed some mutations. if you get a

0.7.1 release

2011-01-28 Thread B. Todd Burruss
any word on when to expect 0.7.1? lots of good fixes we need. trying to decide if i should apply patches or wait. thx!

Re: Cassandra events next week around Strata

2011-01-28 Thread B. Todd Burruss
web site says sold out, too bad for me ;) On 01/28/2011 07:01 PM, Jonathan Ellis wrote: Next week is the Strata conference and not one, not two, but five Cassandra events! In chronological order: 1. My Strata Cassandra tutorial Tuesday afternoon: http://strataconf.com/strata2011/public/schedul

Re: ORM over Cassandra

2011-02-10 Thread B. Todd Burruss
wiki page is here ... https://github.com/rantav/hector/wiki/Hector-Object-Mapper-(HOM) it does not handle relationships between objects yet, but does handle inheritance On 02/10/2011 12:21 PM, Jonathan Ellis wrote: An o

Another EOFException

2011-02-15 Thread B. Todd Burruss
the following exception seems to be about loading saved caches, but i don't really care about the cache so maybe isn't a big deal. anyway, this is with patched 0.7.1 (0001-Fix-bad-signed-conversion-from-byte-to-int.patch) WARN 11:07:59,800 error reading saved cache /data/cassandra-data/save

Re: Another EOFException

2011-02-15 Thread B. Todd Burruss
the cache keys?" On Tue, Feb 15, 2011 at 1:10 PM, B. Todd Burruss wrote: the following exception seems to be about loading saved caches, but i don't really care about the cache so maybe isn't a big deal. anyway, this is with patched 0.7.1 (0001-Fix-bad-signed-conversion-from-byt

Re: performance tuning - where does the slowness come from?

2010-05-06 Thread B. Todd Burruss
i think you will see a slow down because of large values in your columns. make sure you take a look at MemtableThroughputInMB in your config. if you are writing 1MB of data per row, then you'll probably want to increase this quite a bit so you are not constantly creating sstables. can't reca

Re: Tuning Cassandra

2010-05-10 Thread B. Todd Burruss
have you put your commit log on a disk by itself? not a logical partition shared by oracle or cassandra "data". this will make a difference, as you don't want the cassandra commit logs competing with other OS and oracle I/O. look in storage-conf.xml and see if you can move this. also check

Re: Read Latency

2010-05-11 Thread B. Todd Burruss
you can try this benchmarking tool to compare your drive(s) http://freshmeat.net/projects/fio/ ... you can simulate various loads, etc. my RAID0 outperforms single drive (as mentioned below) under heavy concurrent reads. On 05/11/2010 08:15 AM, Peter Schüller wrote: isolated requests, obvio

0.6.2

2010-05-11 Thread B. Todd Burruss
i was thinking about doing some testing with 0.6.2 ... do the devs consider the tip of 0.6 branch ok to test with?

  1   2   >