Re: help turning compaction..hours of run to get 0% compaction....

2013-01-08 Thread B. Todd Burruss
i'll second edward's comment. cassandra is designed to scale horizontally, so if disk I/O is slowing you down then you must scale On Tue, Jan 8, 2013 at 7:10 AM, Jim Cistaro wrote: > One metric to watch is pending compactions (via nodetool > compactionstats). This count will give you some id

Re: when are keyspace dirs removed?

2013-01-04 Thread B. Todd Burruss
Their should be shapshots in there > https://github.com/apache/cassandra/blob/trunk/conf/cassandra.yaml#L402 > > Cheers > > - > Aaron Morton > Freelance Cassandra Developer > New Zealand > > @aaronmorton > http://www.thelastpickle.com > > On 4/01/

Re: TTL on SecondaryIndex Columns. A bug?

2012-12-19 Thread B. Todd Burruss
i believe we have hit this as well. if you use nodetool to rebuild_index, does it work? On Wed, Dec 19, 2012 at 8:10 PM, aaron morton wrote: > Well that was fun https://issues.apache.org/jira/browse/CASSANDRA-5079 > > Just testing my idea of a fix now. > > Cheers > - > Aaron Mort

Re: entire range of node out of sync -- out of the blue

2012-12-19 Thread B. Todd Burruss
che.org/jira/browse/CASSANDRA-5041 > TBH i think this was a repair without -pr > > thanks, > Andras > > Andras Szerdahelyi* > *Solutions Architect, IgnitionOne | 1831 Diegem E.Mommaertslaan 20A > M: +32 493 05 50 88 | Skype: sandrew84 > > > > > > On 18 Dec 2

Re: Moving data from one datacenter to another

2012-12-19 Thread B. Todd Burruss
to get it "correct", meaning consistent, it seems you will need to do a repair no matter what since the source cluster is taking writes during this time and writing to commit log. so to avoid filename issues just do the first copy and then repair. i am not sure if they can have any filename. to

Re: Does a scrub remove deleted/expired columns?

2012-12-19 Thread B. Todd Burruss
i will add that we have had a good experience with leveled compaction cleaning out tombstoned data faster than size tiered, therefore keeping our total disk usage much more reasonable than size tiered. it is at the cost of I/O ... maybe 2X the I/O?? but that is not bothering us. what is bothering

Re: entire range of node out of sync -- out of the blue

2012-12-18 Thread B. Todd Burruss
in your data directory, for each keyspace there is a solr.json. cassandra stores the SSTABLEs it knows about when using leveled compaction. take a look at that file and see if it looks accurate. if not, this is a bug with cassandra that we are checking into as well On Thu, Dec 6, 2012 at 7:38

Re: Query regarding SSTable timestamps and counts

2012-12-10 Thread B. Todd Burruss
my two cents ... i know this thread is a bit old, but the fact that odd-sized SSTABLEs (usually large ones) will hang around for a while can be very troublesome on disk space and planning. our data is temporal in cassandra, being deleted constantly. we have seen space usage in the 1+ TB range whe

CQL timestamps and timezones

2012-12-07 Thread B. Todd Burruss
trying to figure out if i'm doing something wrong or a bug. i am creating a simple schema, inserting a timestamp using ISO8601 format, but when retrieving the timestamp, the timezone is displayed incorrectly. i'm inserting using GMT, the result is shown with "+", but the time is for my local

Re: removing SSTABLEs

2012-11-12 Thread B. Todd Burruss
tion. >> >> >> On Mon, Nov 12, 2012 at 12:09 PM, B. Todd Burruss wrote: >>> >>> if i stop a node and remove an SSTABLE, let's call it X, is that safe? >>> >>> ok, more info. i know that the data in SSTABLE X has been tombstoned >>>

Re: Multiple Clusters Keyspacse to one core cluster

2012-11-11 Thread B. Todd Burruss
with NetworkTopologyStrategy it theoretically should work http://www.datastax.com/docs/1.0/cluster_architecture/replication On Thu, Nov 8, 2012 at 5:11 PM, ws wrote: > If I have multiple clusters can I replicate a keyspace from each of those > cluster to separate cluster? > >

Re: Replication factor and performance questions

2012-11-10 Thread B. Todd Burruss
@oleg, to answer your last question a cassandra node should never ask another node for information it doesn't have. it uses the key and the partitioner to determine where the data is located before ever contacting another node. On Mon, Nov 5, 2012 at 9:45 AM, Andrey Ilinykh wrote: > You will hav

Re: leveled compaction and tombstoned data

2012-11-08 Thread B. Todd Burruss
, Nov 8, 2012 at 11:53 AM, B. Todd Burruss wrote: > thanks for the links! i had forgotten about live sampling > > On Thu, Nov 8, 2012 at 11:41 AM, Brandon Williams wrote: >> On Thu, Nov 8, 2012 at 1:33 PM, Aaron Turner wrote: >>> There are also ways to bring up a test

Re: leveled compaction and tombstoned data

2012-11-08 Thread B. Todd Burruss
thanks for the links! i had forgotten about live sampling On Thu, Nov 8, 2012 at 11:41 AM, Brandon Williams wrote: > On Thu, Nov 8, 2012 at 1:33 PM, Aaron Turner wrote: >> There are also ways to bring up a test node and just run Level Compaction on >> that. Wish I had a URL handy, but hopefull

Re: leveled compaction and tombstoned data

2012-11-08 Thread B. Todd Burruss
we are running Datastax enterprise and cannot patch it. how bad is "kill performance"? if it is so bad, why is it an option? On Thu, Nov 8, 2012 at 10:17 AM, Radim Kolar wrote: > Dne 8.11.2012 19:12, B. Todd Burruss napsal(a): > >> my question is would leveled compact

leveled compaction and tombstoned data

2012-11-08 Thread B. Todd Burruss
we are having the problem where we have huge SSTABLEs with tombstoned data in them that is not being compacted soon enough (because size tiered compaction requires, by default, 4 like sized SSTABLEs). this is using more disk space than we anticipated. we are very write heavy compared to reads, an

Re: High bandwidth usage between datacenters for cluster

2012-11-01 Thread B. Todd Burruss
bryce, did you resolve this? i'm interested in the outcome. when you write does it help to use CL = LOCAL_QUORUM? On Mon, Oct 29, 2012 at 12:52 AM, aaron morton wrote: > Outbound messages for other DC's are grouped and a single instance is sent > to a single node in the remote DC. The remote no

Re: constant CMS GC using CPU time

2012-10-23 Thread B. Todd Burruss
Regarding memory usage after a repair ... Are the merkle trees kept around? On Oct 23, 2012 3:00 PM, "Bryan Talbot" wrote: > On Mon, Oct 22, 2012 at 6:05 PM, aaron morton wrote: > >> The GC was on-going even when the nodes were not compacting or running a >> heavy application load -- even when th

Re: nodetool cleanup

2012-10-23 Thread B. Todd Burruss
It is typically used after the token > assignments have been changed. > > Cheers > > - > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 23/10/2012, at 6:42 PM, Will @ SOHO wrote: > > On 10/23/2012 01:

nodetool cleanup

2012-10-22 Thread B. Todd Burruss
does "nodetool cleanup" perform a major compaction in the process of removing unwanted data? i seem to remember this to be the case, but can't find anything definitive

Re: tombstones and their data

2012-10-22 Thread B. Todd Burruss
u get >> confirmation. >> >> Dean >> >> On 10/22/12 10:43 AM, "B. Todd Burruss" wrote: >> >>>if a node, X, has a tombstone marking deleted data, when can node X >>>remove the data - not the tombstone, but the data? i understand the >

tombstones and their data

2012-10-22 Thread B. Todd Burruss
if a node, X, has a tombstone marking deleted data, when can node X remove the data - not the tombstone, but the data? i understand the tombstone cannot be removed until GCGraceSeconds has passed, but it seems the data could be compacted away at any time.

Re: Issue removing rows

2012-10-13 Thread B. Todd Burruss
i have used StorageProxy and was forgetting to rewind (or otherwise setup my ByteBuffer properly) and was getting, i believe, the same error. check your ByteBuffers On Sat, Oct 13, 2012 at 8:49 AM, Nick Morizio wrote: > I'm wondering if anyone has seen this issue before: > > We are running Cassa

Re: Option for ordering columns by timestamp in CF

2012-10-12 Thread B. Todd Burruss
trying to think of a use case where you would want to order by timestamp, and also have unique column names for direct access. not really trying to challenge the use case, but you can get ordering by timestamp and still maintain a "name" for the column using composites. if the first component of t

Re: read performance plumetted

2012-10-12 Thread B. Todd Burruss
did the amount of data finally exceed your per machine RAM capacity? is it the same 20% each time you read? or do your periodic reads eventually work through the entire dataset? if you are essentially table scanning your data set, and the size exceeds available RAM, then a degradation like that i

Re: what is more important (RAM vs Cores)

2012-10-12 Thread B. Todd Burruss
i would not worry as much about the single machine specs. find the sweet spot on price for CPU and RAM and by that, then scale horizontal to meet your demand. but .. if i was pressed for a general statement -choose RAM over CPU On Fri, Oct 12, 2012 at 4:34 AM, Serge Fonville wrote: > It seems y

Re: Cassandra nodes loaded unequally

2012-10-12 Thread B. Todd Burruss
are you connecting to the same node every time? if so, spread out your connections across the ring On Fri, Oct 12, 2012 at 1:22 AM, Alexey Zotov wrote: > Hi Ben, > > I suggest you to compare amount of queries for each node. May be the problem > is on the client side. > Yoy can do that using JMX:

Re: 1.1.1 is "repair" still needed ?

2012-10-11 Thread B. Todd Burruss
as of 1.0 (CASSANDRA-2034) hints are generated for nodes that timeout. On Thu, Oct 11, 2012 at 3:55 AM, Watanabe Maki wrote: > Even if HH works fine, HH will not be created until the failure detector > marks the node is dead. > HH will not be created for partially timeouted mutation request ( b

Re: unbalanced ring

2012-10-10 Thread B. Todd Burruss
+972 54 8356490 > Fax: +972 2 5612956 > > > > > > On Wed, Oct 10, 2012 at 6:12 PM, B. Todd Burruss wrote: > >> major compaction in production is fine, however it is a heavy operation >> on the node and will take I/O and some CPU. >> >> the only time i

Re: Upgrading hardware on a node in a cluster

2012-10-10 Thread B. Todd Burruss
if you have N nodes in your cluster, add N new nodes using the new hardware, then decommision the old N nodes. (and migrate to VPC like dean said) On Wed, Oct 10, 2012 at 5:23 AM, Hiller, Dean wrote: > Well, you could use amazon VPC in which case you DO pick the IP yourself > ;)….it makes life

Re: unbalanced ring

2012-10-10 Thread B. Todd Burruss
major compaction in production is fine, however it is a heavy operation on the node and will take I/O and some CPU. the only time i have seen this happen is when i have changed the tokens in the ring, like "nodetool movetoken". cassandra does not auto-delete data that it doesn't use anymore just

Re: cassandra 1.2 beta in production

2012-10-10 Thread B. Todd Burruss
https://issues.apache.org/jira/browse/CASSANDRA/fixforversion/12323284 On Wed, Oct 10, 2012 at 1:41 AM, Alexey Zotov wrote: > Hi Guys, > > What known critical bugs are there that couldn't allow to use 1.2 beta 1 in > production? > We don't use cql and secondary indexes. > > > -- > > Best regards

Re: Remove node from cluster and have it run as a single node cluster by itself

2012-10-05 Thread B. Todd Burruss
i believe the system keyspace keeps track of the cluster topology. even though you changed info in yaml, the system keyspace still knows about the other nodes. remove the system keyspace files from data dir and try again On Fri, Oct 5, 2012 at 4:47 AM, Fredrik wrote: > I guess that the other nod

Re: batch isolation

2012-06-04 Thread Todd Burruss
I don't think I'm being clear. I just was wondering if a "row delete" is isolated with all the other inserts or deletes to a specific column family and key in the same batch. On 6/4/12 1:58 AM, "Sylvain Lebresne" wrote: >On Sun, Jun 3, 2012 at 6:05 PM, Tod

Re: batch isolation

2012-06-03 Thread Todd Burruss
Re: batch isolation On Sun, Jun 3, 2012 at 2:53 AM, Todd Burruss wrote:> > 1 – does this mean that a batch_mutate that first sends a "row delete" > mutation on key X, then subsequent insert mutations for key X is isolated? I'm not sure what you mean by having "a

batch isolation

2012-06-02 Thread Todd Burruss
reading the 1.1 "what's new" here, http://www.datastax.com/docs/1.1/getting_started/new_features, I'm wondering about row level isolation. two questions: 1 – does this mean that a batch_mutate that first sends a "row delete" mutation on key X, then subsequent insert mutations for key X is isol

Re: hinted handoff 16 s delay

2012-02-23 Thread Todd Burruss
if I remember correctly, cassandra has a random delay in it so hint deliver is staggered and does not overwhelm the just restarted node. On 2/23/12 1:46 PM, "Hontvári József Levente" wrote: >I have played with a test cluster, stopping cassandra on one node and >updating a row on another. I notic

Re: Wide Row Performance & Index Question

2012-02-20 Thread Todd Burruss
I believe you will see a slight "unbalance" regardless of your RF with very wide rows, if they are of varying sizes. one node may get a very wide row and another node may get a not so wide row. it's all based on the key. From: aaron morton mailto:aa...@thelastpickle.com>> Reply-To: "user@cassa

Re: Key cache hit rate issue

2012-02-17 Thread Todd Burruss
cachedPosition = getCachedPosition(unifiedKey, true); > >On Thu, Feb 16, 2012 at 4:23 PM, Todd Burruss >wrote: >> jonathan, you said the key to the cache is key + sstable? looking at >>the >> code it looks like a DecoratedKey is the "row key". how does sstabl

Re: Key cache hit rate issue

2012-02-16 Thread Todd Burruss
jonathan, you said the key to the cache is key + sstable? looking at the code it looks like a DecoratedKey is the "row key". how does sstable come into play? On 2/16/12 1:20 PM, "Jonathan Ellis" wrote: >So, you have roughly 1/6 of your (physical) row keys cached and about >1/4 cache hit rate,

Re: Key cache hit rate issue

2012-02-16 Thread Todd Burruss
there is a setting in the yaml file that helps relieve memory pressure by reducing the row cache. it is based on the percent of memory used by the JVM the setting are, reduce_cache_sizes_at and reduce_cache_capacity_to. see how much free memory you have and if the numbers suggest that you have

Re: timed-out retrieving a giant row.

2012-02-14 Thread Todd Burruss
lookup ColumnSliceIterator in hector. this does what you want a lot easier than managing the "paging" From: Yuhan Zhang mailto:yzh...@onescreen.com>> Reply-To: "user@cassandra.apache.org" mailto:user@cassandra.apache.org>> Date: Tue, 14 Feb 2012 14:46:08 -0800

Re: deleting rows and tombstones

2012-02-14 Thread Todd Burruss
ce badly. I'd like to see the end of the whole idea of GCGradeSeconds. On 14 February 2012 21:29, Todd Burruss mailto:bburr...@expedia.com>> wrote: do you find that repair is still as necessary now since hinted handoffs are stored anytime a node does not ACK successfully? From: Domini

Re: deleting rows and tombstones

2012-02-14 Thread Todd Burruss
y. I proposed an alternative approach here https://issues.apache.org/jira/browse/CASSANDRA-3620 so vote it up if you share problems! Dominic On 14 February 2012 19:54, Todd Burruss mailto:bburr...@expedia.com>> wrote: my design calls for deleting a row (by key, not individual columns) and re-inser

Re: deleting rows and tombstones

2012-02-14 Thread Todd Burruss
It that's not, then you will get tombstones that might slow down the reads because they have to be skipped until the next compaction. 2012/2/14 Todd Burruss mailto:bburr...@expedia.com>> my design calls for deleting a row (by key, not individual columns) and re-inserting it a lot and I&

deleting rows and tombstones

2012-02-14 Thread Todd Burruss
my design calls for deleting a row (by key, not individual columns) and re-inserting it a lot and I'm concerned about tombstone build up slowing down reads. I know if I delete a lot of individual columns the tombstones will build up and slow down reads until they are cleaned up, but not sure if

Re: ParNew and caching

2011-11-18 Thread Todd Burruss
Concurrent provider : 500ms 100% cache hit rate. ParNew is the only stat that I see out of line, so seems like still a lot of copying On 11/18/11 2:40 PM, "Mohit Anchlia" wrote: >On Fri, Nov 18, 2011 at 1:46 PM, Todd Burruss >wrote: >> Ok, I figured something li

Re: ParNew and caching

2011-11-18 Thread Todd Burruss
on my use case is that I am picking 50 columns from the 70k. Since the whole row is in the cache, and no copying from off-heap nor disk buffers, seems like it should be faster than non-cache mode. More thoughts :) On 11/18/11 6:39 AM, "Sylvain Lebresne" wrote: >On Fri, Nov 18, 201

ParNew and caching

2011-11-17 Thread Todd Burruss
I'm using cassandra 1.0. Been doing some testing on using cass's cache. When I turn it on (using the CLI) I see ParNew jump from 3-4ms to 200-300ms. This really screws with response times, which jump from ~25-30ms to 1300+ms. I've increase new gen and that helps, but still this is suprising

Re: Network traffic patterns

2011-11-16 Thread Todd Burruss
Are all of your machines equal hardware? Since those machines are sending data somewhere, maybe they are behind in replicating and are continuously catching up? Use a tool like tcpdump to find out where the data is going From: Philippe mailto:watche...@gmail.com>> Reply-To: "user@cassandra.apa

Row Groups

2011-11-09 Thread Todd Burruss
ob/solandra/src/lucandra/dht/RandomPartitioner.java But Row Groups is going to be the "official" way. -Jake On Wed, Nov 9, 2011 at 5:53 PM, Todd Burruss mailto:bburr...@expedia.com>> wrote: Thx jake for the JIRA, but there was someone at the conference that had already implemented

Re: Second Cassandra users survey

2011-11-09 Thread Todd Burruss
Entity Groups : https://issues.apache.org/jira/browse/CASSANDRA-1684 -Jake On Wed, Nov 9, 2011 at 6:44 AM, Todd Burruss mailto:bburr...@expedia.com>> wrote: I believe I heard someone talk at Cassandra SF conference about creating a partitioner that was a derivation of RandomPartitioner.

Re: Second Cassandra users survey

2011-11-08 Thread Todd Burruss
A use case that could use this (but isn't in my top requests) is usage history for a given user. I use a single row to save history per user, each column is a user action with name a TimeUUID and value is a blob. I use the TimeUUID to sort the actions, but I don't really care about exact time. a

Re: Second Cassandra users survey

2011-11-08 Thread Todd Burruss
I believe I heard someone talk at Cassandra SF conference about creating a partitioner that was a derivation of RandomPartitioner. It essentially would look for keys that adhere to a certain pattern, like :. The portion would be used for determining the location on the ring, but : for actually s

Re: Second Cassandra users survey

2011-11-03 Thread Todd Burruss
- Better performance when access random columns in a wide row - caching subsets of wide rows - possibly on the same boundaries as the index - some sort of notification architecture when data is inserted. This could be co-processors, triggers, plugins, etc - auto load balance when adding new nodes

Re: Cassandra cluster HW spec (commit log directory vs data file directory)

2011-10-25 Thread Todd Burruss
This may help determining your data storage requirements ... http://btoddb-cass-storage.blogspot.com/ On 10/25/11 11:22 AM, "Mohit Anchlia" wrote: >On Tue, Oct 25, 2011 at 11:18 AM, Dan Hendry >wrote: >>> 2. ... So I am going to use rotational disk for the commit log and an >>>SSD >>> for da

Re: CMS GC initial-mark taking 6 seconds , bad?

2011-10-20 Thread Todd Burruss
Are you using cassandra's caching? If you are then you will need to play around with the RAM setting to find a sweet spot. A low hit rate on the cache (which is counter productive anyway) will cause more GC. A high hit rate, less GC. If you are not caching, no need to use a large heap as the OS

selective replication

2011-09-14 Thread Todd Burruss
Has anyone done any work on what I'll call "selective replication" between DCs? I want to use Cassandra to replicate data to another virtual DC (for analytical purposes), but only "inserts", not "deletes". Picture having two data centers, DC1 for OLTP of short lived data (say 90 day window) an

Cassandra Storage Sizing

2011-07-20 Thread Todd Burruss
I put together a blog post on Cassandra Storage Sizing so I don't need to keep figuring it out again and again. Hope everyone finds it useful, and give feedback if you find errors. http://btoddb-cass-storage.blogspot.com/ ... enjoy

Re: Another EOFException

2011-02-15 Thread B. Todd Burruss
the cache keys?" On Tue, Feb 15, 2011 at 1:10 PM, B. Todd Burruss wrote: the following exception seems to be about loading saved caches, but i don't really care about the cache so maybe isn't a big deal. anyway, this is with patched 0.7.1 (0001-Fix-bad-signed-conversion-from-byt

Another EOFException

2011-02-15 Thread B. Todd Burruss
the following exception seems to be about loading saved caches, but i don't really care about the cache so maybe isn't a big deal. anyway, this is with patched 0.7.1 (0001-Fix-bad-signed-conversion-from-byte-to-int.patch) WARN 11:07:59,800 error reading saved cache /data/cassandra-data/save

Re: ORM over Cassandra

2011-02-10 Thread B. Todd Burruss
wiki page is here ... https://github.com/rantav/hector/wiki/Hector-Object-Mapper-(HOM) it does not handle relationships between objects yet, but does handle inheritance On 02/10/2011 12:21 PM, Jonathan Ellis wrote: An o

Re: Cassandra events next week around Strata

2011-01-28 Thread B. Todd Burruss
web site says sold out, too bad for me ;) On 01/28/2011 07:01 PM, Jonathan Ellis wrote: Next week is the Strata conference and not one, not two, but five Cassandra events! In chronological order: 1. My Strata Cassandra tutorial Tuesday afternoon: http://strataconf.com/strata2011/public/schedul

0.7.1 release

2011-01-28 Thread B. Todd Burruss
any word on when to expect 0.7.1? lots of good fixes we need. trying to decide if i should apply patches or wait. thx!

Re: Secondary Index information

2011-01-28 Thread B. Todd Burruss
batch_mutate doesn't guarantee consistency. each mutation in the batch is guaranteed to be consistent based on your CL, but if it returns an error it means that it couldn't complete all mutations ... but the converse isn't true. it may have successfully completed some mutations. if you get a

Re: repair cause large number of SSTABLEs

2011-01-27 Thread B. Todd Burruss
files are marked as -tmp-? On Jan 27, 2011 9:00 AM, "B. Todd Burruss" <mailto:bburr...@real.com>> wrote: > ok thx. what about the repair creating hundreds of new sstables and > lsof showing cassandra using currently over 800 Data.db files? is this > normal? > > On

Re: repair cause large number of SSTABLEs

2011-01-27 Thread B. Todd Burruss
ok thx. what about the repair creating hundreds of new sstables and lsof showing cassandra using currently over 800 Data.db files? is this normal? On 01/27/2011 08:40 AM, Brandon Williams wrote: On Thu, Jan 27, 2011 at 10:21 AM, Todd Burruss <mailto:bburr...@real.com>> wrote: t

RE: repair cause large number of SSTABLEs

2011-01-27 Thread Todd Burruss
ndra.apache.org Subject: Re: repair cause large number of SSTABLEs Maybe related to https://issues.apache.org/jira/browse/CASSANDRA-1992 ? On Jan 27, 2011, at Thu Jan 27, 1:22 AM, B. Todd Burruss wrote: i ran out of file handles on the "repairing node" after doing nodetool repair - strange

Re: repair cause large number of SSTABLEs

2011-01-27 Thread Todd Burruss
/CASSANDRA-1959 Aaron On 27 Jan 2011, at 19:22, B. Todd Burruss wrote: > i ran out of file handles on the "repairing node" after doing nodetool repair > - strange as i have never had this issue until using 0.7.0 (but i should say > that i have not truly tested 0.7.0 until now.)

repair cause large number of SSTABLEs

2011-01-26 Thread B. Todd Burruss
i ran out of file handles on the "repairing node" after doing nodetool repair - strange as i have never had this issue until using 0.7.0 (but i should say that i have not truly tested 0.7.0 until now.) up'ed the number of file handles, removed data, restarted nodes, then restarted my test. wa

Re: monitoring with Zabbix

2011-01-10 Thread B. Todd Burruss
we use zabbix. we run the agent on our linux boxes and also start zapcat using the class that follows. essentially you go into the zabbix console and setup "hosts" for the zapcat port, and "hosts" for the zabbix agent. then setup items for the "zapcat host" that are JMX metrics. info on zap

Re: maven cassandra plugin

2011-01-06 Thread B. Todd Burruss
very useful for automated tasks that needs to run on multiple machines Shiy On 2011 1 6 21:38, "B. Todd Burruss" <mailto:bburr...@real.com>> wrote: has anyone created a maven plugin, like cargo for tomcat, for automating starting/stopping a cassandra instance?

Re: maven cassandra plugin

2011-01-06 Thread B. Todd Burruss
nonsense words and other nonsense are a direct result of using swype to type on the screen On 6 Jan 2011 19:38, "B. Todd Burruss" <mailto:bburr...@real.com>> wrote: > has anyone created a maven plugin, like cargo for tomcat, for automating > starting/stopping a cassandra instance?

maven cassandra plugin

2011-01-06 Thread B. Todd Burruss
has anyone created a maven plugin, like cargo for tomcat, for automating starting/stopping a cassandra instance?

cassandra.yaml customization per node

2010-12-30 Thread B. Todd Burruss
how are folks customizing the cassandra.yaml for each node in the cluster. specifically the token and IP address. with XML i used entities, but i'm not familiar with YAML. does yaml support the same concept? or any sort of textual substitution? thx

Exceptions in RowMutationVerbHandler

2010-12-15 Thread B. Todd Burruss
i am seeing several different exceptions across my 8 node cluster. running 0.7 RC2. the following are all from one node. is this a known issue? ERROR [MutationStage:35] 2010-12-15 09:25:06,466 RowMutationVerbHandler.java (line 83) Error in row mutation org.apache.cassandra.db.Unserializable

Re: hazelcast

2010-12-10 Thread B. Todd Burruss
.@gmail.com // sites http://twitter.com/germanklf http://ar.linkedin.com/in/germankondolf On Fri, Dec 10, 2010 at 2:50 PM, B. Todd Burruss wrote: http://www.hazelcast.com/product.jsp has anyone tested hazelcast as a distributed locking mechanism for java clients? seems very attractive on the surface.

hazelcast

2010-12-10 Thread B. Todd Burruss
http://www.hazelcast.com/product.jsp has anyone tested hazelcast as a distributed locking mechanism for java clients? seems very attractive on the surface.

Re: using too much RAM

2010-10-14 Thread B. Todd Burruss
thx, it does say that in the log, but that is probably just a reflection of whatever is read from cassandra.yaml. i am wondering if some unix tool can tell me if my process is mmap'ing files. maybe lsof? On 10/14/2010 12:07 PM, Rob Coli wrote: On 10/14/10 10:59 AM, B. Todd Burruss

using too much RAM

2010-10-14 Thread B. Todd Burruss
0.7.0-beta2 top is reporting my cassandra process as using 11g. i have set "disk_access_mode: standard" and Xmx8G (verified via JMX) i have only noticed using more RAM than Xmx when using mmap i/o. this leads me to believe that disk_access_mode was not set properly, even though it is in t

Re: Silent Crash

2010-10-13 Thread B. Todd Burruss
if it is actually corrupted). Do you know if compact or repair would detect bad data and disregard it? I'd like to try something like that if possible before just upgrading the JVM and potentially hiding the real problem. On Wed, Oct 13, 2010 at 9:35 PM, B. Todd Burruss <mailto:bbur

Re: Silent Crash

2010-10-13 Thread B. Todd Burruss
you should upgrade to the latest version of the JVM, 1.6.0_21 there was a bug around 1.6.0_18 (or there abouts) that affected cassandra On 10/13/2010 07:55 PM, Eric Czech wrote: And this is the java version: java version "1.6.0_13" Java(TM) SE Runtime Environment (build 1.6.0_13-b03) Java Hot

Re: getSchemaVersion

2010-10-11 Thread B. Todd Burruss
On 10/11/2010 06:14 PM, Jonathan Ellis wrote: On Mon, Oct 11, 2010 at 7:53 PM, B. Todd Burruss wrote: to determine if my programmatic schema changes have been distributed throughout the cluster, I am supposed to use getSchemaVersionMap, correct? my question is how do I properly use it? I

getSchemaVersion

2010-10-11 Thread B. Todd Burruss
to determine if my programmatic schema changes have been distributed throughout the cluster, I am supposed to use getSchemaVersionMap, correct? my question is how do I properly use it? I have the schema version returned from the thrift method, and I can lookup in the schema map returned getS

Re: Advice on settings

2010-10-07 Thread B. Todd Burruss
if you are updating columns quite rapidly, you will scatter the columns over many sstables as you update them over time. this means that a read of a specific column will require looking at more sstables to find the data. performing a compaction (using nodetool) will merge the sstables into on

Re: [RELEASE] 0.7.0 beta2

2010-10-01 Thread B. Todd Burruss
i don't see a beta2 subversion tag. is there one? On 10/01/2010 11:56 AM, Eric Evans wrote: It's like Christmas in October, but without the long lines. First, the obligatory disclaimer. This is beta software. It's like a teenage driver, it seems as though it's up to the task, and it almost i

Re: drop/recreate column family race condition

2010-09-07 Thread B. Todd Burruss
interesting is that "truncate" API doesn't return a schema version nor take a consistency level. does this mean that when it returns the cluster is always consistent? On 09/07/2010 02:50 PM, Jonathan Ellis wrote: On Tue, Sep 7, 2010 at 4:29 PM, B. Todd Burruss wrote:

Re: drop/recreate column family race condition

2010-09-07 Thread B. Todd Burruss
5 secs isn't enough for me, 10 is good. i haven't tried any other values as i can get around this through another manner. On 09/07/2010 02:24 PM, Edward Capriolo wrote: On Tue, Sep 7, 2010 at 5:10 PM, Jonathan Ellis wrote: On Tue, Sep 7, 2010 at 3:55 PM, B. Todd Burr

Re: drop/recreate column family race condition

2010-09-07 Thread B. Todd Burruss
https://issues.apache.org/jira/browse/CASSANDRA-1477 comments below On 09/07/2010 02:10 PM, Jonathan Ellis wrote: On Tue, Sep 7, 2010 at 3:55 PM, B. Todd Burruss wrote: using 0.7 latest from trunk as of few minutes ago. 1 client, 1 node i have the scenario where i want to drop a column

drop/recreate column family race condition

2010-09-07 Thread B. Todd Burruss
using 0.7 latest from trunk as of few minutes ago. 1 client, 1 node i have the scenario where i want to drop a column family and recreate it - unit testing for instance, is a good reason you may want to do this (always start fresh). the problem i observe is that if i do the following: 1 - d

Re: RowMutationVerbHandler.java (line 78) Error in row mutation

2010-08-28 Thread Todd Burruss
at 3:58 PM, B. Todd Burruss wrote: > i got the latest code this morning. i'm testing with 0.7 > > > ERROR [ROW-MUTATION-STAGE:388] 2010-08-27 15:54:58,053 > RowMutationVerbHandler.java (line 78) Error in row mutation > org.apache.cassandra.db.UnserializableColumnFamilyE

RowMutationVerbHandler.java (line 78) Error in row mutation

2010-08-27 Thread B. Todd Burruss
i got the latest code this morning. i'm testing with 0.7 ERROR [ROW-MUTATION-STAGE:388] 2010-08-27 15:54:58,053 RowMutationVerbHandler.java (line 78) Error in row mutation org.apache.cassandra.db.UnserializableColumnFamilyException: Couldn't find cfId=1002 at org.apache.cassandra.db.Colu

Internal error processing batch_mutate

2010-08-27 Thread B. Todd Burruss
i got the latest code from tip of trunk this morning, but i'm seeing this. i recall a JIRA about this. maybe patch isn't on trunk? ERROR [pool-1-thread-61] 2010-08-27 15:55:36,429 Cassandra.java (line 2980) Internal error processing batch_mutate java.lang.NullPointerException at org.ap

Re: get_slice slow

2010-08-25 Thread B. Todd Burruss
if you can reduce the tombstone volume, say by switching to a new row every 5 minutes, that would help a lot. On Wed, Aug 25, 2010 at 11:43 AM, B. Todd Burruss wrote: i did check sstables, and there are only three. i haven't done any major compacts. do u think it is taking so long

Re: get_slice slow

2010-08-25 Thread B. Todd Burruss
Long rows written over long periods of time are almost certain to give worse read performance, even far worse, than rows written all at once. b On Tue, Aug 24, 2010 at 10:17 PM, B. Todd Burruss wrote: thx artie, i haven't used a super CF because i thought it has more trouble doing slic

Re: get_slice slow

2010-08-24 Thread B. Todd Burruss
el? Artie On Tue, Aug 24, 2010 at 9:14 PM, B. Todd Burruss <mailto:bburr...@real.com>> wrote: i am using get_slice to pull columns from a row to emulate a queue. column names are TimeUUID and the values are small, < 32 bytes. simple ColumnFamily. i am using SliceP

get_slice slow

2010-08-24 Thread B. Todd Burruss
i am using get_slice to pull columns from a row to emulate a queue. column names are TimeUUID and the values are small, < 32 bytes. simple ColumnFamily. i am using SlicePredicate like this to pull the first ("oldest") column in the row: SlicePredicate predicate = new SlicePredicate

Re: KeyRange.token in 0.7.0

2010-08-24 Thread B. Todd Burruss
. :) > > On Tue, Aug 24, 2010 at 1:28 PM, B. Todd Burruss wrote: > > i just came across this and i use tokens in range queries because it is > > an easy straightforward way to divide the keyspace and operate on it > > using multiple threads and throttle the processing. may

Re: KeyRange.token in 0.7.0

2010-08-24 Thread B. Todd Burruss
i just came across this and i use tokens in range queries because it is an easy straightforward way to divide the keyspace and operate on it using multiple threads and throttle the processing. maybe this is what hadoop does, i don't know much about hadoop. so i don't really agree that i'm doing i

Re: linux flavor?

2010-08-24 Thread B. Todd Burruss
CentOS works fine for me. straight out-o-the box. i also use ubuntu 10.04 w/o any troubles. make sure to jave jdk 1.6.0_20 or better. there was a bug that affects cassandra somewhere around 1.6.0_18 i think. On Tue, 2010-08-24 at 08:58 -0700, S Ahmed wrote: > Is there a particular linux flavor

  1   2   >