Re: Dynamic schema modification an anti-pattern?

2014-10-07 Thread Colin
Anti-pattern. Dynamically altering the schema won't scale and is bad ju ju. -- Colin Clark +1-320-221-9531 > On Oct 6, 2014, at 10:56 PM, Todd Fast wrote: > > There is a team at my work building a entity-attribute-value (EAV) store > using Cassandra. There is a colu

Re: which snitch ?

2014-10-26 Thread Colin
I would try propertyfilesnitch and use the public ip's of the nodes in aws. You'll need to set the configuration files on each node. > On Oct 26, 2014, at 9:44 PM, Srinivas Chamarthi > wrote: > > what about for the nodes on the private cloud cluster ? if I mention, > ec2MultiRegion, it is

Re: Repair/Compaction Completion Confirmation

2014-10-28 Thread Colin
When I use virtual nodes, I typically use a much smaller number - usually in the range of 10. This gives me the ability to add nodes easier without the performance hit. -- Colin Clark +1-320-221-9531 > On Oct 28, 2014, at 10:46 AM, Alain RODRIGUEZ wrote: > > I have been tr

Re: opscenter with community cassandra

2014-10-28 Thread Colin
I cant run opscenter in a secure environment for a couple of reasons, one - it phones home, two - lack of role based security. It is a mistake to call a proprietary piece of software community when you cant use it in production. It is easy enough to automate what opscenter does rather than rely

Re: opscenter with community cassandra

2014-10-28 Thread Colin
specific offerings. -- Colin Clark +1-320-221-9531 > On Oct 28, 2014, at 10:41 AM, Tyler Hobbs wrote: > > >> On Tue, Oct 28, 2014 at 10:08 AM, Colin wrote: >> It is a mistake to call a proprietary piece of software community when you >> cant use it in production.

Re: opscenter with community cassandra

2014-10-28 Thread Colin
cassandra with datatstax proprietary offerings. And now I will tell you that, in a blanket statement, opscenter *IS NOT READY FOR A PRODUCTION ENVIRONMENT* Your mileage may vary, and you might not take security very seriously, so go head and expose your cluster. Enjoy! -- Colin Clark +1-320

Re: Cassandra for Analytics?

2014-12-18 Thread Colin
Almost every stream processing system I know of offers joins out of the box and has done so for years Even open source offerings like Esper have offered joins for years. What hasnt are systems like storm, spark, etc which I dont really classify as stream processors anyway. -- Colin

Re: STCS limitation with JBOD?

2015-01-02 Thread Colin
Forcing a major compaction is usually a bad idea. What is your reason for doing that? -- Colin Clark +1-320-221-9531 > On Jan 2, 2015, at 1:17 PM, Dan Kinder wrote: > > Hi, > > Forcing a major compaction (using nodetool compact) with STCS will result in > a single

Re: Best approach in Cassandra (+ Spark?) for Continuous Queries?

2015-01-03 Thread Colin
Use a message bus with a transactional get, get the message, send to cassandra, upon write success, submit to esp, commit get on bus. Messaging systems like rabbitmq support this semantic. Using cassandra as a queuing mechanism is an anti-pattern. -- Colin Clark +1-320-221-9531 > On Ja

Re: who owns the data?

2015-01-17 Thread Colin
Try running Nodetool status system By specifying a keyspace (system) in the command, you should get more meaningful results. Using the command on keyspaces as you dev/test/etc will provide real results. -- Colin Clark +1-320-221-9531 > On Jan 17, 2015, at 7:22 PM, Tim Dunphy wr

Re: number of replicas per data center?

2015-01-18 Thread Colin
stractions externally as internally. -- Colin Clark +1-320-221-9531 > On Jan 18, 2015, at 9:49 PM, Kevin Burton wrote: > > How do people normally setup multiple data center replication in terms of > number of *local* replicas? > > So say you have two data centers, do you

Re: Any problem mounting a keyspace directory in ram memory?

2015-02-01 Thread Colin
Until the in-memory option stores data off heap, I would strongly recommend staying away from this option. This was a marketing driven hack in my opinion. -- Colin Clark +1 612 859 6129 Skype colin.p.clark > On Feb 2, 2015, at 5:31 AM, Jan wrote: > > HI Gabriel; > > I don

Re: Question about use scenario with fulltext search

2015-02-01 Thread Colin
o get the data from the other db's. This avoids conflation of concerns, isolates failures, but is dependent upon multiple writes. I use a message bus and services based approach. In my experience, at scale this approach works better and avoids vendor lock in. -- Colin Clark +1 612 859

Re: Mutable primary key in a table

2015-02-08 Thread Colin
nized by uuid, and user name (cluster on the name). This pattern is referred to as an inverted index and provides a lot of power and flexibility once mastered. I use it all the time with cassandra - in fact, to be successful with cassandra, it might actually be a requirement! -- Colin Clark +

Re: High GC activity on node with 4TB on data

2015-02-08 Thread Colin
The most data I put on a node with spinning disk is 1TB. What are the machine specs? Cpu, memory, etc and what is the read/write pattern-heavy ingest rate/heavy read rate and how ling do you keep data in the cluster? -- Colin Clark +1 612 859 6129 Skype colin.p.clark > On Feb 8, 2015, a

Re: How to remove obsolete error message in Datastax Opscenter?

2015-02-09 Thread Colin
Stop using opscenter? :) Sorry, couldnt resist... -- Colin Clark +1 612 859 6129 Skype colin.p.clark > On Feb 9, 2015, at 3:01 AM, Björn Hachmann wrote: > > Good morning, > > unfortunately my last rolling restart of our Cassandra cluster issued from > OpsCenter (5.0.2) f

Re: How to speed up SELECT * query in Cassandra

2015-02-11 Thread Colin
gives more flexibility in terms of large datasets and performance. The very nature of cassandra's distributed nature vs partitioning data on hadoop makes spark on hdfs actually fasted than on cassandra -- Colin Clark +1 612 859 6129 Skype colin.p.clark > On Feb 11, 2015, at 4:49

Re: How to speed up SELECT * query in Cassandra

2015-02-11 Thread Colin
it's in production at my company) -- Colin Clark +1 612 859 6129 Skype colin.p.clark > On Feb 11, 2015, at 6:51 AM, DuyHai Doan wrote: > > "The very nature of cassandra's distributed nature vs partitioning data on > hadoop makes spark on hdfs actually fasted than o

Re: How to speed up SELECT * query in Cassandra

2015-02-11 Thread Colin
No, the question isnt closed. You dont get to decide that. I dont run a website making claims regarding cassandra and spark - your employer does. Again, where are your benchmarks? I will publish mine, then we'll see what you've got. -- Colin Clark +1 612 859 6129 Skype col

Re: What's to think of when increasing disk size on Cassandra nodes?

2015-04-08 Thread Colin
Yikes, 18tb/node is a very bad idea. I dont like to go over 2-3 personally and you have to be careful with JBOD. See one of Ellis's latest posts on this and suggested use of LVM. It is a reversal on previous position re JBOD. -- Colin +1 612 859 6129 Skype colin.p.clark > On Apr

Re: Tracking word frequencies

2014-01-20 Thread Colin
When updating, use table that uses rows of words and increment the count? -- Colin +1 320 221 9531 > On Jan 20, 2014, at 6:58 AM, David Tinker wrote: > > I haven't actually tried to use that schema yet, it was just my first idea. > If we use that solution our app would

Re: Datamodel for a highscore list

2014-01-22 Thread Colin
Read users score, increment, update friends list, update user with new high score Would that work? -- Colin +1 320 221 9531 > On Jan 22, 2014, at 11:44 AM, Kasper Middelboe Petersen > wrote: > > Hi! > > I'm a little worried about the data model I have co

Re: Datamodel for a highscore list

2014-01-22 Thread Colin
Might this work? -- Colin +1 320 221 9531 > On Jan 22, 2014, at 11:13 AM, Kasper Middelboe Petersen > wrote: > > Many million users. Just the one game- I might have some different scores I > need to keep track of, but I very much hope to be able to use the same > app

Re: OpenJDK is not recommended? Why

2014-01-28 Thread Colin
node knows that address as well. There's a good guide to configuration on the datastax website. -- Colin +1 320 221 9531 > On Jan 28, 2014, at 9:55 PM, Kumar Ranjan wrote: > > I am in process of setting 2 node cluster with C* version 2.0.4. When I > started each n

Re: Which hector version is suitable for cassandra 2.0.6 ?

2014-03-27 Thread Colin
Have you tried the Datastax java driver? -- Colin 320-221-9531 > On Mar 27, 2014, at 8:17 AM, user 01 wrote: > > Which hector version is suitable for cassandra 2.0.6 ? > > I am seeing that version 1.1-4(which I believe is latest release?) has been > there around since v

Re: (SOLVED) Installing Datastax Cassandra 1.2.15 Using Yum (Java Issue)

2014-03-28 Thread Colin
OpenJDK will crash under load whilst running Cassandra. -- Colin +1 320 221 9531 > On Mar 28, 2014, at 4:11 PM, Jon Forrest wrote: > > In a previous message I described my guess at > what was causing the Datastax Cassandra installation > to require OpenJDK. Using the me

Re: How safe is "nodetool move" in 1.2 ?

2014-04-16 Thread Colin
replicas. I will be doing some more testing on this under different scenarios, but so far so good However, I would strongly recommend an RF of at least 3 when performing quorum based reads because otherwise you're subject to failed reads in event of losing one node. -- Colin 320-221-9531

Re: JDK 8

2014-04-28 Thread Colin
It seems to run ok, but I havent seen it yet in production on 8. -- Colin Clark +1-320-221-9531 > On Apr 28, 2014, at 4:01 PM, "Ackerman, Mitchell" > wrote: > > I’ve been searching around, but cannot find any information as to whether > Cassandra runs on JRE 8.

Re: What % of cassandra developers are employed by Datastax?

2014-05-16 Thread Colin
that with the pure proprietary model, and I'm sure it will help you sleep easier. -- Colin Clark +1-320-221-9531 > On May 15, 2014, at 10:52 AM, "Jack Krupansky" > wrote: > > You can always check the project committer wiki: > http://wiki.apache.org/cassandra/Comm

Re: initial token crashes cassandra

2014-05-17 Thread Colin
You may have used the old random partitioner token generator. Use the murmur partitioner token generator instead. -- Colin 320-221-9531 > On May 17, 2014, at 1:15 PM, Tim Dunphy wrote: > > Hey all, > > I've set my initial_token in cassandra 2.0.7 using a python scri

Re: vcdiff/bmdiff , cassandra , and the ordered partitioner…

2014-05-17 Thread Colin
Cassandra offers compression out of the box. Look into the options available upon table creation. The use of orderedpartitioner is an anti-pattern 999/1000 times. It creates hot spots - the use of wide rows can often accomplish the same result through the use of clustering columns. -- Colin

Re: Ordering of schema updates and data modifications

2014-05-18 Thread Colin
make sure a good number of nodes are in agreement before proceeding. Does this make sense? -- Colin Clark +1-320-221-9531 > On May 18, 2014, at 3:30 AM, Jan Algermissen > wrote: > > Hi, > > in our project, we apparently have a problem or misunderstanding of the >

Re: Possible to Add multiple columns in one query ?

2014-05-25 Thread Colin
post a link to my github with an example when I get off the road -- Colin 320-221-9531 > On May 25, 2014, at 1:56 PM, "Jack Krupansky" wrote: > > Typo: I presume “channelid” should be “tagid” for the partition key for your > table. > > Yes, BATCH statements are

Re: Possible to Add multiple columns in one query ?

2014-05-25 Thread Colin
Also, make sure you're using prepared statements. -- Colin 320-221-9531 > On May 25, 2014, at 1:56 PM, "Jack Krupansky" wrote: > > Typo: I presume “channelid” should be “tagid” for the partition key for your > table. > > Yes, BATCH statements are the way

Re: Avoiding High Cell Tombstone Count

2014-05-27 Thread Colin
Charlie, I would be willing to help you out with your issues tomorrow afternoon, feel free to give me a call after 4m ET. There are lots of people who store *and* update data with cassandra (at scale). -- Colin Clark | Solutions Architect DataStax | www.datastax.com m | +1-320-221-9531 e

Re: Cassandra CSV & JSON uploader

2014-05-27 Thread Colin
Why wouldnt you use datastax enterprise in-memory option vs oracle coherence? > On May 27, 2014, at 10:33 PM, Samir Faci wrote: > > http://www.datastax.com/docs/1.0/references/sstable2json might be what > you're looking for. It's in the bin folder of your cassandra installation. > > Thou

Re: Batched statements two different cassandra clusters.

2014-06-01 Thread Colin
Set it up as one cluster with multiple datacenters and configure replication accordingly. -- Colin Clark +1-320-221-9531 > On Jun 1, 2014, at 2:43 PM, Kevin Burton wrote: > > As far as I understand, this is impossible. > > There isn't a way to figure out which sta

Re: Tune cache MB settings per table.

2014-06-01 Thread Colin
The OS should handle this really well as long as your on v3 linux kernel -- Colin Clark +1-320-221-9531 > On Jun 1, 2014, at 2:49 PM, Kevin Burton wrote: > > It's possible to set caching to: > > all, keys_only, rows_only, or none > > .. for a given table.

Re: Tune cache MB settings per table.

2014-06-01 Thread Colin
Have you been unable to achieve your SLA's using Cassandra out of the box so far? Based upon my experience, trying to tune Cassandra before the app is done and without simulating real world load patterns, you might actually be doing yourself a disservice. -- Colin 320-221-9531 >

Re: Tune cache MB settings per table.

2014-06-01 Thread Colin
Your data model will most likely be the far most important component of your migration. Get that right, and the rest is easy. -- Colin Clark +1-320-221-9531 > On Jun 1, 2014, at 7:01 PM, Kevin Burton wrote: > > Good question. still migrating.. but we don't want to paint ou

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin
I believe Byteorderedpartitioner is being deprecated and for good reason. I would look at what you could achieve by using wide rows and murmur3partitioner. -- Colin 320-221-9531 > On Jun 6, 2014, at 5:27 PM, Kevin Burton wrote: > > We have the requirement to have clients read

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin
might prove useful. -- Colin 320-221-9531 > On Jun 7, 2014, at 1:53 PM, Kevin Burton wrote: > > Another way around this is to have a separate table storing the number of > buckets. > > This way if you have too few buckets, you can just increase them in the > future. >

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin
The add seconds to the bucket. Also, the data will get cached-it's not going to hit disk on every read. Look at the key cache settings on the table. Also, in 2.1 you have even more control over caching. -- Colin 320-221-9531 > On Jun 7, 2014, at 4:30 PM, Kevin Burton wrote: >

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin
To have any redundancy in the system, start with at least 3 nodes and a replication factor of 3. Try to have at least 8 cores, 32 gig ram, and separate disks for log and data. Will you be replicating data across data centers? -- Colin 320-221-9531 > On Jun 7, 2014, at 9:40 PM, Kevin Bur

Re: Object mapper for CQL

2014-06-08 Thread Colin
ion so far, I don't think you need to get too funky with your data access layer. Whatever you do, make sure the driver you use supports CQL 3 and the native protocol. Thrift, like BOP, will most likely go away at some point in the future. -- Colin 320-221-9531 > On Jun 8, 2014, at 8:58

Re: Object mapper for CQL

2014-06-08 Thread Colin
has asked-his project is ambitious for a first dip into Cassandra, I want to see him succeed, and have given him the same advice I give our customers. -- Colin 320-221-9531 > On Jun 8, 2014, at 9:43 PM, Jeff Genender wrote: > > Comments in line... > >> On Jun 8, 2014, at 8

Re: Object mapper for CQL

2014-06-08 Thread Colin
ted in the same thing-helping people be successful using Casaandra. -- Colin 320-221-9531 > On Jun 8, 2014, at 10:24 PM, Johan Edstrom wrote: > > On a second reply I'll provide some docs. > > We looked at Astynax (Yeah I didn't like the refactor) > We looked at sprin

Re: Fetching ONE cell with a row cache hit takes 1 second on an idle box?

2014-07-01 Thread Colin
Rowcache is typically turned off because it is only useful in very specific situations-the row(s) need to fit in memory. Also, the access patterns have to fit. If all the rows you're accessing can fit, Rowcache is a great thing. Otherwise, not so great. -- Colin 320-221-9531 >

Re: need some help with counters

2011-06-09 Thread Colin
Hey guy, have you tried amazon turk? -- Colin Clark +1 315 886 3422 cell +1 701 212 4314 office http://cloudeventprocessing.com http://blog.cloudeventprocessing.com @EventCloudPro *Sent from Star Trek like flat panel device, which although larger than my Star Trek like communicator device, may

bring out your rpms...

2011-06-14 Thread Colin
Does anyone know where an rpm for 0.7.6-2 might be? (rhel) I checked the datastax site and only see up to 0.7.6-1

RE: bring out your rpms...

2011-06-14 Thread Colin
packaging. This keeps coming up though, so I'll probably just go ahead and roll a 0.7.6-2 for rpm.datastax.com so as not to confuse folks. On Tue, Jun 14, 2011 at 4:19 PM, Colin wrote: > Does anyone know where an rpm for 0.7.6-2 might be? (rhel) > > > > I checked the datastax sit

Re: Not all data structures need timestamps (and don't require wasted memory).

2011-09-04 Thread Colin
Kevin, You will find that many of us using cassanda are already doing what you suggest (custom serializer/deserializer). We call it JSON. -- Colin *Sent from Star Trek like flat panel device, which although larger than my Star Trek like communicator device, may have typo's and ex

Re: disappointed

2013-07-25 Thread Colin
Mysql? -- Colin +1 320 221 9531 On Jul 25, 2013, at 6:08 AM, Derek Andree wrote: > Yeah, Rob is smart. don't run crap in production. Run what others are > stable at. If you are running the latest greatest dumbest craziest in prod > then you ask for fail, and you will

Re: Cassandra book/tuturial

2013-10-27 Thread Colin
I wouldnt buy that book, it's old and not too useful. Find some tutorials and dive in. > On Oct 27, 2013, at 8:54 PM, Mohan L wrote: > > > > >> On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi wrote: >> Hey Guys, >> >> What is the best book to learn Cassandra from scratch? >> >> Thanks

Unsubscribe

2010-12-12 Thread Colin
Unsubscribe Please Sent from my iPad On Dec 12, 2010, at 1:26 AM, Dave Martin wrote: > Hi there, > > I see the following: > > 1) Add 8,000,000 columns to a single row. Each column name is a UUID. > 2) Use cassandra-cli to run count keyspace.cf['myGUID'] > > The following is reported in the

Re: Ditching Cassandra

2011-03-29 Thread Colin
Eric, Seems like the answer to everything is 8. 8 has been very painful. Are you saying that 8 will or not be compatible with 7? If not, would you recommend waiting until 8? We have done an awful lot of work, have an awful lot of work left, and have become very frustrated. Any idea on when

RE: Ditching Cassandra

2011-03-29 Thread Colin
rom: Edward Capriolo [mailto:edlinuxg...@gmail.com] Sent: Tuesday, March 29, 2011 9:58 PM To: user@cassandra.apache.org Subject: Re: Ditching Cassandra On Tue, Mar 29, 2011 at 9:56 PM, Colin wrote: > Eric, > > Seems like the answer to everything is 8. > > 8 has been very painful. >

RE: Ditching Cassandra

2011-03-29 Thread Colin
Thank you Eric. I appreciate it. -Original Message- From: Eric Evans [mailto:eev...@rackspace.com] Sent: Tuesday, March 29, 2011 11:47 PM To: user@cassandra.apache.org Subject: Re: Ditching Cassandra On Tue, 2011-03-29 at 20:56 -0500, Colin wrote: > Are you saying that 8 will or not

unsubscribe

2010-09-22 Thread Colin
unsubscribe -Original Message- From: Aaron Morton [mailto:aa...@thelastpickle.com] Sent: Wednesday, September 22, 2010 7:47 PM To: user@cassandra.apache.org Subject: Re: column expiration and rows in 0.7 The data will only be physically deleted when a major compaction runs and the GCGra

Re: Ring show high load average when restarting a node.

2016-12-06 Thread Colin Kuo
Hi, The feature of speculative execution in Cassandra 2.0 helps in this case. You can get further explanation on below link. http://www.datastax.com/dev/blog/rapid-read-protection-in-cassandra-2-0-2 Thanks! On Tue, Dec 6, 2016 at 10:13 AM, Sungju Hong wrote: > Hello, > > when I restart a node

Re: Creating a keyspace fails

2013-01-22 Thread Colin Blower
he version you are using. It might also be nice for DataStax to update the 1.1 documentation with a warning. -- *Colin Blower* On 01/22/2013 04:06 AM, Paul van Hoven wrote: Okay, that worked. Why is the statement from the tutorial wrong. I mean, why would a company like datastax post somthing

Re: decommissioning a cassandra node

2014-10-27 Thread Colin Kuo
Hi Tim, The node with IP 94 is leaving. Maybe something wrong happens during streaming data. You could use "nodetool netstats" on both nodes to monitor if there is any streaming connection stuck. Indeed, you could force remove the leaving node by shutting down it directly. Then, perform "nodetool

Re: sstables keep growing on cassandra 2.1

2014-11-19 Thread Colin Kuo
Hi, Can you please firstly check the "nodetool compactionstats" during repair? I'm afraid that minor compaction may be blocked by whatever tasks that causes the number of SStable keep growing. On Sat, Nov 15, 2014 at 7:47 AM, James Derieg wrote: > Hi everyone, > I'm hoping someone can help me w

Cassandra on Ceph

2015-01-31 Thread Colin Taylor
I may be forced to run Cassandra on top of Ceph. Does anyone have experience / tips with this. Or alternatively, strong reasons why this won't work. cheers Colin

Re: Cassandra on Ceph

2015-02-01 Thread Colin Taylor
nt (excepting for C*) and that's their standard. cheers Colin On 2 February 2015 at 14:42, Daniel Compton wrote: > As Jan has already mentioned, Ceph and Cassandra do almost all of the same > things. "Replicated self healing data storage on commodity hardware without > a SPOF&

Re: Cassandra on Ceph

2015-02-01 Thread Colin Taylor
Oops - Nonetheless in on my environments -> Nonetheless in *one of* my environments On 2 February 2015 at 16:12, Colin Taylor wrote: > Thanks all for you input. > > I'm aware of the overlap, I'm aware I need to turn Ceph replication off, > I'm aware this isn

Re: Mutable primary key in a table

2015-02-08 Thread Colin Clark
usually considered a bad idea and is simply not even permitted by most RDBMS. — Colin Clark co...@clark.ws +1 320-221-9531 skype colin.p.clark > On Feb 8, 2015, at 4:16 PM, Eric Stevens wrote: > > It sounds like changing user names is the kind of thing which doesn't happen > often,

Re: Replication to second data center with different number of nodes

2015-03-28 Thread Colin Clark
I typically use a # a lot lower than 256, usually less than 20 for num_tokens as a larger number has historically had a dramatic impact on query performance. — Colin Clark co...@clark.ws +1 612-859-6129 skype colin.p.clark > On Mar 28, 2015, at 3:46 PM, Eric Stevens wrote: > >

Re: Cassandra 1.2.9 will not start

2015-04-18 Thread Colin Clark
unsubscribe > On Apr 18, 2015, at 4:26 PM, Bill Miller wrote: > > I tried restarting two nodes that were working and now I get this. > > > > INFO 15:13:50,296 Initializing system.range_xfers > INFO 15:13:50,300 Initializing system.schema_keyspaces > INFO 15:13:50,301 Opening > /cassandra/

Re: Adding nodes to existing cluster

2015-04-20 Thread Colin Clark
unsubscribe > On Apr 20, 2015, at 8:08 AM, Carlos Rolo wrote: > > Independent of the snitch, data needs to travel to the new nodes (plus all > the keyspace information that goes via gossip). So I won't bootstrap them all > at once, even if it is only for network traffic generated. > > Don't

Setting up Cassandra to store on a specific node and not replicate

2013-12-18 Thread Colin MacDonald
nodes in a cluster? I’m not seeing an obvious configuration option for that, presumably because it obviates much of the point of using Cassandra, but again, we’re working within some rather unfortunate constraints. Any hints or suggestions would be most gratefully received. Kind regards, -Colin

RE: Setting up Cassandra to store on a specific node and not replicate

2013-12-18 Thread Colin MacDonald
ally is a question of: *can* I cripple Cassandra in this way, and if so how? Thanks for the response. -Colin MacDonald-

RE: Setting up Cassandra to store on a specific node and not replicate

2013-12-18 Thread Colin MacDonald
datacentre" a logical group *within* an overall cluster? So I can create a separate "datacentre" for each node, and if I write to that node the data will be forced to stay in that datacentre, i.e. that node? I do apologise for the noobish questions, my attention is currently split between investigating several possible solutions. I rather favour Cassandra though, if I can hobble it appropriately. Kind regards, -Colin MacDonald-

RE: Setting up Cassandra to store on a specific node and not replicate

2013-12-18 Thread Colin MacDonald
the > "other nodes" group, C* will gladly honor it and store no data on node of the > "other nodes" group). > > -- > Sylvain Thank you so much, that's clear and helpful. I appreciate you taking the time to explain it. -Colin-

Re: Help me to find Compatable JDBC jar for Apache Cassandra 2.0.4

2014-01-22 Thread Colin Clark
Is the jar on the path? Is cassandra home set correctly? Looks like cassandra cant find the jar-verify existence by searching. -- Colin +1 320 221 9531 On Jan 22, 2014, at 11:50 AM, Chiranjeevi Ravilla wrote: Hi All, I am using Apache Cassandra 2.0.4 version with cassandra-jdbc-1.2.5

Re: Datamodel for a highscore list

2014-01-22 Thread Colin Clark
How many users and how many games? -- Colin +1 320 221 9531 On Jan 22, 2014, at 10:59 AM, Kasper Middelboe Petersen < kas...@sybogames.com> wrote: I can think of two cases where something bad would happen in this case: 1. Something bad happens after the increment but before some or all

Re: Datamodel for a highscore list

2014-01-23 Thread Colin Clark
r's friend list on one row. The row would look like this: ROWID USERID Colin +1 320 221 9531 On Thu, Jan 23, 2014 at 2:34 AM, Kasper Middelboe Petersen < kas...@sybogames.com> wrote: > What would the consequence be of having this updated highscore table > (using friendId as part

Re: Datamodel for a highscore list

2014-01-23 Thread Colin Clark
y updating all rows with that userid with the new score That way, all friends are on the same row, which makes query easy. And you can still issue query to find the top score across the entire userbase by querying userid, and userscore. Is this a better explanation of my previous and lame explanat

Re: How do you remote backup your cassandra nodes ?

2014-02-21 Thread Colin Blower
.com>> wrote: > > What is your strategy/tools set to backup your Cassandra nodes, > apart from from cluster replication/ snapshots within cluster? > > -- *Colin Blower* /Software Engineer/ Barracuda Networks Inc. =

Re: CQL decimal encoding

2014-02-25 Thread Colin Blower
ecimal type. > > Or in the absence of a spec, just a heads up from other language > driver implementors as to what approach they've taken. > > Does this make sense? > > Cheers, > > Ben -- *Colin Blower* /Software Engineer/ Barracuda Networks Inc. +1 408-

Re: Row_key from sstable2json to actual value of the key

2014-04-03 Thread Colin Blower
tomcat.t5 where c1='concerted value'; > > Thanks in advance for the help. > -- *Colin Blower* /Software Engineer/ Barracuda Networks Inc. +1 408-342-5576 (o) === Find out how eSigning generates significant financial benef

[ANN] ccm-clj - test Cassandra clusters via Clojure

2014-04-08 Thread Colin Taylor
;)) (do (ccm/switch! "testcluster") (ccm/start! "testcluster"))) (ccm/remove! "testcluster") cheers Colin Taylor

Re: initial token crashes cassandra

2014-05-17 Thread Colin Clark
You probably generated the wrong token type. Look for a murmur token generator on the Datastax site. -- Colin 320-221-9531 On May 17, 2014, at 7:00 PM, Tim Dunphy wrote: Hi and thanks for your response. The puzzling thing is that yes I am using the murmur partition, yet I am still getting

Re: initial token crashes cassandra

2014-05-17 Thread Colin Clark
le to see what you've changed. -- Colin 320-221-9531 On May 17, 2014, at 10:29 PM, Tim Dunphy wrote: You probably generated the wrong token type. Look for a murmur token > generator on the Datastax site. What Colin is saying is that the tool you used to create the token, is not >

Re: decommissioning a node

2014-05-25 Thread Colin Clark
Try this: nodetool decomission UN means UP, NORMAL -- Colin +1 320 221 9531 On Sun, May 25, 2014 at 9:09 AM, Tim Dunphy wrote: > Also for information that may help diagnose this issue I am running > cassandra 2.0.7 > > I am also using these java options: > > [root@beta

Re: Nectar client - New Cassandra Client for .Net

2014-06-02 Thread Colin Clark
Is your version of Hector using native protocol or thrift? -- Colin +1 320 221 9531 On Mon, Jun 2, 2014 at 6:41 AM, Peter Lin wrote: > > I'm happy to announce Concord has decided to open source our port of > Hector to .Net. > > The project is hosted on google code > ht

Re: Nectar client - New Cassandra Client for .Net

2014-06-02 Thread Colin Clark
Unless a cassandra driver is using the native protocol, it's going to have a very short life going forward. -- Colin +1 320 221 9531 On Mon, Jun 2, 2014 at 7:10 AM, Peter Lin wrote: > > it is using thrift. I've updated the project page to state that info. > > > On Mo

Re: Nectar client - New Cassandra Client for .Net

2014-06-02 Thread Colin Clark
Peter, There's very little reason today to write your own Cassandra driver for .net, java, or python. Those firms that do are now starting to wrap those drivers with any specific functionality they might require, like Netflix, for example. Have you looked at DataStax's .NET driver? -

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin Clark
ients (huge) connect directly the cluster to do this-put some type of app server in between to handle the comm's and fan out. You'll get better utilization of resources and less overhead in addition to flexibility of which data center you're utilizing to serve requests. -- Colin 3

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin Clark
rity of my days helping people utilize Cassandra correctly, and rescuing those that haven't. :) -- Colin 320-221-9531 On Jun 7, 2014, at 6:53 PM, Kevin Burton wrote: well you could add milliseconds, at best you're still bottlenecking most of your writes one one box.. maybe 2-3 if there

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin Clark
Not if you add another column to the partition key; source for example. I would really try to stay away from the ordered partitioner if at all possible. What ingestion rates are you expecting, in size and speed. -- Colin 320-221-9531 On Jun 7, 2014, at 9:05 PM, Kevin Burton wrote: Thanks

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin Clark
With 100 nodes, that ingestion rate is actually quite low and I don't think you'd need another column in the partition key. You seem to be set in your current direction. Let us know how it works out. -- Colin 320-221-9531 On Jun 7, 2014, at 9:18 PM, Kevin Burton wrote: What's

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin Clark
r of 3 ensures reads and protection against losing a node. In event of losing a node, you can downgrade the CL automatically and then also accept a little eventual consistency. -- Colin 320-221-9531 On Jun 7, 2014, at 10:03 PM, James Campbell wrote: This is a basic question, but having heard that

Re: Data model for streaming a large table in real time.

2014-06-07 Thread Colin Clark
You won't need containers - running one instance of Cassandra in that configuration will hum along quite nicely and will make use of the cores and memory. I'd forget the raid anyway and just mount the disks separately (jbod) -- Colin 320-221-9531 On Jun 7, 2014, at 10:02 PM, Ke

Re: Advice on how to handle corruption in system/hints

2014-06-09 Thread Colin Kuo
Hi Francois, We're facing the same issue like yours. The approach we did is to 1. scrub that corrupted data file 2. repair that column family Immediately delete that corrupted files is not suggested if C* instance is running. This might be happening if bad disk or power outage. Thanks,

Re: high pending compactions

2014-06-09 Thread Colin Kuo
As Jake suggested, you could firstly increase "compaction_throughput_mb_per_sec" and "concurrent_compactions" to suitable values if system resource is allowed. From my understanding, major compaction will internally acquire lock before running compaction. In your case, there might be a major compac

Re: How to restart bootstrap after a failed streaming due to Broken Pipe (1.2.16)

2014-06-09 Thread Colin Kuo
You can use "nodetool repair" instead. Repair is able to re-transmit the data which belongs to new node. On Tue, Jun 10, 2014 at 10:40 AM, Mike Heffner wrote: > Hi, > > During an attempt to bootstrap a new node into a 1.2.16 ring the new node > saw one of the streaming nodes periodically disap

Re: 2x disk space required for full compaction? Don't vnodes help this problem?

2014-07-24 Thread Colin Clark
Triggering a major compaction is usually not a good idea. If you've got ssd's, go leveled as DuyHai says. The results will be tasty. -- Colin 320-221-9531 On Jul 24, 2014, at 5:28 PM, Kevin Burton wrote: This was after a bootstrap… so I triggered a major compaction. Should I ju

Re: unable to load data using sstableloader

2014-07-28 Thread Colin Kuo
Have you created the schema for these data files? I meant the schema should be created before you load these data file to C*. Here is the article for introduction of sstableloader that you could refer. http://www.datastax.com/documentation/cassandra/1.2/cassandra/tools/toolsBulkloader_t.html On

Re: Loading Keyspace from YAML in 0.8

2011-06-03 Thread Colin Taylor
Its ugly for integration testing ... Socket socket = new Socket("127.0.0.1", 9170); final OutputStream cli = socket.getOutputStream(); log.info("loading schema via cli"); new PrintWriter(cli).print(new String(readFully(new FileInputStream("cassandra-schema"; cli

  1   2   >