Re: Machine Learning With Cassandra

2014-08-30 Thread James Horey
If you want distributed machine learning, you can use either Mahout (runs on Hadoop) or Spark (MLlib). If you choose the Hadoop route, DataStax provides a connector (CFS) to interact with data stored in Cassandra. Otherwise you can try to use the Cassandra InputFormat (not as simple, but plenty

Re: Cassandra use cases/Strengths/Weakness

2014-07-04 Thread James Horey
I’ve supported a variety of different “big data” systems and most have their own particular set of use cases that make sense. Having said that, I believe that Cassandra uniquely excels at the following:
* Low write latency with respect to small to medium write sizes (logs, sensor data, etc.)
*

Re: autoscaling cassandra cluster

2014-05-21 Thread James Horey
> alise I will have to run nodetool to add and remove the nodes from the cluster and also the node cleanup.
>
> Disclaimer: this is not a production system but something I'm experimenting with in my own time.
>
> Thanks
>
> Jabbar Azam

Re: autoscaling cassandra cluster

2014-05-21 Thread James Horey
If you're interested and/or need some Cassandra Docker images, let me know and I'll shoot you a link.

James

> On May 21, 2014, at 10:19 AM, Jabbar Azam wrote:
>
> That sounds interesting. I was thinking of using CoreOS with Docker containers for the business logic, fronten

Re: How to clear all data using CQL?

2014-04-16 Thread James Horey
If you’re running unit tests and repeatedly clearing the Cassandra keyspaces, you may want to check out Ferry (ferry.opencore.io). It lets you stand up and destroy multiple Cassandra stacks locally on your machine and is useful for the use case you described. I’m the author of Ferry, and would be gla

Help collecting Cassandra examples

2014-03-31 Thread James Horey
Hello all, I’m trying to collect and organize Cassandra applications for educational purposes. I’m hoping that by collating these applications in a single place, new users will be able to get up to speed a bit more easily. If you know of a great application (should be open-source and preferably up t