> If you need to dump a lot of data consider the Hadoop integration.
http://wiki.apache.org/cassandra/HadoopSupport It can run a bit faster
than going through the thrift api.
Does the Cassandra Hadoop integration read SSTables directly instead of
going via Thrift?
You might check out some of the stuff Netflix does with their Cassandra
backup and Cassandra ETL tools:
http://techblog.netflix.com/2012/02/aegisthus-bulk-data-pipeline-out-of.html
http://techblog.netflix.com/2012/02/announcing-priam.html
-Jeremiah
On 02/29/2012 11:04 AM, Alexandru Sicoe wro
On Sun, Feb 26, 2012 at 8:24 PM, aaron morton wrote:
> All nodes in the cluster need two-way communication. Nodes need to
> gossip with each other so they know they are alive.
>
> If you need to dump a lot of data consider the Hadoop integration.
> http://wiki.apache.org/cassandra/HadoopSupp
All nodes in the cluster need two-way communication. Nodes need to
gossip with each other so they know they are alive.
If you need to dump a lot of data consider the Hadoop integration.
http://wiki.apache.org/cassandra/HadoopSupport It can run a bit faster than
going through the thrift a
Hello everyone,
I'm battling with this constraint that I have: I need to regularly ship out
timeseries data from a Cassandra cluster that sits within an enclosed
network, outside of the network.
I tried to select all the data within a certain time window, writing to a
file, and then copying the fi