Re: Cassandra 2.0.7 keeps reporting errors due to no space left on device

2014-05-03 Thread DuyHai Doan
Hello Yatong "If I restart the node or using 'cleanup', it will resume to normal." --> what does df -hl shows for /data6 when you restart or cleanup the node ? By the way, a single SSTable of 3.6Tb is kind of huge. Do you perform manual repair frequently ? On Sun, May 4, 2014 at 1:51 AM, Yat

Is the updating compaction strategy from 'sized tiered' to 'leveled' automatic or need to be done manually?

2014-05-03 Thread Yatong Zhang
Hi there, I am updating compaction strategy from 'sized tiered' to 'leveled' and from http://www.datastax.com/dev/blog/leveled-compaction-in-apache-cassandra it is said: When updating an existing column family, reads and writes can continue as > usual while leveling of existing sstables is perfor

Re: Cassandra vs Elasticsearch.

2014-05-03 Thread Tim Dunphy
Utkarsh, Thanks! I think I'll try it! Tim On Sat, May 3, 2014 at 5:18 PM, Utkarsh Sengar wrote: > I have also written a prototype ES-Cassandra river: > https://github.com/eBay/cassandra-river > Never tested it in prod, might need improvements. > > Thanks, > -Utkarsh > > > On Sat, May 3, 2014 a

Re: Cassandra 2.0.7 keeps reporting errors due to no space left on device

2014-05-03 Thread Yatong Zhang
My Cassandra cluster has plenty of free space, for now only about 30% of space are used On Sun, May 4, 2014 at 6:36 AM, Yatong Zhang wrote: > Hi there, > > It was strange that the 'xxx-tmp-xxx.db' file kept increasing until > Cassandra throw exceptions with 'No space left on device'. I am using

Cassandra 2.0.7 keeps reporting errors due to no space left on device

2014-05-03 Thread Yatong Zhang
Hi there, It was strange that the 'xxx-tmp-xxx.db' file kept increasing until Cassandra throw exceptions with 'No space left on device'. I am using CQL 3 to create a table to store data about 200K ~ 500K per record. I have 6 harddisks per node and cassandra was configured with 6 data directories(e

Re: Cassandra vs Elasticsearch.

2014-05-03 Thread Utkarsh Sengar
I have also written a prototype ES-Cassandra river: https://github.com/eBay/cassandra-river Never tested it in prod, might need improvements. Thanks, -Utkarsh On Sat, May 3, 2014 at 1:37 PM, Tim Dunphy wrote: > I'd like to try your ElasticSearch / Cassandra driver as well. Could you > post a l

Re: Cassandra vs Elasticsearch.

2014-05-03 Thread Tim Dunphy
I'd like to try your ElasticSearch / Cassandra driver as well. Could you post a link? Is it on GitHub or similar? Thanks Tim Sent from my iPhone > On May 3, 2014, at 4:06 PM, prabhat wrote: > > Great idea. I can do test > > Prabhat Kumar Singh > > > >> On Sun, May 4, 2014 at 12:32 AM, Eli

Re: Cassandra vs Elasticsearch.

2014-05-03 Thread prabhat
Great idea. I can do test Prabhat Kumar Singh On Sun, May 4, 2014 at 12:32 AM, Elias Ross wrote: > I've come up with a driver so that Elasticsearch can store its index > data in Cassandra. I'm not sure how well it performs, as I haven't > really put it through any big data sets. But you then

Re: Cassandra vs Elasticsearch.

2014-05-03 Thread Elias Ross
I've come up with a driver so that Elasticsearch can store its index data in Cassandra. I'm not sure how well it performs, as I haven't really put it through any big data sets. But you then get the advantage of data durability in Cassandra and the search capability of Elasticsearch. It's very expe

Re: Cassandra vs Elasticsearch.

2014-05-03 Thread Jack Krupansky
DataStax Enterprise integrates Cassandra and Apache Solr, with Solr as a secondary index so that the Solr query index can be kept in sync with the Cassandra data automatically and even fully reindexed if your index mapping changes, as a single request. So, C* provides the fully distributed, dura

Re: Cassandra vs Elasticsearch.

2014-05-03 Thread Jon Haddad
Agreed w/ ES not being the durable data store. I would recommend treating it as ephemeral, and using Cassandra as your source of truth. Keep in mind if you change your ES index mapping, you’ll require a full reindex in order to search the data properly. It’s not like adding a secondary index

Re: Cassandra vs Elasticsearch.

2014-05-03 Thread DuyHai Doan
Hello Tim You're absolutely right about ES for the query part. This is the perfect fit for complex queries. Now regarding your question: "What advantages does Cassandra give me over ES?" --> linear scalability & durability. ES is just a super index cluster. I've talked to ES guys. If they do not

Cassandra vs Elasticsearch.

2014-05-03 Thread Tim Uckun
Hey all. I have been trying out some data stores for time series data and Cassandra was the first on my list because so many people are using it for the same purpose. I have read many articles on how to model my time series data and tried several variations of schemas which I thought made sense f