Re: Unexplainably large reported partition sizes

2016-03-05 Thread Tom van den Berge
I don't think compression can be the cause of the difference, for two reasons: 1) The partition size I calculated myself (3 MB) is the uncompressed size, and so is the reported size (2.3 GB). 2) The difference is simply far too big to be explained by compression, even if the calculated size

Re: Unexplainably large reported partition sizes

2016-03-05 Thread DuyHai Doan
Maybe tombstones? Do you issue a lot of DELETE statements? Or do you re-insert into the same partition with different TTL values? On Sat, Mar 5, 2016 at 7:16 PM, Tom van den Berge wrote: > I don't think compression can be the cause of the difference, because of > two reasons: > > 1) The partiti

Re: How to create an additional cluster in Cassandra exclusively for Analytics Purpose

2016-03-05 Thread Bhuvan Rawal
Thanks Sean and Nirmallaya. @Jack, we are going with DSC right now and plan to use Spark, and later Solr, over the analytics DC. The use case is to keep OLAP and OLTP workloads separated rather than intertwined, whether that is achieved by creating a new DC or a new cluster altogether. From Nirmallay

Re: How to create an additional cluster in Cassandra exclusively for Analytics Purpose

2016-03-05 Thread Jack Krupansky
You haven't been clear about how you intend to add Solr. You can also use Stratio or Stargate for basic Lucene search if you don't need full Solr support and want to stick to open source rather than go with DSE Search for Solr. -- Jack Krupansky On Sun, Mar 6, 2016 at 12:25 AM, Bhuvan Rawal

Re: How to create an additional cluster in Cassandra exclusively for Analytics Purpose

2016-03-05 Thread Bhuvan Rawal
Yes Jack, we are rolling out with Stratio right now. We will assess the performance benefit it yields and can go for ElasticSearch/Solr later. In your experience, how does Stratio perform vis-a-vis secondary indexes? On Sun, Mar 6, 2016 at 11:15 AM, Jack Krupansky wrote: > You haven't been c