Re: scylladb

2015-11-12 Thread Jack Krupansky
I just did a Twitter search on scylladb and did not see any tweets about actual use, so far. -- Jack Krupansky On Wed, Nov 11, 2015 at 10:54 AM, Carlos Alonso wrote: > Any update about this? > > @Carlos Rolo, did you tried it? Thoughts? > > Carlos Alonso | Software Engineer | @calonso

Spark on cassandra

2015-11-12 Thread Prem Yadav
Hi, Is it better to use Spark APIs to do join on cassandra tables or should we use SPARK-SQL. We have been struggling with SPARK-SQL as we need to do multiple large table joins and there is always failure. I tried to do joins using the API like this: val join1 = sc.cassandraTable("Keyspace1","tabl

Data.db too large and after sstableloader still large

2015-11-12 Thread qihuang.zheng
We do snapshot, and found some Data.db too large: [qihuang.zheng@spark047219 5]$ find . -type f -size +800M -print0 | xargs -0 ls -lh -rw-r--r--. 2 qihuang.zheng users 1.5G 10月 28 14:49 ./forseti/velocity/forseti-velocity-jb-103631-Data.db And sstableloader to new cluster, one node has this lar

Re: Spark on cassandra

2015-11-12 Thread DuyHai Doan
Hello Prem I believe it's better to ask your question on the ML of the Spark Cassandra connector: http://groups.google.com/a/lists.datastax.com/forum/#!forum/spark-connector-user Second "we need to join multiple table from multiple keyspaces. How can we do that?", the response is given in your ex

回复:Data.db too large and after sstableloader still large

2015-11-12 Thread qihuang.zheng
Original snapshot files: [qihuang.zheng@spark047219 226_1105]$ ll 2/forseti/velocity/ -h | grep Data -rw-r--r--. 1 qihuang.zheng users 158M 10月 28 15:03 forseti-velocity-jb-102486-Data.db -rw-r--r--. 1 qihuang.zheng users 161M 10月 28 16:28 forseti-velocity-jb-103911-Data.db -rw-r--r--. 1 qihuang

Re: Timeout with static column

2015-11-12 Thread Tyler Hobbs
Can you try to isolate this to a reproducible test case or script and open a jira ticket at https://issues.apache.org/jira/browse/CASSANDRA? On Wed, Nov 11, 2015 at 2:54 PM, Brice Figureau < brice+cassan...@daysofwonder.com> wrote: > Hi, > > Following my previous Read query timing out, I'm now ru

Re: Data.db too large and after sstableloader still large

2015-11-12 Thread Robert Coli
On Thu, Nov 12, 2015 at 6:44 AM, qihuang.zheng wrote: > question is : why sstableloader can’t balance data file size? > Because it streams ranges from the source SStable to a distributed set of ranges, especially if you are using vnodes. It is a general property of Cassandra's streaming that i

Re: Data.db too large and after sstableloader still large

2015-11-12 Thread qihuang.zheng
Tks,Rob. We use spark-cassandra-connector to read data from table, then do repartition action. If some nodes with large file bring out running this tasktoo slow, maybe serveral hours which is unacceptable. But those nodes with small file running finished quickly. So I think if sstableloader can s

too many full gc in one node of the cluster

2015-11-12 Thread Shuo Chen
Hi, We have a small cassandra cluster with 4 nodes for production. All the nodes have similar hardware configuration and similar data load. The C* version is 1.0.7 (prretty old) One of the node has much higher cpu usage than others and high full gc frequency, but the io of this node is not high a