Re: Spark SQL JDBC Server + DSE

2015-06-03 Thread Brian O'Neill
Kudos Ben. We've been tracking Zeppelin, and considered doing the same thing. You beat us to it. Well done. -brian --- Brian O'Neill Chief Technology Officer Health Market Science, a LexisNexis Company 215.588.6024 Mobile • @boneill42 This information tran

Re: How to store denormalized data

2015-06-03 Thread Jack Krupansky
Your requirement is still not quite clear - are you counting users or reports, or reports of a type for each user, or...? You can have a separate table, with the partition key being the user type, and using the user id as a clustering column - provided that the number of users is only thousands or

RE: How to store denormalized data

2015-06-03 Thread Matthew Johnson
Thanks Shahab, That was my initial thought. The downside I can think of for that approach is if/when we decide to use this data to serve suggestions in real time back to the users (in a sort of “if you clicked on this you might also like to click on this”) and the algorithms for that would need

Re: How to store denormalized data

2015-06-03 Thread Shahab Yunus
Suggestion, or rather food for thought: do you expect to read/analyze the written data right away? Or will it be a batch process, kicked off later in time? What I am trying to say is that if the 'read/analysis' part is a) a batch process and b) kicked off later in time, then #3 is a fine solution?

How to store denormalized data

2015-06-03 Thread Matthew Johnson
Hi all, I am trying to store some data (user actions in our application) for future analysis (probably using Spark). I understand best practice is to store it in denormalized form, and this will definitely make some of our future queries much easier. But I have a problem with denormalizing the d

Re: How to interpret some GC logs

2015-06-03 Thread Carlos Rolo
GC logs are a weird science. I use a couple of resources to get through them. Regarding your question, my 1.8.0_40 always has the "->" first. I grepped through 2h of logs, and on a test environment. I use the following set of options: -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintHe

Re: Different number of records from COPY command

2015-06-03 Thread Josef Lindman Hörnlund
I ran into that issue a while ago and it was because I hit the tombstone limit on one of the nodes. Try running `nodetool compact adlog 'adclicklog20150528.csv` and see if that helps. Josef Lindman Hörnlund > On 02 Jun 2015, at 17:48, Saurabh Chandolia wrote: > > Still getting inconsistent n

duplicate create table event

2015-06-03 Thread 高健峰
Hi, I am using Cassandra 2.1.3 and DataStax driver 2.1.3. I have four nodes in my datacenter. In the client I have just 1 Cluster instance object. Now, when a table was created, my client received 236 CREATE TABLE events in 1 second. Does anybody know why this happened? Thanks! -- -- Joseph

Re: How to set datastax-agent connect with jmx an

2015-06-03 Thread 贺伟平
Yes, I use the same username/pw in jvisualvm, and it works. Is the username/pw configured in the OpsCenter UI or in the agent? When I add the cluster in the OpsCenter UI and fill in the jmx username/pw ……. From: Jason Wee Sent: Wednesday, June 3, 2015, 12:41 To: user@cassandra.apache.org the error in the