Re: CSV Import is taking huge time

2014-07-23 Thread Akshay Ballarpure
Thanks Jack for quick reply. i didn't understood your question completely. i am very new to Cassandra. I just installed single node cluster [root@CSL-simulation bin]# ./nodetool -host 10.59.18.206 -p 7199 status Note: Ownership information does not include topology; for complete information, spe

Cassandra on AWS suggestions for data safety

2014-07-23 Thread Hao Cheng
Hello, Based on what I've read in the archives here and on the documentation on Datastax and the Cassandra Community, EBS volumes, even provisioned IOPS with EBS optimized instances, are not recommended due to inconsistent performance. This I can deal with, but I was hoping for some recommendation

What is C*?

2014-07-23 Thread jcllings
Keep seeing refs to C*. I assume that C* == Cassandra? IMHO not a good ref to use what with C, C++, C#. A language called C* can't be far behind assuming it doesn't already exist. ;-) Jim C. signature.asc Description: OpenPGP digital signature

Re: All writes fail with ONE consistency level when adding second node to cluster?

2014-07-23 Thread Robert Coli
On Wed, Jul 23, 2014 at 1:18 PM, DuyHai Doan wrote: > Why that ? In worst case, CL.ANY will write hints for replicas that are > down. If will be extraordinary unlucky to have all replicas down at the > same time > Hints are not writes for the purposes of consistency or durability, so your write

Re: All writes fail with ONE consistency level when adding second node to cluster?

2014-07-23 Thread graham sanderson
I was being a little tongue in cheek! On Jul 23, 2014, at 3:20 PM, Jack Krupansky wrote: > Granted, for “normal” apps it is unlikely to be appropriate but... > > From an old post by Jonathan: > --- > Extreme write availability > > For applications that want Cassandra to accept writes even wh

Re: CSV Import is taking huge time

2014-07-23 Thread Tyler Hobbs
See https://issues.apache.org/jira/browse/CASSANDRA-7405. Currently cqlsh's COPY FROM just uses a single-threaded for-loop with synchronous inserts. On Wed, Jul 23, 2014 at 8:09 AM, Jack Krupansky wrote: > Is it compute bound or I/O bound? > > What does your cluster look like? > > -- Jack Kr

Re: All writes fail with ONE consistency level when adding second node to cluster?

2014-07-23 Thread Jack Krupansky
Granted, for “normal” apps it is unlikely to be appropriate but... >From an old post by Jonathan: --- Extreme write availability For applications that want Cassandra to accept writes even when all the normal replicas are down (so even ConsistencyLevel.ONE cannot be satisfied), Cassandra provide

Re: All writes fail with ONE consistency level when adding second node to cluster?

2014-07-23 Thread DuyHai Doan
Why that ? In worst case, CL.ANY will write hints for replicas that are down. If will be extraordinary unlucky to have all replicas down at the same time On Wed, Jul 23, 2014 at 9:26 PM, Robert Coli wrote: > On Wed, Jul 23, 2014 at 12:01 PM, graham sanderson > wrote: > >> Hey now; it is GREAT

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Jack Krupansky
Out of curiosity, did you look at or utilize DataStax’s free online training? See: http://www.datastax.com/what-we-offer/products-services/training/virtual-training Any feedback? Any suggestions as to what needs it does or doesn’t fulfill? -- Jack Krupansky From: Nicholas Okunew Sent: Wednesda

Re: All writes fail with ONE consistency level when adding second node to cluster?

2014-07-23 Thread Robert Coli
On Wed, Jul 23, 2014 at 12:01 PM, graham sanderson wrote: > Hey now; it is GREAT for a 100% write only use case ;-) > A well WORN [1] path in databases, for sure. =Rob [1] Write Once Read Never

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Robert Coli
On Wed, Jul 23, 2014 at 11:44 AM, Kevin Burton wrote: > I have a lot of experience in distribute systems, understand the space > well, but I just can't find documentation on how cassandra does things from > a high level perspective. > My belief as to the reason why you are unable to find design

Re: All writes fail with ONE consistency level when adding second node to cluster?

2014-07-23 Thread graham sanderson
Hey now; it is GREAT for a 100% write only use case ;-) On Jul 23, 2014, at 12:15 PM, Robert Coli wrote: > On Tue, Jul 22, 2014 at 7:46 PM, Andrew wrote: > ONE means write to one replica (in addition to the original). If you want to > write to any of them, use ANY. Is that the right understa

Re: All writes fail with ONE consistency level when adding second node to cluster?

2014-07-23 Thread Kevin Burton
Interesting.. it was unclear what it does… ONE sounds right to me so I was curious what was up with ANY. We just set it to ANY so that we could track down what was causing this bug. On Wed, Jul 23, 2014 at 10:15 AM, Robert Coli wrote: > On Tue, Jul 22, 2014 at 7:46 PM, Andrew wrote: > >> ONE

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Kevin Burton
This is a great post… and really does a great job of summarizing the problems I have with the cassandra documentation. I realize that Datastax is trying to step in and fix the problem.. but there are now a lot of parties involved (JIRA tickets, blog posts, datastax doc, apache doc, etc) … it's all

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Kevin Burton
On Wed, Jul 23, 2014 at 4:55 AM, Jason Wee wrote: > I agree to the people here already sharing their ways to access > documentation. If you are starter, you should better spend time to search > for documentation (like using google) or hours to read. Then start ask > specific question. Coming here

Re: cluster rebalancing…

2014-07-23 Thread Robert Coli
On Tue, Jul 22, 2014 at 7:03 PM, Kevin Burton wrote: > So , shouldn't it be easy to rebalance a cluster? > > I'm not super excited to type out 200 commands to move around individual > tokens. > That's why vnodes exist? Before vnodes, the only sane option was to double your cluster size... =Rob

Re: All writes fail with ONE consistency level when adding second node to cluster?

2014-07-23 Thread Robert Coli
On Tue, Jul 22, 2014 at 7:46 PM, Andrew wrote: > ONE means write to one replica (in addition to the original). If you want > to write to any of them, use ANY. Is that the right understanding? > This has come up a few times, so let me be unambiguous about when to use CL.ANY : NEVER EVER USE CL

Re: Cassandra select results differs

2014-07-23 Thread Russell Bradberry
try running at CL QUORUM and see if the problem goes away, if it does then it might be a consistency issue. also, what version of C*, how many nodes, what is your RF and what CL do you normally read? On July 23, 2014 at 12:55:32 PM, Batranut Bogdan (batra...@yahoo.com) wrote: I have cron job

Re: Cassandra select results differs

2014-07-23 Thread Batranut Bogdan
I have cron jobs that repair every week. node 1 - monday , node 2 tuesday . On Wednesday, July 23, 2014 7:52 PM, Russell Bradberry wrote: sounds like you may need to run a repair On July 23, 2014 at 12:50:23 PM, Batranut Bogdan (batra...@yahoo.com) wrote: Hello all, I have a CF  C

Re: Cassandra select results differs

2014-07-23 Thread Russell Bradberry
sounds like you may need to run a repair On July 23, 2014 at 12:50:23 PM, Batranut Bogdan (batra...@yahoo.com) wrote: Hello all, I have a CF  CREATE TABLE cf (   a text,   b int,   c int,   d int,   e int,   PRIMARY KEY (a) )  WITH   bloom_filter_fp_chance=0.01 AND   caching='KEYS_ONLY'

Cassandra select results differs

2014-07-23 Thread Batranut Bogdan
Hello all, I have a CF  CREATE TABLE cf (   a text,   b int,   c int,   d int,   e int,   PRIMARY KEY (a) )  WITH   bloom_filter_fp_chance=0.01 AND   caching='KEYS_ONLY' AND   comment='' AND   dclocal_read_repair_chance=0.00 AND   gc_grace_seconds=864000 AND   index_interval=128 AND   re

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Jeremy Jongsma
My experience is similar to Nicholas'. Basic usage was easy to get a handle on, but the advanced tuning/tweaking info is scattered EVERYWHERE around the web, mostly on personal blogs. It feels like it took way too long to become confident enough in my understanding of Cassandra that I trust our dep

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Jake Luciani
I'll note that historically the wiki used to be open to all and due massive amounts of spam it was put on lockdown by the ASF. If there is a better platform the community feels would make it simpler to provide community based documentation then we should consider it. The ASF also has confluence wi

Re: All writes fail with ONE consistency level when adding second node to cluster?

2014-07-23 Thread Jack Krupansky
Besides the obviously confusing error message, this particular case could simply be that the hash value of the primary key belonged to the other node that wasn’t up, so even though one node was up, it didn’t own that particular hash value or token, so CL=ONE could not succeed. What was RF set t

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Dave Brosius
We had a massive spam problem before we locked down the wiki, so unfortunately that was the choice we had to make. But as stated we can add you to the contributers list. What is your Wiki user name? On 2014-07-23 07:33, Peter Lin wrote: > I've tried to contribute docs to Cassandra wiki in

RE: EXTERNAL: Re: Running Cassandra Server in an OSGi container

2014-07-23 Thread Rodgers, Hugh
Yes, the application includes the C* server and client. From: Robert Stupp [mailto:sn...@snazy.de] Sent: Wednesday, July 23, 2014 12:19 AM To: user@cassandra.apache.org Subject: Re: EXTERNAL: Re: Running Cassandra Server in an OSGi container You mean "unzip and run" of an application using C* ?

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Peter Lin
@benedict - you're right that I've haven't requested permission to edit. You're also right that I've given up on getting edit permission to cassandra wiki. I've been struggling and struggled with "how" to manage open source projects, so I totally get it. Managing projects is a thankless job most of

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Ben Hood
On Wed, Jul 23, 2014 at 1:30 PM, Peter Lin wrote: > > I sent a request to add a link my .Net driver for cassandra to the wiki > over 5 weeks back and no response at all. > TL;DR There is something wrong with Cassandra information sharing, but I am partly to blame. My experience has not been too

Re: CSV Import is taking huge time

2014-07-23 Thread Jack Krupansky
Is it compute bound or I/O bound? What does your cluster look like? -- Jack Krupansky From: Akshay Ballarpure Sent: Wednesday, July 23, 2014 5:00 AM To: user@cassandra.apache.org Subject: CSV Import is taking huge time Hello, I am trying copy command in Cassandra to import CSV file in to DB,

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Benedict Elliott Smith
Requesting a change is very different to requesting permission to edit (which, I note, still hasn't been made); we do our best to promote community engagement, so granting a privilege request has a different mental category to a random edit request, which is much more likely to be forgotten by any

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Peter Lin
it supports CQL, but it's through thrift. I don't currently support native protocol, since that was evolving rapidly last year when I made the port. I state clearly on nectar-client wiki on google code that it supports CQL3 via thrift. I've pretty much given up on cassandra wiki. Using my blog to

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Jack Krupansky
I do recall seeing your announcement of your driver, but I think it got lost in the discussion of whether it supported CQL. If you say it supports CQL and native protocol, I’m sure it will get very prompt attention. -- Jack Krupansky From: Peter Lin Sent: Wednesday, July 23, 2014 8:30 AM To: u

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Peter Lin
I sent a request to add a link my .Net driver for cassandra to the wiki over 5 weeks back and no response at all. I sent another request way back in 2013 and got zero response. Again, I totally understand people are busy and I'm just as guilty as everyone else of letting requests slip by. It's the

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Nicholas Okunew
I think the problem is a little deeper than that. I've been working with cassandra for about 7 months now - it was very challenging to find out any real information about using cassandra, and even harder to get clear information on operating it. There's a truckload of reading you have to do, and no

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Benedict Elliott Smith
All requests I've seen in the past year to edit the wiki (admittedly only 2-3) have been answered promptly with editing privileges. Personally I don't have a major preference either way for policy - there are positives and negatives to each approach - but, like I said, raise it on the dev list and

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Jason Wee
I agree to the people here already sharing their ways to access documentation. If you are starter, you should better spend time to search for documentation (like using google) or hours to read. Then start ask specific question. Coming here kpkb about poor quality of documentation just does not cut

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Peter Lin
I've submitted requests to edit the wiki in the past and nothing ever got done. Having been an apache committer and contributor over the years, I can totally understand that people are busy. I also understand that "most" developer find writing docs tedious. I'd rather not harass the committers ab

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Jack Krupansky
And the simplest and easiest thing to do is simply email this list when you see something wrong or missing in the DataStax Cassandra doc, or for anything that is not adequately anywhere. I work with the doc people there, so I can make sure they see corrections and improvements. And simply sharin

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Benedict Elliott Smith
It only takes a moment to ask to be added as a wiki contributor; if you email the dev list or ask on irc, somebody with privileges will ordinarily add you within a day. It may be a psychological barrier, but it isn't really a practical one. Still, if you feel the policy is incorrect, raise this on

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Peter Lin
I've tried to contribute docs to Cassandra wiki in the past, but there's an obstacle. currently wiki.apache.org/cassandra is locked down, so only commiters can edit it. I really wish that wasn't the case, since it wastes time. the commiters are busy writing code. Having to email a commiter and ask

Re: horizontal query scaling issues follow on

2014-07-23 Thread Benedict Elliott Smith
> > if you find that adding nodes causes performance to degrade I would > suspect that you are querying data in one CQL statement that is spread over > multiple partitions This is exactly what is happening. The better way to query multiple partitions is to simply despatch multiple queries (asynch

Re: Should PREPARE QUERY return metadata for the query result?

2014-07-23 Thread Ben Hood
On Wed, Jul 23, 2014 at 12:07 PM, Ben Hood <0x6e6...@gmail.com> wrote: > Or have I just been looking at the wrong version of the spec all along? So it turns out that this is a case of PEBCAK: v2 of the protocol is formulated thusly: 4.2.5.4. Prepared The result to a PREPARE message. The rest o

Re: Should PREPARE QUERY return metadata for the query result?

2014-07-23 Thread Ben Hood
On Wed, Jul 23, 2014 at 11:14 AM, Ben Hood <0x6e6...@gmail.com> wrote: > But I was wondering if we were doing something wrong by not returning > the result meta data from the PREPARE result (if it does indeed > exist). Looking into this a bit further, it looks like the client driver needs to deser

Should PREPARE QUERY return metadata for the query result?

2014-07-23 Thread Ben Hood
Hi all, I'm looking at the specification of statement preparation (section 4.2.5.4 of the CQL protocol) and I'm wondering whether the metadata result of the PREPARE query only returns column information for the query arguments, and not for the columns of the actual query result. The background is

Re: horizontal query scaling issues follow on

2014-07-23 Thread Diane Griffith
I posted the query wrong, I gave the query for 1 key versus the large batch of ids like I was testing. What it was using for large batch was IN, so Select * from foo where key IN and col_name='LATEST So after breaking it down and reading as much as I can with regard to our - schema, dynami

Re: DataType protocol ID error for TIMESTAMPs when upgrading from 1.2.11 to 2.0.9

2014-07-23 Thread Ben Hood
On Wed, Jul 23, 2014 at 1:53 AM, Robert Coli wrote: > On Tue, Jul 22, 2014 at 1:53 AM, Ben Hood <0x6e6...@gmail.com> wrote: > Indeed, reading up on the issue (and discussing it with folks) there are a > number of mitigating factors, most significantly driver workarounds use of > TimeUUIDs, which m

Re: DataType protocol ID error for TIMESTAMPs when upgrading from 1.2.11 to 2.0.9

2014-07-23 Thread Ben Hood
On Wed, Jul 23, 2014 at 1:53 AM, Robert Coli wrote: > On Tue, Jul 22, 2014 at 1:53 AM, Ben Hood <0x6e6...@gmail.com> wrote: > In this particular case, the answer to "why not" involves the idea that one > needs to be able to test with a driver in order to expose it, and currently > (as I understand

CSV Import is taking huge time

2014-07-23 Thread Akshay Ballarpure
Hello, I am trying copy command in Cassandra to import CSV file in to DB, Import is taking huge time, any suggestion to improve it? id,a,b,c,d,e,f,g,h,i,j,k,l,m,n,o,p,q,r,s,t,u,v,w,x,y,z 100,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26 101,1,2,3,4,5,6,7,8,9,10,11,12,13,14

Re: Case Study from Migrating from RDBMS to Cassandra

2014-07-23 Thread DuyHai Doan
"How they have migrated from RDBMS to Cassandra?" -> there is no one-answer-fits-all. It all depends on the application features "What are the things to consider?" -> design data model with query-first approach. The choice of how/when/what to denormalize is crucial "How they have converted data

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread spawgi
I would like to help out with the documentation of C*. How do I start? On Wed, Jul 23, 2014 at 12:46 PM, Robert Stupp wrote: > Just a note: > If you have suggestions how to improve documentation on the datastax > website, write them an email to d...@datastax.com. They appreciate > proposals :)

Re: When will a node's host ID change?

2014-07-23 Thread Mark Reddy
It won't. Cassandra stores a node's host ID in an end-point to host ID mapping, each ID must be unique and cannot be changed after the fact. Mark On Mon, Jul 21, 2014 at 4:44 PM, John Sanda wrote: > Under what circumstances, if any, will a node's host ID change? > > - John >

Re: ONE consistency required 2 writes? huh?

2014-07-23 Thread Olivier Michallat
Hi Kevin, This message was likely generated by the Java driver, not by Cassandra. I'll follow up to your other post on the driver's mailing list. -- Olivier On Wed, Jul 23, 2014 at 4:37 AM, Kevin Burton wrote: > Perhaps it's me but it seems this exception is wrong: > > "Cassandra timeout dur

Re: EXTERNAL: Re: Running Cassandra Server in an OSGi container

2014-07-23 Thread Robert Stupp
You mean "unzip and run" of an application using C* ? Am 23.07.2014 um 00:34 schrieb Rodgers, Hugh : > What got our team on the path of trying to embed C* was the wiki page > http://wiki.apache.org/cassandra/Embedding which implies this can be done. > Also WSO2 Carbon and Achilles have both emb

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Robert Stupp
Just a note: If you have suggestions how to improve documentation on the datastax website, write them an email to d...@datastax.com. They appreciate proposals :) Am 23.07.2014 um 09:10 schrieb Mark Reddy : > Hi Kevin, > > The difference here is that the Apache Cassandra site is maintained by th

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Mark Reddy
Hi Kevin, The difference here is that the Apache Cassandra site is maintained by the community whereas the DataStax site is maintained by paid employees with a vested interest in producing documentation. With DataStax having some comprehensive docs, I guess the desire for people to maintain the A