Re: Upserting the same values multiple times

2014-01-21 Thread Robert Wille
No tombstones, just many copies of the same data until compaction occurs. From: Sanjeeth Kumar Reply-To: Date: Tuesday, January 21, 2014 at 8:37 PM To: Subject: Upserting the same values multiple times Hi, I have a table A, one of the fields of which is a text column called body. This

RE: Upserting the same values multiple times

2014-01-21 Thread Viktor Jevdokimov
It's not about tombstones. Tombstones are virtually markers for deleted columns (using delete or ttl) in new sstables after compaction to keep such columns for gcgrace period. Updates do not create tombstones for previous records, latest version upon timestamp will be saved from memtable or whe

Re: Data modeling users table with CQL

2014-01-21 Thread Drew Kutcharian
You’re right. I didn’t catch that. No need to have email in the PRIMARY KEY. On Jan 21, 2014, at 5:11 PM, Jon Ribbens wrote: > On Tue, Jan 21, 2014 at 10:40:39AM -0800, Drew Kutcharian wrote: >> Thanks, I was actually thinking of doing that. Something along the lines >> of >> CREATE TABLE

Upserting the same values multiple times

2014-01-21 Thread Sanjeeth Kumar
Hi, I have a table A, one of the fields of which is a text column called body. This text's length could vary somewhere between 120 characters to say 400 characters. The contents of this column can be the same for millions of rows. To prevent the repetition of the same data, I thought I will ad

Re: Best design for a usecase ??

2014-01-21 Thread Naresh Yadav
just to add : on this table there will be lakhs of select queries to get tagcombinationid fro a partial set of tags... On Tue, Jan 21, 2014 at 2:33 PM, Naresh Yadav wrote: > Hi, > > I need to design a table which will give a UUID to set of tags. > Each tag itself has unique UUID > > *TagCombinat

Re: Moving from relational to Cassandra, how to handle intra-table relationships?

2014-01-21 Thread Les Hartzman
True. Fortunately though in this application, the data is write-once/read-many. So that is one bullet I would dodge! Les On Tue, Jan 21, 2014 at 5:34 PM, Patricia Gorla wrote: > Hey, > > One thing to keep in mind if you want to go the serialized JSON route, is > that you will need to read out t

Re: Moving from relational to Cassandra, how to handle intra-table relationships?

2014-01-21 Thread Patricia Gorla
Hey, One thing to keep in mind if you want to go the serialized JSON route, is that you will need to read out the data each time you want to do an update. Cheers, Patricia On Tuesday, January 21, 2014, Les Hartzman wrote: > Hi, > > I'm looking to move from a relational DB to Cassandra. I just

Moving from relational to Cassandra, how to handle intra-table relationships?

2014-01-21 Thread Les Hartzman
Hi, I'm looking to move from a relational DB to Cassandra. I just found that there are intra-table relationships in one table where the ids of the related rows are saved in a 'parent' row. How can these kinds of relationships be handled in Cassandra? I'm thinking that if the individual rows need

Re: Data modeling users table with CQL

2014-01-21 Thread Jon Ribbens
On Tue, Jan 21, 2014 at 10:40:39AM -0800, Drew Kutcharian wrote: >Thanks, I was actually thinking of doing that. Something along the lines >of >CREATE TABLE user ( > idtimeuuid PRIMARY KEY, > emailtext, > nametext, > ... >); >CREATE TABLE user_ema

Re: upgrade from cassandra 1.2.3 -> 1.2.13 + start using SSL

2014-01-21 Thread Cyril Scetbon
Yes it really seems to be similar. I'll update the Jira with my information. I can easily reproduce it. I saw it lasting for one hour last time and not coming back after that. -- Cyril SCETBON On 21 Jan 2014, at 21:57, Robert Coli wrote: > On Mon, Jan 20, 2014 at 3:22 AM, Cyril Scetbon wrote

Possible optimization: avoid creating tombstones for TTLed columns if updates to TTLs are disallowed

2014-01-21 Thread Donald Smith
I'm aware of https://issues.apache.org/jira/browse/CASSANDRA-4917, which optimizes tombstone creation for TTLed columns: "We only need to ensure that ExpiringColumn and tombstone together live as long as gc_grace. If the ExpiringColumn's TTL>=gc_grace_seconds then we can create an already gcable

Re: bad interaction between CompositeTypes and Secondary index

2014-01-21 Thread Brian Tarbox
The table was created this way, we also avoid altering exiting tables. On Tue, Jan 21, 2014 at 4:19 PM, Jacob Rhoden wrote: > Was the original table created, or created then altered? It makes a > difference as I have seen this type of thing occur on tables I first > created then updated. Not su

Re: bad interaction between CompositeTypes and Secondary index

2014-01-21 Thread Jacob Rhoden
Was the original table created, or created then altered? It makes a difference as I have seen this type of thing occur on tables I first created then updated. Not sure if that issue was fixed in 2.0.4, I'm avoiding altering tables completely for now. __ Sent from iPh

Re: data export with different replication factor.

2014-01-21 Thread Robert Coli
On Sat, Jan 18, 2014 at 11:29 AM, chandra Varahala < hadoopandcassan...@gmail.com> wrote: > I have 6 node cluster production cluster with replication factor of 3 with > 4 keyspaces, and 1 Test cluster with 2 nodes , is there a way I can export > data from production cluster and copy into test c

Re: upgrade from cassandra 1.2.3 -> 1.2.13 + start using SSL

2014-01-21 Thread Robert Coli
On Mon, Jan 20, 2014 at 3:22 AM, Cyril Scetbon wrote: > The only thing I'm worrying about is that I met a situation where I had a > lot of flushes on some nodes. You can find one of my system logs at > http://pastebin.com/YZKUQLXz. I'm not sure as I didn't let it run for > more than 4 minutes, but

bad interaction between CompositeTypes and Secondary index

2014-01-21 Thread Brian Tarbox
We're trying to use CompositeTypes and Secondary indexes and are getting an assertion failure in ExtendedFilter.java line 258 (running C* 2.0.3) when we call getIndexedColumns. The assertion is for not finding any columns. The strange bit is that if we re-create the column family in question and

RE: How to add a new DC to cluster in Cassandra 2.x

2014-01-21 Thread Lu, Boying
Thanks a lot. That’s what I want. From: Tupshin Harper [mailto:tups...@tupshin.com] Sent: 2014年1月21日 23:16 To: user@cassandra.apache.org Subject: Re: How to add a new DC to cluster in Cassandra 2.x This should be the doc you are looking for. http://www.datastax.com/documentation/cassandra/2.0/

RE: Question about node tool repair

2014-01-21 Thread Logendran, Dharsan (Dharsan)
Thanks Rob, Dharsan From: Robert Coli [mailto:rc...@eventbrite.com] Sent: January-21-14 2:26 PM To: user@cassandra.apache.org Subject: Re: Question about node tool repair On Mon, Jan 20, 2014 at 2:47 PM, Logendran, Dharsan (Dharsan) mailto:dharsan.logend...@alcatel-lucent.com>> wrote: We have

Re: Data modeling users table with CQL

2014-01-21 Thread Tupshin Harper
It's a broad topic, but I mean all of the best practices alluded to by writeups like this. http://www.technicalinfo.net/papers/WebBasedSessionManagement.html -Tupshin On Jan 21, 2014 11:37 AM, "Drew Kutcharian" wrote: > Cool. BTW, what do you mean by have additional session tracking ids? > What

Re: Question about node tool repair

2014-01-21 Thread Robert Coli
On Mon, Jan 20, 2014 at 2:47 PM, Logendran, Dharsan (Dharsan) < dharsan.logend...@alcatel-lucent.com> wrote: > We have a two node cluster with the replication factor of 2. The db > has more than 2500 column families(tables). The nodetool -pr repair on an > empty database(one or table has a l

Re: Upgrading 1.0.9 to 2.0

2014-01-21 Thread Robert Coli
On Mon, Jan 20, 2014 at 1:47 AM, Or Sher wrote: > Can I use sstableloader to load SSTables from a RandomPartitioner cluster > to a Murmuer3Partitioner cluster? > My expectation would be yes, if you try it and it works, let us know! =Rob

Re: Data modeling users table with CQL

2014-01-21 Thread Drew Kutcharian
Cool. BTW, what do you mean by have additional session tracking ids? What’d that be for? - Drew On Jan 21, 2014, at 10:48 AM, Tupshin Harper wrote: > It does sound right. > > You might want to have additional session tracking id's, separate from the > user id, but that is an additional imp

Re: Data modeling users table with CQL

2014-01-21 Thread Tupshin Harper
It does sound right. You might want to have additional session tracking id's, separate from the user id, but that is an additional implementation detail, and could be external to Cassandra. But the approach you describe accurately describes what I would do as a first pass, at least. -Tupshin On

Re: Data modeling users table with CQL

2014-01-21 Thread Drew Kutcharian
Thanks, I was actually thinking of doing that. Something along the lines of CREATE TABLE user ( idtimeuuid PRIMARY KEY, emailtext, nametext, ... ); CREATE TABLE user_email_index ( email text, id timeuuid, PRIMARY KEY (email, id) ); And during registration, I would ju

Re: Data modeling users table with CQL

2014-01-21 Thread Tupshin Harper
One CQL row per user, keyed off of the UUID. Another table keyed off of email, with another column containing the UUID for lookups in the first table. Only registration will require a lightweight transaction, and only for the purpose of avoiding duplicate email registration race conditions. -Tup

Re: Data modeling users table with CQL

2014-01-21 Thread Drew Kutcharian
A shameful bump ;) > On Jan 20, 2014, at 2:14 PM, Drew Kutcharian wrote: > > Hey Guys, > > I’m new to CQL (but have been using C* for a while now). What would be the > best way to model a users table using CQL/Cassandra 2.0 Lightweight > Transactions where we would like to have: > - A unique

Re: How to add a new DC to cluster in Cassandra 2.x

2014-01-21 Thread Tupshin Harper
This should be the doc you are looking for. http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html#cassandra/operations/ops_add_dc_to_cluster_t.html -Tupshin On Jan 21, 2014 2:14 AM, "Lu, Boying" wrote: > Hi, All, > > > > I’m new to Cassandra. I want to know how to add a DC to

Cassandra Complete Initialisation

2014-01-21 Thread Nigel LEACH
I have a crash and burn cluster, used for all sorts of integration testing (DataStax 2.0.2, five nodes, 8GB heap, two seeds, vnodes, RF 2). I wanted to completely initialise/refresh my environment, so did something like this (can't be sure something else did not slip in too), *Removed all user

How to add a new DC to cluster in Cassandra 2.x

2014-01-21 Thread Lu, Boying
Hi, All, I'm new to Cassandra. I want to know how to add a DC to existing Cassandra cluster (all running Cassandra 2.x). I found a related document at http://www.datastax.com/docs/1.1/cluster_management Is it still valid for Cassandra 2.x? Thanks Boying

Best design for a usecase ??

2014-01-21 Thread Naresh Yadav
Hi, I need to design a table which will give a UUID to set of tags. Each tag itself has unique UUID *TagCombination* table TC1 -> India, Pen TC2 -> Shampoo, U.K TC3 -> Team1, Product1, Location1 TC4 -> Office1, India, Pen I can have *billion *of such unique combinations and there can be

Long GC due to promotion failures

2014-01-21 Thread John Watson
Pretty reliable, at some point, nodes will have super long GCs. Followed by https://issues.apache.org/jira/browse/CASSANDRA-6592 Lovely log messages: 9030.798: [ParNew (0: promotion failure size = 4194306) (2: promotion failure size = 4194306) (4: promotion failure size = 4194306) (promotion