Avoiding Data Duplication

2015-06-05 Thread Abhishek Singh Bailoo
Hello! I have a column family to log in data coming from my GPS devices. CREATE TABLE log( imei ascii, date ascii, dtime timestamp, data ascii, stime timestamp, PRIMARY KEY ((imei, date), dtime)) WITH CLUSTERING ORDER BY (dtime DESC) ; It is the standard schema for modeling time se

Re: Decommission datacenter - repair?

2015-06-05 Thread Robert Coli
On Fri, Jun 5, 2015 at 5:15 AM, Jens Rantil wrote: > Datastax's documentation on "Decommissioning a data center > " > tells me to run a full repair and then decommission each node. Isn't > decommissioning go

Re: sstableloader usage doubts

2015-06-05 Thread Robert Coli
On Fri, Jun 5, 2015 at 7:53 AM, Sebastian Estevez < sebastian.este...@datastax.com> wrote: > Since you only restored one dc's sstables, you should be able to rebuild > them on the second DC. > > Refresh means pick up new SSTables that have been directly added to the > data directory. > > Rebuild m

Re: TTL and gc_grace_period

2015-06-05 Thread Tyler Hobbs
On Fri, Jun 5, 2015 at 11:02 AM, Kévin LOVATO wrote: > Great, so is there any reason I wouldn't want to set gc_grace_seconds to 0 > on an "insert once/ttl only" column family, since it feels like the best > thing to do? Nope, setting gc_grace_seconds to 0 is just fine in your case. -- Tyler

Re: TTL and gc_grace_period

2015-06-05 Thread Kévin LOVATO
Great, so is there any reason I wouldn't want to set gc_grace_seconds to 0 on an "insert once/ttl only" column family, since it feels like the best thing to do? On Fri, Jun 5, 2015 at 5:35 PM, Tyler Hobbs wrote: > > On Fri, Jun 5, 2015 at 10:30 AM, Kévin LOVATO wrote: > >> >> I have a column fa

Re: TTL and gc_grace_period

2015-06-05 Thread Tyler Hobbs
On Fri, Jun 5, 2015 at 10:30 AM, Kévin LOVATO wrote: > > I have a column family with data (metrics) that is never overwritten and > only deleted using TTLs, and I am wondering if it would be reasonable to > have a very low gc_grace_period (even 0) on that CF. I would like to do > that mainly to s

TTL and gc_grace_period

2015-06-05 Thread Kévin LOVATO
Hi, I have a column family with data (metrics) that is never overwritten and only deleted using TTLs, and I am wondering if it would be reasonable to have a very low gc_grace_period (even 0) on that CF. I would like to do that mainly to save space and also to prevent tombstone scanning. >From wha

Re: sstableloader usage doubts

2015-06-05 Thread Sebastian Estevez
Since you only restored one dc's sstables, you should be able to rebuild them on the second DC. Refresh means pick up new SSTables that have been directly added to the data directory. Rebuild means stream data from other replicas to re create SSTables from scratch. On Jun 5, 2015 6:40 AM, "ZeroUn

Re: Decommission datacenter - repair?

2015-06-05 Thread Kiran mk
Yes, but will not handover the data to any node, it rather intimates that other nodes of the replicas so that they rebalance the nodes with the help of repair. Best Regards, Kiran.M.K. On Jun 5, 2015 6:38 PM, "Jens Rantil" wrote: > Hi Kiran, > > So, am I understanding you correctly that a deco

Re: sstableloader usage doubts

2015-06-05 Thread ZeroUno
Il 04/06/15 19:50, Robert Coli ha scritto: http://www.pythian.com/blog/bulk-loading-options-for-cassandra/ Thank you Rob, but actually it doesn't matter to me which method is used, I can use both nodetool refresh or sstableloader, as long as they work! ;-) My problem here is that it looks

Re: sstableloader usage doubts

2015-06-05 Thread ZeroUno
Il 04/06/15 17:17, Sebastian Estevez ha scritto: If you have all the sstables for each node and no token range changes, you can just move the sstables to their spot in the data directory (rsync or w/e) and bring up your nodes. If you're already up you can use nodetool refresh to load the sstable

Re: Decommission datacenter - repair?

2015-06-05 Thread Jens Rantil
Hi Kiran, So, am I understanding you correctly that a decommissioning node only will hand over its data to a single node? If it would hand it over to all other replica nodes, I see that essentially as an implicit repair. Am I wrong? Thanks, Jens On Fri, Jun 5, 2015 at 2:27 PM, Kiran mk wrote:

Re: Decommission datacenter - repair?

2015-06-05 Thread Kiran mk
Hi Jens, If you decommission a data center, The data residing in the Data Center which you are planning for decommission has to be balanced to the nodes of the other data center satisfying RF. Hence Repair is required. Best Regards, Kiran.M.K. On Fri, Jun 5, 2015 at 5:45 PM, Jens Rantil wrote

Decommission datacenter - repair?

2015-06-05 Thread Jens Rantil
Hi, I asked this on IRC earlier today, but didn't get any response; Datastax's documentation on "Decommissioning a data center " tells me to run a full repair and then decommission each node. Isn't decommiss

Re: Throttle Heavy Read / Write Loads

2015-06-05 Thread Graham Sanderson
Are you doing large batch inserts via thrift - you need to be careful there Sent from my iPhone > On Jun 4, 2015, at 11:37 PM, Anishek Agarwal wrote: > > may be just increase the read and write timeouts at cassandra currently at 5 > sec i think. i think the datastax java client driver provides