Re: open source cassandra and hadoop

2014-11-28 Thread Tim Dunphy
Great! Thanks for these suggestions. I'll look into these tomorrow. Tim Sent from my iPhone > On Nov 28, 2014, at 10:31 PM, Jason Wee wrote: > > There are two examples of hadoop with cassandra in the examples codes, > https://github.com/apache/cassandra/tree/trunk/examples/hadoop_word_count

Re: open source cassandra and hadoop

2014-11-28 Thread Jason Wee
There are two examples of hadoop with cassandra in the examples codes, https://github.com/apache/cassandra/tree/trunk/examples/hadoop_word_count https://github.com/apache/cassandra/tree/trunk/examples/hadoop_cql3_word_count Does these help? Jason On Sat, Nov 29, 2014 at 2:30 AM, Tim Dunphy wro

Re: Nodes get stuck in crazy GC loop after some time, leading to timeouts

2014-11-28 Thread graham sanderson
I should note that the young gen size is just a tuning suggestion, not directly related to your problem at hand. You might want to make sure you don’t have issues with key/row cache. Also, I’m assuming that your extra load isn’t hitting tables that you wouldn’t normally be hitting. > On Nov 28

Re: Nodes get stuck in crazy GC loop after some time, leading to timeouts

2014-11-28 Thread graham sanderson
Your GC settings would be helpful, though you can see guesstimate by eyeballing (assuming settings are the same across all 4 images) Bursty load can be a big cause of old gen fragmentation (as small working set objects tends to get spilled (promoted) along with memtable slabs which aren’t flush

Re: Column family ID mismatch-Error on concurrent schema modifications

2014-11-28 Thread Eric Stevens
@Jens, > will "inactive" CFs be released from C*'s memory after i.e. a few days > or when under resource pressure? No, certain memory structures are allocated and will remain resident on each node for as long as the table exists. > These CFs are used as "time buckets", but are to be kept for spe

Re: Date Tiered Compaction Strategy and collections

2014-11-28 Thread Eric Stevens
The underlying write time is still tracked for each value in the collection - it's part of how conflict resolution is managed - but it's not exposed through CQL. On Fri Nov 28 2014 at 4:18:47 AM Batranut Bogdan wrote: > Hello all, > > If one has a table like this: > id text, > ts timestamp > val

Nodes get stuck in crazy GC loop after some time, leading to timeouts

2014-11-28 Thread Paulo Ricardo Motta Gomes
Hello, This is a recurrent behavior of JVM GC in Cassandra that I never completely understood: when a node is UP for many days (or even months), or receives a very high load spike (3x-5x normal load), CMS GC pauses start becoming very frequent and slow, causing periodic timeouts in Cassandra. Tryi

open source cassandra and hadoop

2014-11-28 Thread Tim Dunphy
Hey all, I have a 3 node Cassandra cluster I would like to hook into hadoop for processing the information in the Cassandra DB. I know that Datastax version of Cassandra includes support for Hadoop right out of the box. But I've been googling around and I don't see any good information on how to

Re: Column family ID mismatch-Error on concurrent schema modifications

2014-11-28 Thread Robert Wille
I would suggest that dynamic table creation is, in general, not a great idea, regardless of the database. I would seriously consider altering your approach to use a fixed set of tables. On Nov 28, 2014, at 1:53 AM, Marcus Olsson mailto:marcus.ols...@ericsson.com>> wrote: Hi, We encountered th

Re: mysql based columnar DB to Cassandra DB - Migration

2014-11-28 Thread Jack Krupansky
Planet Cassandra has some resource pages related to migrations to Cassandra. For HBase: http://planetcassandra.org/hbase-to-cassandra-migration/ There are pages for migration from Oracle, MySQL, MongoDB, and Redis, as well. -- Jack Krupansky From: Akshay Ballarpure Sent: Friday, November 28, 2

Date Tiered Compaction Strategy and collections

2014-11-28 Thread Batranut Bogdan
Hello all, If one has a table like this:id text,ts timestampvalues list  PK (id,ts)  How will the DTCS work? I am asking this because the writeTime() function does not work on collections.

Re: Column family ID mismatch-Error on concurrent schema modifications

2014-11-28 Thread Marcus Olsson
Hi, We encountered the same problem and created a JIRA for it: https://issues.apache.org/jira/browse/CASSANDRA-8387 . /Marcus O On 11/27/2014 04:19 PM, DuyHai Doan wrote: Hello Peter For safe concurrent table creation, use CREATE TABLE xxx IF NOT EXISTS. It will use light weight transacti