RE: Questions related to the data in SSTable files

2013-10-23 Thread java8964 java8964
ate: Tue, 22 Oct 2013 17:52:24 -0700 Subject: Re: Questions related to the data in SSTable files From: rc...@eventbrite.com To: user@cassandra.apache.org On Tue, Oct 22, 2013 at 5:17 PM, java8964 java8964 wrote: Any way I can verify how often the system being "repaired"? I can ask a

RE: Questions related to the data in SSTable files

2013-10-22 Thread java8964 java8964
he regular good data in memtable, then in the SSTable files. Yong Date: Tue, 22 Oct 2013 14:50:07 -0700 Subject: Re: Questions related to the data in SSTable files From: rc...@eventbrite.com To: user@cassandra.apache.org On Tue, Oct 22, 2013 at 2:29 PM, java8964 java8964 wrote: 1) In the da

Questions related to the data in SSTable files

2013-10-22 Thread java8964 java8964
Hi, I have some questions related the data in the SSTable files. Our production environment has 36 boxes, so in theory 12 of them will make one group of data without replication. Right now, I got all the SSTable files from 12 nodes of the cluster (Based on my understanding, these 12 nodes are one

Is it possible to control the sstable file size in incremental backup or snapshot

2013-09-20 Thread java8964 java8964
Hi, The current our production is using Cassandra 1.0, and will upgrade to 1.1 next week. I noticed the snapshot and incremental backup sstable files size generated from our production environment vary dramatically. Some files can be hundreds of M, or even close to G, but a lot of files are even

RE: cassandra just gone..no heap dump, no log info

2013-09-18 Thread java8964 java8964
We faced same issue sometimes too. 1) Linux OOM killer kill your Cassandra process. You should find this event log in /var/log/message.2) The JVM crashed. You should be able to find the hs_err_pid file under /tmp folder, if you didn't specify the location when you started your JVM. We still cann

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
Quote: " To be clear, "incremental backup" feature backs up the data being modified in that period, because it writes only those files to the incremental backup dir as hard links, between full snapshots." I thought I was clearer, but your clarification confused me again.My understanding so far

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
Another question related to the SSTable files generated in the incremental backup is not really ONLY incremental delta, right? It will include more than delta in the SSTable files. I will use the example to show my question: first, we have this data in the SSTable file 1: rowkey(1), columns (make

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
ent value > > When it finally compacts, the old value can go away. > > Dean > > From: java8964 java8964 mailto:java8...@hotmail.com>> > Reply-To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>" > mailto:user@cassandra.a

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
row of (key, column, column_value = Gold, hex, #FFD700) > memtable is flushed => Color-4-Data.db ---- > > Then, a compaction will merge all those fragments together into the latest > ones as follows. > > - Color-5-Data.db: [{Lavender: {hex: #E6E6FA}, {Aqua: {hex: #00}, {Gre

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
* assuming RandomPartitioner is used Hope they would help. - Takenori (2013/09/17 10:51), java8964 java8964 wrote: Hi, I have some questions related to the SSTable in the Cassandra, as I am doing a project

questions related to the SSTable file

2013-09-16 Thread java8964 java8964
Hi, I have some questions related to the SSTable in the Cassandra, as I am doing a project to use it and hope someone in this list can share some thoughts. My understand is the SSTable is per column family. But each column family could have multi SSTable files. During the runtime, one row COULD s

RE: is there any type of table existing on all nodes(slow to up date, fast to read in map/reduce)?

2013-09-14 Thread java8964 java8964
Or some configuration in Cassandra integration part of Hadoop, that tells Cassandra we know this table is small enough, make it a distribute Cache in hadoop, in all the MR jobs generated in Cassandra. Date: Fri, 13 Sep 2013 14:06:50 -0700 Subject: Re: is there any type of table existing on all n

SSTableReader support files in Distribute File System

2013-09-05 Thread java8964 java8964
Hi, I don't think currently Cassandra can support this, but if it does, can someone tell me how, or is it reasonable to ask this feature and where should I submit it to? I am thinking that SSTableReader class, and underline supporting classes, should support handling the files in a DFS, like CFS

RE: map/reduce performance time and sstable readerÅ .

2013-09-03 Thread java8964 java8964
I am trying to do the same thing, as in our project, we want to load the data from Cassandra into Hadoop cluster, and SSTable is one obvious option, as you can get the changed data since last batch loading directly from the SSTable incremental backup files. But, based on so far my research (I ma