Re: Volunteers needed - Wiki

2011-10-10 Thread Sasha Dolgy
Maybe that should be the first wiki update: the TODO. On Tue, Oct 11, 2011 at 7:21 AM, Maki Watanabe wrote: > Hello aaron, > I raise my hand too. > If you have to-do list about the wiki, please let us know. > > maki >

Re: Multi DC setup

2011-10-10 Thread Cassa L
We already have two separate rings. The idea of bidirectional sync is that if one ring is down, we can still send the traffic to the other ring. When the original cluster comes back, it will pick up the data from the available cluster. I'm not sure if it makes sense to have separate rings or combine these two rings in

Re: Volunteers needed - Wiki

2011-10-10 Thread Maki Watanabe
Hello aaron, I raise my hand too. If you have to-do list about the wiki, please let us know. maki 2011/10/10 aaron morton : > Hi there, > The dev's have been very busy and Cassandra 1.0 is just around the corner > and full of new features. To celebrate I'm trying to give the wiki some > loving t

Re: Multi DC setup

2011-10-10 Thread Milind Parikh
Why have two rings? Cassandra manages the replication for you; one ring with physical nodes in two DCs might be a better option. Of course, depending on the inter-DC failure characteristics, you might need to endure split-brain for a while. /*** sent from my android...please pardo

Multi DC setup

2011-10-10 Thread Cassa L
I am trying to understand multi-DC setup for Cassandra. As I understand it, in this setup replicas exist in the same cluster ring, but the nodes are physically distributed across DCs. Is this correct? I have two different cluster rings in two DCs, and want to replicate data bidirectionally. They both have

Re: anyway to throttle nodetool repair?

2011-10-10 Thread Yan Chunlu
I am using commodity hardware, so even a minor compaction makes disk IO go to 100% and server load gets very high. On Tue, Oct 11, 2011 at 11:19 AM, Yan Chunlu wrote: > so how about disk io? is there anyway to use ionice to control it? > > I have tried to adjust the priority by "ionice -c3 -p [cassandra

Re: anyway to throttle nodetool repair?

2011-10-10 Thread Yan Chunlu
So how about disk IO? Is there any way to use ionice to control it? I have tried to adjust the priority with "ionice -c3 -p [cassandra pid]". It does not seem to work... On Wed, Sep 28, 2011 at 12:02 AM, Peter Schuller < peter.schul...@infidyne.com> wrote: > > I saw the ticket about compaction throttling

Re: cassandra on laptop

2011-10-10 Thread Peter Sanford
By default, Cassandra is configured to use half the ram of your system. That's way overkill for playing around with it on a laptop. Edit /etc/cassandra/cassandra-env.sh and set max_heap_size_in_mb to something more suited for your environment. I have it set to 256M for my laptop (with 4G of ram).

Efficiency of hector's setRowCount

2011-10-10 Thread Don Smith
Hector's IndexedSlicesQuery has a setRowCount method that you can use to page through the results, as described in https://github.com/rantav/hector/wiki/User-Guide . rangeSlicesQuery.setRowCount(1001); . rangeSlicesQuery.setKeys(lastRow.getKey(), ""); Is it efficient? Spe
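The paging pattern described above (ask for one row more than the page size, then use the extra row's key as the start of the next query) can be sketched in Python as follows. This is only an illustration under stated assumptions: `range_slice` is a hypothetical in-memory stand-in for a range query over keys in sorted order, not Hector's API, and it says nothing about how efficiently the server handles the real query.

```python
from bisect import bisect_left


def range_slice(sorted_keys, start_key, row_count):
    """Hypothetical stand-in for a range query: up to row_count keys
    starting at start_key (inclusive); an empty start_key means 'from the start'."""
    start = bisect_left(sorted_keys, start_key) if start_key else 0
    return sorted_keys[start:start + row_count]


def page_through(sorted_keys, page_size):
    """Yield pages of at most page_size keys.

    Each query asks for page_size + 1 rows; the extra row is not shown
    but its key becomes the (inclusive) start key of the next query.
    """
    start_key = ""
    while True:
        rows = range_slice(sorted_keys, start_key, page_size + 1)
        page = rows[:page_size]
        if page:
            yield page
        if len(rows) <= page_size:
            return  # no extra row came back, so this was the last page
        start_key = rows[-1]
```

Fetching the extra row avoids a second query just to learn whether another page exists, at the cost of reading one row twice per page.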

Re: Existing column(s) not readable

2011-10-10 Thread Thomas Richter
Hi Aaron, normally we use hector to access cassandra, but for debugging I switched to cassandra-cli. The column cannot be read by a simple get CFName['rowkey']['colname']; the response is "Value was not found". If I query another column, everything is just fine. Serverlog for unsuccessful read (keyspac

Re: Volunteers needed - Wiki

2011-10-10 Thread aaron morton
Thanks, Hani. If you would like to update the storage config page that would be handy. Just update http://wiki.apache.org/cassandra/FrontPage_draft_aaron/ to say you are working on it. Just click the login link at the top to set up an account. wrt setting up eclipse, perhaps yo

seeking contractor to assist with upgrade/expansion

2011-10-10 Thread Scott Dworkis
hope this is not off topic? we've been struggling following ostensible procedures for awhile now, ready to pony up for some pro help (but not quite ready to pony up for datastax). please contact me at svd at mylife dot com if you are interested. -scott

cassandra on laptop

2011-10-10 Thread Gary Jefferson
I'm running an underpowered laptop (ubuntu) for development work. Installing Cassandra was easy, and getting the twissandra example app up and working was also easy. Here's the problem: after about a day of letting it run (with no load generated to webapp or db), my laptop now becomes unrespons

Re: 0.7.9 RejectedExecutionException

2011-10-10 Thread Ashley Martens
It is actually not at the exact same time of the day. It varies but happens within certain blocks of time, like between 00hr and 02hr. It could be up for hours or it could crash again in 15 minutes. The memory is fine, just using a larger footprint than 0.6 in all ways. On Mon, Oct 10, 2011 at 1:

Re: 0.7.9 RejectedExecutionException

2011-10-10 Thread aaron morton
If the service keeps dying at the same time every day and there is nothing in the app logs, it's going to be something external. Sorry but I'm not sure what the problem with the memory usage is. Is the server running out of memory, or is it experiencing a lot of GC? Cheers - Aa

Re: Existing column(s) not readable

2011-10-10 Thread aaron morton
How are they unreadable ? You need to go into some details about what is going wrong. What sort of read ? What client ? What is in the logging on client and server side ? Try turning the logging up to DEBUG on the server to watch what happens. Cheers - Aaron Morton Freel

AUTO: Manoj Chaudhary is out of the office (returning 10/14/2011)

2011-10-10 Thread Manoj Chaudhary
I am out of the office until 10/14/2011. I am attending a conference in Europe and meeting customers and partners from 10/10/2011 to 10/15/2011. There might be a delay in responding to emails. I will try to respond to email periodically between meetings and some evenings in the local time zone

Re: MapReduce with two ethernet cards

2011-10-10 Thread Brandon Williams
On Mon, Oct 10, 2011 at 11:47 AM, Scott Fines wrote: > Hi all, > This may be a silly question, but I'm at a bit of a loss, and was hoping for > some help. > I have a Cassandra cluster set up with two NICs--one for internal > communication between cassandra machines (10.1.1.*), and one to respond t

Re: Volunteers needed - Wiki

2011-10-10 Thread Brandon Williams
On Mon, Oct 10, 2011 at 11:51 AM, hani elabed wrote: > Hi Aaron, > I can help with the documentation... I grabbed tons of screenshots as I was > installing Cassandra source trunk(1.0.0.rc2?) on my Mac OS X Snow leopard on > Eclipse Galileo and later Eclipse Indigo, I will be installing it on Eclip

Re: how to reduce disk read? (and bloom filter performance)

2011-10-10 Thread Mohit Anchlia
Does it mean you are not updating or deleting rows? Can you look at the JMX values of BloomFilter*? I don't believe the bloom filter false positive % value is configurable. Someone else might be able to shed more light on this. I believe if you want to keep disk seeks to 1 SSTable you will need

Re: Volunteers needed - Wiki

2011-10-10 Thread hani elabed
Hi Aaron, I can help with the documentation... I grabbed tons of screenshots as I was installing Cassandra source trunk(1.0.0.rc2?) on my Mac OS X Snow leopard on Eclipse Galileo and later Eclipse Indigo, I will be installing it on Eclipse for Ubuntu 10.04 soon. I took the screenshots after I notic

MapReduce with two ethernet cards

2011-10-10 Thread Scott Fines
Hi all, This may be a silly question, but I'm at a bit of a loss, and was hoping for some help. I have a Cassandra cluster set up with two NICs--one for internal communication between cassandra machines (10.1.1.*), and one to respond to Thrift RPC (172.28.*.*). I also have a Hadoop cluster se

Re: factors on the effectiveness of bloom filter?

2011-10-10 Thread Radim Kolar
On 10.10.2011 at 18:31, Yang wrote: I noticed that 2 of my CFs are showing very different bloom filter false ratios, one is close to 1.0; the other one is only 0.3 cassandra bloom filters are computed for 1% false positive ratio. is there any measure to increase the effectiveness of bloom fil
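For reference, the textbook Bloom filter arithmetic behind a target such as the 1% false positive ratio mentioned above can be sketched in Python. This is the standard formula, not Cassandra's implementation; the function names are made up for illustration. Note that key length does not appear anywhere in it: only the number of bits per key and the number of hash functions matter.

```python
import math


def bits_per_key(target_fp: float) -> float:
    """Bits per key needed to reach target_fp with an optimal hash count."""
    return -math.log(target_fp) / (math.log(2) ** 2)


def optimal_hashes(bits: float) -> int:
    """Number of hash functions that minimises false positives."""
    return max(1, round(bits * math.log(2)))


def expected_fp(bits: float, hashes: int) -> float:
    """Expected false positive probability: (1 - e^(-k / (m/n)))^k."""
    return (1.0 - math.exp(-hashes / bits)) ** hashes


bpk = bits_per_key(0.01)        # roughly 9.6 bits per key
k = optimal_hashes(bpk)         # 7 hash functions
fp = expected_fp(bpk, k)        # close to the 1% target
```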

factors on the effectiveness of bloom filter?

2011-10-10 Thread Yang
I noticed that 2 of my CFs are showing very different bloom filter false ratios, one is close to 1.0; the other one is only 0.3 they have roughly the same sizes in SStables and counts, the difference is key construction, the one with 0.3 false ratio has a shorter key. assuming the key can not be

Re: 0.7.9 RejectedExecutionException

2011-10-10 Thread Ashley Martens
I have checked both the output file and the system log; neither has errors in it. I don't believe anything external is killing the process. I could be wrong, but this node's setup is the same as all my other nodes (including hardware) so it doesn't make much sense. jsvc.exec -user cassandra -home

Re: A good key for data distribution over nodes

2011-10-10 Thread David McNelis
You should be ok, depending on the partitioner strategy you use. The keys end up created as a hash (which is why when you're setting up your nodes you can give them a specific key). Then, whatever your key is will be used to create an MD5 hash, and that hash will then determine what node your data wil
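As a rough illustration of the hashing described above, the following Python sketch hashes an hour of near-identical timestamp keys with MD5 and assigns each to one of four nodes. The token space, the evenly spaced token ranges, the node count, and the helper names are all assumptions made for this sketch; it is not Cassandra's partitioner code.

```python
import hashlib
from collections import Counter

RING_SIZE = 2 ** 127  # assumed token space for this sketch


def token_for(key: str) -> int:
    """Map a row key to a token by reading its MD5 digest as an integer."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % RING_SIZE


def node_for(key: str, node_count: int) -> int:
    """Pick the owning node, assuming evenly spaced token ranges."""
    return token_for(key) * node_count // RING_SIZE


# Hypothetical keys: one per second for an hour, e.g. 2011/10/10-16:07:53.
keys = [f"2011/10/10-16:{m:02d}:{s:02d}" for m in range(60) for s in range(60)]
counts = Counter(node_for(k, 4) for k in keys)
```

Even though consecutive keys differ by a single character, their hashes land far apart, so `counts` ends up with a roughly equal share of keys on each node.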

A good key for data distribution over nodes

2011-10-10 Thread Laurent Aufrechter
Hi, I am planning to run tests on Cassandra with a few nodes. I want to create a column family where the key will be the date down to the second (like 2011/10/10-16:07:53). Doing so, my keys will be very similar to each other. Is it ok to use such keys if I want my data to be evenly distribu

Volunteers needed - Wiki

2011-10-10 Thread aaron morton
Hi there, The devs have been very busy and Cassandra 1.0 is just around the corner and full of new features. To celebrate I'm trying to give the wiki some loving to make things a little more welcoming for new users. To keep things manageable I'd like to focus on completeness an

[RELEASE] Apache Cassandra 0.8.7 released

2011-10-10 Thread Sylvain Lebresne
The Cassandra team is pleased to announce the release of Apache Cassandra version 0.8.7. Cassandra is a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model. You can read more here: http://cassand

Re: "Insufficient space" on 1.0.0-rc2 when compacting compressed CFs

2011-10-10 Thread Sylvain Lebresne
On Mon, Oct 10, 2011 at 10:08 AM, Günter Ladwig wrote: > Hi, > > I couldn't find anything on this issue, but maybe my google-fu is weak. > > I'm running a Cassandra 1.0.0-rc2 cluster with compression enabled for all of > the two CFs I have right now. The load on a single node is about 32GB (disk

Re: Existing column(s) not readable

2011-10-10 Thread Thomas Richter
Hi, no errors in the server logs. The columns are unreadable on all nodes at any consistency level (ONE, QUORUM, ALL). We started with 0.7.3 and upgraded to 0.7.6-2 two months ago. Best, Thomas On 10/10/2011 10:03 AM, aaron morton wrote: > What error are you seeing in the server logs ? Are the

"Insufficient space" on 1.0.0-rc2 when compacting compressed CFs

2011-10-10 Thread Günter Ladwig
Hi, I couldn't find anything on this issue, but maybe my google-fu is weak. I'm running a Cassandra 1.0.0-rc2 cluster with compression enabled for all of the two CFs I have right now. The load on a single node is about 32GB (disk is 80GB per node). Whenever I try to run a compaction using nod

Re: ebs or ephemeral

2011-10-10 Thread Yi Yang
Agreed, EBS systems are not so good for Cassandra, and in previous conversations on this mailing list, people tend to use ephemeral. Sent from my BlackBerry® wireless device -Original Message- From: Sasha Dolgy Date: Mon, 10 Oct 2011 10:03:26 To: Reply-To: user@cassandra.apache.org Subject: Re: eb

Re: ebs or ephemeral

2011-10-10 Thread aaron morton
yes, should have been And an explanation of why we normally avoid *EBS*. My bad. - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 10/10/2011, at 9:03 PM, Sasha Dolgy wrote: > just catching the tail end of this discussion. aaron, in yo

Re: ebs or ephemeral

2011-10-10 Thread Sasha Dolgy
just catching the tail end of this discussion. aaron, in your previous email, you said "And an explanation of why we normally avoid ephemeral. " shouldn't this be, avoiding EBS? EBS was a nightmare for us in terms of performance. On Mon, Oct 10, 2011 at 9:23 AM, aaron morton wrote: > 6 nod

Re: Existing column(s) not readable

2011-10-10 Thread aaron morton
What error are you seeing in the server logs ? Are the columns unreadable at all Consistency Levels ? i.e. are the columns unreadable on all nodes. What is the upgrade history of the cluster ? What version did it start at ? Cheers - Aaron Morton Freelance Cassandra Developer

Re: 54 memtable flushes in hour at peaktime

2011-10-10 Thread aaron morton
It's not a problem by itself, compaction will do its thing. If you are also seeing read latency increase it may be something you want to look at. What version are you using? The tuning is different (i.e. it gets easier) between versions 0.7, 0.8 and 1.0. It's probably just the case that

Re: 0.7.9 RejectedExecutionException

2011-10-10 Thread aaron morton
Have you checked /var/log/cassandra/output.txt (the packaged install pipes std out/err to there) or the system logs ? If there are no errors in the logs it may well be something external killing it. With regard to memory usage, it's hard for people to help unless you provide some numbers. Wha

Re: ebs or ephemeral

2011-10-10 Thread aaron morton
6 nodes and RF3 will mean you can handle between 1 and 2 failed nodes. see http://thelastpickle.com/2011/06/13/Down-For-Me/ Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 7/10/2011, at 9:37 PM, Madalina Matei wrote: > Hi Aaron,
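The "between 1 and 2 failed nodes" figure above can be checked with a small brute-force sketch in Python. It assumes QUORUM reads and writes and a simple placement where each range lives on its owner plus the next RF - 1 nodes on the ring; neither assumption is stated in the message, and the function names are made up for illustration.

```python
from itertools import combinations


def replicas(owner, nodes, rf):
    """Replicas for a range: the owner plus the next rf - 1 nodes on the ring."""
    return {(owner + i) % nodes for i in range(rf)}


def quorum_available(down, nodes, rf):
    """True if every range still has a quorum of live replicas."""
    quorum = rf // 2 + 1
    return all(
        len(replicas(owner, nodes, rf) - down) >= quorum
        for owner in range(nodes)
    )


def survivable(nodes, rf, failures):
    """All sets of `failures` down nodes that keep QUORUM available everywhere."""
    return [
        combo
        for combo in combinations(range(nodes), failures)
        if quorum_available(set(combo), nodes, rf)
    ]
```

Under these assumptions any single failure is survivable, two failures are survivable only when the two nodes share no replica set (opposite each other on a 6-node ring), and three failures never are.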