Re: Load balancing in C* Cluster

2014-10-28 Thread Jonathan Lacefield
Hello, Most drivers will handle the load balancing for you and provide policies for configuring your desired approach for load balancing, i.e. load balance around the entire ring or localize around a specific DC. Your clients will leverage the driver for connections so that the client machines

Re: Load balancing issue with virtual nodes

2014-04-29 Thread DuyHai Doan
Thanks you Ben for the links On Tue, Apr 29, 2014 at 3:40 AM, Ben Bromhead wrote: > Some imbalance is expected and considered normal: > > See http://wiki.apache.org/cassandra/VirtualNodes/Balance > > As well as > > https://issues.apache.org/jira/browse/CASSANDRA-7032 > > Ben Bromhead > Instac

Re: Load balancing issue with virtual nodes

2014-04-28 Thread Ben Bromhead
Some imbalance is expected and considered normal: See http://wiki.apache.org/cassandra/VirtualNodes/Balance As well as https://issues.apache.org/jira/browse/CASSANDRA-7032 Ben Bromhead Instaclustr | www.instaclustr.com | @instaclustr | +61 415 936 359 On 29 Apr 2014, at 7:30 am, DuyHai Doan w

Re: Load balancing issue with virtual nodes

2014-04-28 Thread DuyHai Doan
Hello all Some update about the issue. After wiping completely all sstable/commitlog/saved_caches folder and restart the cluster from scratch, we still experience weird figures. After the restart, nodetool status does not show an exact balance of 50% of data for each node : Status=Up/Down |/

Re: Load balancing issue with virtual nodes

2014-04-24 Thread Batranut Bogdan
I don't know about hector but the datastax java driver needs just one ip from the cluster and it will discover the rest of the nodes. Then by default it will do a round robin when sending requests. So if Hector does the same the patterb will againg appear. Did you look at the size of the dirs? T

Re: Load balancing issue with virtual nodes

2014-04-24 Thread DuyHai Doan
I did some experiments. Let's say we have node1 and node2 First, I configured Hector with node1 & node2 as hosts and I saw that only node1 has high CPU load To eliminate the "client connection" issue, I re-test with only node2 provided as host for Hector. Same pattern. CPU load is above 50% on

Re: Load balancing issue with virtual nodes

2014-04-24 Thread Batranut Bogdan
Htop is not the only tool for this . Cassandra will hit io bottlnecks before cpu (on faster cpus) . A simple solution is to check the size of the data dir on the boxes. If you have aprox the same size then cassandra is wrinting in the whole cluster. Check how the data dir size changes when impor

Re: Load balancing issue with virtual nodes

2014-04-24 Thread Michael Shuler
On 04/24/2014 10:29 AM, DuyHai Doan wrote: Client used = Hector 1.1-4 Default Load Balancing connection policy Both nodes addresses are provided to Hector so according to its connection policy, the client should switch alternatively between both nodes OK, so is only one connection being e

Re: Load balancing issue with virtual nodes

2014-04-24 Thread DuyHai Doan
Hello Michael RF = 1 Client used = Hector 1.1-4 Default Load Balancing connection policy Both nodes addresses are provided to Hector so according to its connection policy, the client should switch alternatively between both nodes Regards Duy Hai DOAN On Thu, Apr 24, 2014 at 4:37 PM, Mic

Re: Load balancing issue with virtual nodes

2014-04-24 Thread Michael Shuler
On 04/24/2014 09:14 AM, DuyHai Doan wrote: My customer has a cluster with 2 nodes only. I've set virtual nodes so future addition of new nodes will be easy. with RF=? Now, after some benching tests with massive data insert, I can see with "htop" that one node has its CPU occupation up to

Re: Load balancing using 'nodetool move'

2012-03-15 Thread aaron morton
Check nodetool ring to see what state the nodes are in, they all need to be UP. Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 15/03/2012, at 7:31 PM, Rishabh Agrawal wrote: > Hello, > > I initially had two node cluster and now I joined

Re: Load balancing

2010-06-18 Thread Oleg Anastasjev
Mubarak Seyed apple.com> writes: > > - How does client (application) connect to cassandra cluster? Is it always for one node (and thrift can get ring info) and send the request to connected node This depends on client library you use. Any cassandra node can accept client connections and forward

Re: Load Balancing Mapper Tasks

2010-05-17 Thread Jonathan Ellis
That means they are blocking for something to be added to the task queue On Mon, May 17, 2010 at 9:42 AM, Joost Ouwerkerk wrote: > At any given moment at least half of those threads are in the following > state; what does it represent? > Name: ROW-READ-STAGE:6 > State: WAITING on > java.util.conc

Re: Load Balancing Mapper Tasks

2010-05-17 Thread Joost Ouwerkerk
At any given moment at least half of those threads are in the following state; what does it represent? Name: ROW-READ-STAGE:6 State: WAITING on java.util.concurrent.locks.abstractqueuedsynchronizer$conditionobj...@fea6030 Total blocked: 44 Total waited: 479 Stack trace: sun.misc.Unsafe.park(Nati

Re: Load Balancing Mapper Tasks

2010-05-16 Thread Jonathan Ellis
On Sun, May 16, 2010 at 2:52 PM, Joost Ouwerkerk wrote: > Meanwhile. I'm still getting TimedOutException errors when mapping this > 30-million row table, even when retrieving no data at all.  It looks like it > is related to disk activity on "hot" nodes (when the same cassandra node has > to handl

Re: Load Balancing Mapper Tasks

2010-05-16 Thread Joost Ouwerkerk
Hadoop doesn't make any assumptions about how input source data is distributed. It can't 'know' that the data for the first 30 splits emitted by the InputFormat are all stored on the same cassandra node. The new case with the patch is CASSANDRA-1096 Meanwhile. I'm still getting TimedOutException

Re: Load Balancing Mapper Tasks

2010-05-15 Thread Jonathan Ellis
Oh, very interesting. I assumed Hadoop would be smart enough to load-balance the jobs it sends out. Guess not. Can you submit a patch? On Wed, May 12, 2010 at 12:32 PM, Joost Ouwerkerk wrote: > I've been trying to improve the time it takes to map 30 million rows using a > hadoop / cassandra cl

Re: Load balancing and Failover

2010-03-26 Thread Jonathan Ellis
nodetool ring http://wiki.apache.org/cassandra/NodeProbe On Fri, Mar 26, 2010 at 10:37 AM, Y Aw wrote: > Yes it does... > > Is there an easy way to know if a node is down or cannot reply to queries (a > simple telnet command) ? > > > > > > 2010/3/25 Jeremy Dunck >> >> On Thu, Mar 25, 2010 at 1:

Re: Load balancing

2010-03-25 Thread Jeremy Dunck
On Thu, Mar 25, 2010 at 1:20 PM, Y Aw wrote: > Hi all, > I have a question about load-balancing. http://wiki.apache.org/cassandra/FAQ#node_clients_connect_to Does that help?