Hello,
Most drivers will handle the load balancing for you and provide policies
for configuring your desired approach for load balancing, i.e. load balance
around the entire ring or localize around a specific DC. Your clients will
leverage the driver for connections so that the client machines
Thank you, Ben, for the links.
On Tue, Apr 29, 2014 at 3:40 AM, Ben Bromhead wrote:
> Some imbalance is expected and considered normal:
>
> See http://wiki.apache.org/cassandra/VirtualNodes/Balance
>
> As well as
>
> https://issues.apache.org/jira/browse/CASSANDRA-7032
>
> Ben Bromhead
> Instac
Some imbalance is expected and considered normal:
See http://wiki.apache.org/cassandra/VirtualNodes/Balance
As well as
https://issues.apache.org/jira/browse/CASSANDRA-7032
Ben Bromhead
Instaclustr | www.instaclustr.com | @instaclustr | +61 415 936 359
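
To illustrate why an exact 50/50 split is not expected with virtual nodes, here is a minimal simulation sketch. It is not Cassandra code: the function name `ownership`, the ring length, and the choice of 256 random tokens per node are assumptions made for illustration only.

```python
import random

# Length of the token ring; an illustrative assumption, any large number works.
RING_SIZE = 2**64

def ownership(num_nodes, vnodes_per_node, seed=0):
    """Return the fraction of the ring owned by each node when every
    node picks `vnodes_per_node` random tokens."""
    rng = random.Random(seed)
    tokens = []
    for node in range(num_nodes):
        for _ in range(vnodes_per_node):
            tokens.append((rng.randrange(RING_SIZE), node))
    tokens.sort()

    owned = [0] * num_nodes
    # Each token owns the range between the previous token and itself.
    # Starting from the last token shifted back by one ring length
    # handles the wrap-around for the first token.
    prev = tokens[-1][0] - RING_SIZE
    for tok, node in tokens:
        owned[node] += tok - prev
        prev = tok
    return [o / RING_SIZE for o in owned]

fractions = ownership(num_nodes=2, vnodes_per_node=256)
print(fractions)
```

With randomly chosen tokens the two fractions are typically close to, but not exactly, 0.5 each, which matches the "some imbalance is expected" remark above.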
On 29 Apr 2014, at 7:30 am, DuyHai Doan w
Hello all
Some update about the issue.
After completely wiping all the sstable/commitlog/saved_caches folders and
restarting the cluster from scratch, we still see weird figures. After
the restart, nodetool status does not show an exact balance of 50% of data
for each node:
Status=Up/Down
|/
I don't know about Hector, but the DataStax Java driver needs just one IP from
the cluster and it will discover the rest of the nodes. Then by default it will
round-robin when sending requests. So if Hector does the same, the pattern
will appear again.
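
As a rough illustration of the round-robin behaviour described here, the sketch below cycles through a host list, handing out one host per request. It is not the driver's implementation; `round_robin` and the host names are hypothetical.

```python
from itertools import cycle

def round_robin(hosts):
    """Return an iterator that yields the hosts in a repeating order,
    one host per request."""
    return cycle(hosts)

picker = round_robin(["node1", "node2"])
first_four = [next(picker) for _ in range(4)]
print(first_four)
```

If requests really are spread this way, both nodes should receive roughly the same number of client connections, so a lopsided CPU load would have to come from somewhere else.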
Did you look at the size of the dirs?
T
I did some experiments.
Let's say we have node1 and node2
First, I configured Hector with node1 & node2 as hosts and I saw that only
node1 has high CPU load
To eliminate the "client connection" issue, I re-tested with only node2
provided as host for Hector. Same pattern. CPU load is above 50% on
Htop is not the only tool for this. Cassandra will hit I/O bottlenecks before
CPU (on faster CPUs). A simple solution is to check the size of the data dir
on the boxes. If they have approximately the same size, then Cassandra is writing to the
whole cluster. Check how the data dir size changes when impor
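
A minimal sketch of the data-directory check suggested above. The helper names `dir_size_bytes` and `relative_spread` are hypothetical, and the data directory path is an assumption that depends on how each node is configured.

```python
import os

def dir_size_bytes(path):
    """Sum the sizes of all regular files under `path`.
    Returns 0 if the directory does not exist."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            if os.path.isfile(full):
                total += os.path.getsize(full)
    return total

def relative_spread(sizes):
    """Return (max - min) / max for a list of sizes; 0.0 means equal."""
    biggest = max(sizes)
    return 0.0 if biggest == 0 else (biggest - min(sizes)) / biggest
```

One way to use it is to run `dir_size_bytes` on each node against its data directory (for example `/var/lib/cassandra/data`, if that is where the node stores data), then pass the collected numbers to `relative_spread`. A value near 0.0 suggests writes are reaching the whole cluster.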
On 04/24/2014 10:29 AM, DuyHai Doan wrote:
Client used = Hector 1.1-4
Default Load Balancing connection policy
Both nodes' addresses are provided to Hector so, according to its
connection policy, the client should alternate between both nodes
OK, so is only one connection being e
Hello Michael
RF = 1
Client used = Hector 1.1-4
Default Load Balancing connection policy
Both nodes' addresses are provided to Hector so, according to its connection
policy, the client should alternate between both nodes
Regards
Duy Hai DOAN
On Thu, Apr 24, 2014 at 4:37 PM, Mic
On 04/24/2014 09:14 AM, DuyHai Doan wrote:
My customer has a cluster with 2 nodes only. I've set virtual nodes so
future addition of new nodes will be easy.
with RF=?
Now, after some benchmark tests with massive data inserts, I can see
with "htop" that one node has its CPU usage up to
Check nodetool ring to see what state the nodes are in, they all need to be UP.
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 15/03/2012, at 7:31 PM, Rishabh Agrawal wrote:
> Hello,
>
> I initially had a two-node cluster and now I joined
Mubarak Seyed apple.com> writes:
>
> - How does client (application) connect to cassandra cluster? Is it always for
one node (and thrift can get ring info) and send the request to connected node
This depends on the client library you use. Any Cassandra node can accept client
connections and forward
That means they are blocking for something to be added to the task queue
On Mon, May 17, 2010 at 9:42 AM, Joost Ouwerkerk wrote:
> At any given moment at least half of those threads are in the following
> state; what does it represent?
> Name: ROW-READ-STAGE:6
> State: WAITING on
> java.util.conc
At any given moment at least half of those threads are in the following
state; what does it represent?
Name: ROW-READ-STAGE:6
State: WAITING on
java.util.concurrent.locks.abstractqueuedsynchronizer$conditionobj...@fea6030
Total blocked: 44 Total waited: 479
Stack trace:
sun.misc.Unsafe.park(Nati
On Sun, May 16, 2010 at 2:52 PM, Joost Ouwerkerk wrote:
> Meanwhile, I'm still getting TimedOutException errors when mapping this
> 30-million row table, even when retrieving no data at all. It looks like it
> is related to disk activity on "hot" nodes (when the same cassandra node has
> to handl
Hadoop doesn't make any assumptions about how input source data is
distributed. It can't 'know' that the data for the first 30 splits emitted
by the InputFormat are all stored on the same cassandra node.
The new case with the patch is CASSANDRA-1096
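
To make the problem concrete: if the first 30 splits all live on one node, that node takes all the early load. The sketch below shows one possible mitigation, reordering splits so that consecutive ones target different hosts. It is an illustration only and not necessarily what the patch in CASSANDRA-1096 does; `interleave_by_host` and the `(host, split_id)` tuples are hypothetical.

```python
from collections import defaultdict, deque

def interleave_by_host(splits):
    """Reorder (host, split_id) pairs so that consecutive splits hit
    different hosts where possible, preserving per-host order."""
    by_host = defaultdict(deque)
    for host, split_id in splits:
        by_host[host].append((host, split_id))

    queues = deque(by_host.values())
    ordered = []
    while queues:
        q = queues.popleft()
        ordered.append(q.popleft())
        if q:  # host still has splits left, put it at the back of the line
            queues.append(q)
    return ordered

splits = [("a", 1), ("a", 2), ("b", 3), ("b", 4)]
print(interleave_by_host(splits))
```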
Meanwhile, I'm still getting TimedOutException
Oh, very interesting. I assumed Hadoop would be smart enough to
load-balance the jobs it sends out. Guess not.
Can you submit a patch?
On Wed, May 12, 2010 at 12:32 PM, Joost Ouwerkerk wrote:
> I've been trying to improve the time it takes to map 30 million rows using a
> hadoop / cassandra cl
nodetool ring
http://wiki.apache.org/cassandra/NodeProbe
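
For the "simple telnet" style of check asked about below, a rough equivalent is a plain TCP connection attempt. This is only a sketch: `can_connect` is a hypothetical helper, and the port to probe is an assumption (9160 is the default Thrift port, but a node may be configured differently). An open port only shows that something is listening; nodetool ring remains the way to see the node's state in the cluster.

```python
import socket

def can_connect(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds
    within `timeout` seconds, False otherwise."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, `can_connect("node1.example.com", 9160)` would probe a node at that (made-up) address on the assumed Thrift port.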
On Fri, Mar 26, 2010 at 10:37 AM, Y Aw wrote:
> Yes it does...
>
> Is there an easy way to know if a node is down or cannot reply to queries (a
> simple telnet command) ?
>
>
>
>
>
> 2010/3/25 Jeremy Dunck
>>
>> On Thu, Mar 25, 2010 at 1:
On Thu, Mar 25, 2010 at 1:20 PM, Y Aw wrote:
> Hi all,
> I have a question about load-balancing.
http://wiki.apache.org/cassandra/FAQ#node_clients_connect_to
Does that help?