Hi,
I am new to Cassandra and I am exploring the Hadoop integration (MapReduce)
provided by Cassandra.
I am trying to run the hadoop examples provided in the cassandra's repo
under examples/hadoop_cql3_word_count. I am using the cassandra-2.0 branch.
I have a single node cassandra running locally.
Hi,
Is there a way to monitor the progress of a hinted handoff task?
I found the following two mbeans providing some info:
org.apache.cassandra.internal:type=HintedHandoff, which tells me that there
is 1 active task, and
org.apache.cassandra.db:type=HintedHandoffManager#countPendingHints(),
whic
Hi, if you download the rpm from
http://rpm.datastax.com/community/noarch/, example
cassandra20-2.0.3-1.noarch.rpm , it should contain the cqlshlib
and it is package into /usr/lib/python2.6/site-packages/cqlshlib
hth
/Jason
On Tue, Dec 3, 2013 at 10:17 AM, Ritchie Iu wrote:
> No, there is no
Is there a way to know how much data is transferred between two nodes, or
more specifically, between two data centers?
I'm especially interested in how much data is being replicated from one
data center to another, to know how much of the available bandwidth is used.
Thanks,
Tom
Tom,
You should check the size of the hints column family to determine how much
are present. The hints are a super column family and its keys are
destination tokens. You could look at it if you would like.
Hints send and timedouts are logged, you should be seeing something like
Timed out replayi
Hi Rahul,
Thanks for your reply.
I have never seen message like "Timed out replaying hints to...", which is
a good thing then, I suppose ;)
Normally, I do see the "Finished hinted handoff..." log message. However,
every now and then this message is not logged, not even after several
hours. This
Tom,
Do you know why these hints are piling up? What is the size of the hints cf?
Thanks
Rahul
On Tue, Dec 3, 2013 at 6:41 PM, Tom van den Berge wrote:
> Hi Rahul,
>
> Thanks for your reply.
>
> I have never seen message like "Timed out replaying hints to...", which is
> a good thing then, I
Hi,
I have the impression that there is an issue with dropping a keyspace and then
recreating the keyspace (and column families), combined with a restart of the
database
My test goes as follows:
Create keyspace K and column families C.
Insert rows X0 column family C0
Query for X0 : found rows
Rahul,
This problem occurs every now and then, and currently everything is ok, so
there are no hints. But whenever it happens, the hints are quickly piling
up. This results in heap problems on the node ("Heap is 0.813462 full..."
appears many times). This in turn results in the flushing of the 'hi
Then my issue must be the 0.01% because
1) I'm running the repair as root.
2) The directory exists and the permissions are appropriate. root:root 755
3) The three times it occurred during the repair it always complained about
backups directories. But there are dozens other backups directories
Hi,
Are you running nodetool or cassandra as root? I think it doesn't really
matter what user is running the nodetool. Those directories should be
writable by the user who is running the actual cassandra process.
Hannu
2013/12/3 John Pyeatt
> Then my issue must be the 0.01% because
>
> 1)
Hi, Ross.
We had the same problem under the same version of Cassandra. We opted to copy
ALL the stables from the old cluster to each new node, then run nodetool
refresh. The missing rows have appeared after this procedure.
Best regards,
Francisco.
On Nov 27, 2013, at 7:49 PM, Ross Black wro
Both cassandra and nodetool are running as root.
also
ulimit -a
core file size (blocks, -c) 0
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 59450
max locked memory
On Tue, Dec 3, 2013 at 6:19 AM, John Pyeatt wrote:
> Then my issue must be the 0.01% because
>
> 1) I'm running the repair as root.
>
Huh? Repair doesn't care what user your shell is. It is a process built
into cassandra and has the permissions that cassandra does?
> 2) The directory exists
This is running the Amazon Linux OS which is essentially CentOS 6 I believe.
java version "1.6.0_45"
Java(TM) SE Runtime Environment (build 1.6.0_45-b06)
Java HotSpot(TM) 64-Bit Server VM (build 20.45-b01, mixed mode)
Installed cassandra 1.2.9 from
http://archive.apache.org/dist/cassandra/1.2.9/a
What is the best practice for modifying the primary key definition of a table
in Cassandra 1.2.9?
Say I have this table:
CREATE TABLE temperature (
weatherstation_id text,
event_time timestamp,
temperature text,
PRIMARY KEY (weatherstation_id,event_time)
);
I want to add a new colum
Thank you Team.
-Chandra
On Mon, Dec 2, 2013 at 7:32 PM, Robert Coli wrote:
> On Mon, Dec 2, 2013 at 8:16 AM, chandra Varahala <
> hadoopandcassan...@gmail.com> wrote:
>
>> I have cassandra cluster with 5 nodes with 1 replication factor initially.
>> Now i changed to replication factor 3 and
So Basically you want to create a cluster of multiple unique keys, but data
which belongs to one unique should be colocated. correct?
-Vivek
On Tue, Dec 3, 2013 at 10:39 AM, onlinespending wrote:
> Subject says it all. I want to be able to randomly distribute a large set
> of records but keep t
18 matches
Mail list logo