In your case -pr would be just fine (see Viktor's explanation).
2012/6/5 Viktor Jevdokimov
> Understand simple mechanics first, decide how to act later.
>
> ** **
>
> Without –PR there’s no difference from which host to run repair, it runs
> for the whole 100% range, from start to end, the
Every CF has a certain amount of overhead in memory. It's just not how
Cassandra is designed to be used. Maybe you could think of a way to smash
data down to indices and entities. With an abstraction layer you can store
practically anything in Cassandra.
2012/6/5 Toru Inoko
> IMHO a model that a
On Tue, Jun 5, 2012 at 8:44 AM, Viktor Jevdokimov <
viktor.jevdoki...@adform.com> wrote:
> Understand simple mechanics first, decide how to act later.
>
> ** **
>
> Without –PR there’s no difference from which host to run repair, it runs
> for the whole 100% range, from start to end, the whol
But in any case, repair is a two way process?
I mean that repair without -PR on node N1 will repair N1 and N2 and N3, because
N2 is a replica of N1 range and N1 is a replica of N3 range?
And if there're more ranges, that not belongs to N1, that ranges and nodes will
not be repaired?
Am I unders
Dear all,
I am writing a java code using Hector but when I am configuring the already
created key space using HFactory.createKeySpace its giving me the error Null
Point Exception. What am I doing wrong. Please help me.
Thanks and Regards
Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376
Check if you have rightly provided your cluster name.
Regards
Harsh
From: Prakrati Agrawal [mailto:prakrati.agra...@mu-sigma.com]
Sent: Tuesday, June 05, 2012 2:55 PM
To: user@cassandra.apache.org
Subject: How to configure the Cassandra connection using Hector
Dear all,
I am writing a java code
cluster = HFactory.getCluster("Test Cluster");
keySpace = HFactory.createKeyspace("mstream", cluster);
These are the lines that I am executing and getting an error
Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com
From: Harshvardhan Ojha [mailto:harshvardhan.o...@makemy
It seems you are missing ip:port like:
cluster = HFactory.getOrCreateCluster("TestCluster", "localhost:9160");
If this doesn't work please provide complete stack trace.
Regards
Harsh
From: Prakrati Agrawal [mailto:prakrati.agra...@mu-sigma.com]
Sent: Tuesday, June 05, 2012 3:00 PM
To: user@cassa
In the following line give
cluster = HFactory.getCluster("Test Cluster",ip:9160);
ip =ip address of your machine and respective port no.
Regards
Arshad
From: Prakrati Agrawal [prakrati.agra...@mu-sigma.com]
Sent: Tuesday, June 05, 2012 3:00 PM
To: user@cassandra.a
Thank you all for the quick response :)
Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com
From: MOHD ARSHAD SALEEM [mailto:marshadsal...@tataelxsi.co.in]
Sent: Tuesday, June 05, 2012 3:07 PM
To: user@cassandra.apache.org
Subject: RE: How to configure the Cassandra connec
Hi,
We are using a 9 nodes cassandra cluster, and each node is currently
handling about 40GB of data.
Our CFs were created using LevedCompactionStrategy, with the default
options (size of SSTables = 5MB).
So the number of SSTables was going quite large, which led to problems
described in CASSAND
Dear all
I am using a two node Cassandra cluster. How do I code in Java using Hector to
get data from both the nodes. Please help
Thanks and Regards
Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com
This email message may contain prop
Use Consistency Level =2.
Regards
Harsh
From: Prakrati Agrawal [mailto:prakrati.agra...@mu-sigma.com]
Sent: Tuesday, June 05, 2012 4:08 PM
To: user@cassandra.apache.org
Subject: How to include two nodes in Java code using Hector
Dear all
I am using a two node Cassandra cluster. How do I code in
I am trying to understand the variance in flushes frequency in a 8
node Cassandra cluster.
All the flushes are of the same type and initiated by MeteredFlusher.java =>
"INFO [OptionalTasks:1] 2012-06-05 06:32:05,873 MeteredFlusher.java
(line 62) flushing high-traffic column family CFS(Keyspace='St
Hi all,
A question has come up in our team about the performance impact of
static vs dynamic columns. We'd like to ask two questions:
Quick background: We are using a custom app to write to Cassandra using
Hector. Production is Solaris and pre-prod is generally Centos. We're
currently on 0.7
The Cassandra users mailing list is a better place for this question, so
I'm moving it there.
Some comments inline:
On Tue, Jun 5, 2012 at 6:47 AM, Juan Ezquerro LLanes wrote:
> I have a columnfamily like:
>
> CREATE COLUMN FAMILY Watchdog
> WITH key_validation_class =
> 'CompositeType(Lexic
In theory the the logging configurator is watching the log4j-server.properties
file and check for changes every 10 seconds. I've nevery had much luck with it,
but assumed it was me getting something wrong.
Or you can modify the values on the fly using the setLog4jLevel() on the
StorageService
> With an abstraction layer you can store practically anything in Cassandra.
See virtual keyspaces in Hector.
> why do you think so? I'll let users create ristricted CFs, and limit a number
> of CFs which users create.
> is it still a bad one?
Depends what your limits are, but in general still y
why do you think so? I'll let users create ristricted CFs, and limit a
number of CFs which users create.
> is it still a bad one?
>>
> Ok, get it, you want to limit the cf user can create (assume) 2, what
about 10k shared users creating 2 cf each=> 20k CF ~~20GB memory used with
no data in it. Do
-pr is a new feature added in 1.0. It was added for efficiency, not
functionality. With -pr repair does 1/RF the work it does without it.
> Am I understood correctly, that “repair” with or without –PR is not a “repair
> selected node” process, but “synchronize data range(s) between replicas”
>
Not that I am aware of. AFAIK the files should eventually get to that size.
Anyone else ?
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 5/06/2012, at 9:44 PM, Samuel CARRIERE wrote:
> Hi,
> We are using a 9 nodes cassandra cluster, and
I have a 4-node cluster with one keyspace (aside from the system keyspace)
with the replication factor set to 4. The disk usage between the nodes is
pretty wildly different and I'm wondering why. It's becoming a problem
because one node is getting to the point where it sometimes fails to
compact
See the section on memtable_total_space_in_mb here
http://thelastpickle.com/2011/05/04/How-are-Memtables-measured/
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 6/06/2012, at 2:27 AM, rohit bhatia wrote:
> I am trying to understand the v
Alas, upgrading to 1.1.1 did not solve my issue.
-Original Message-
From: Brandon Williams [mailto:dri...@gmail.com]
Sent: Monday, June 04, 2012 11:24 PM
To: user@cassandra.apache.org
Subject: Re: memory issue on 1.1.0
Perhaps the deletes: https://issues.apache.org/jira/browse/CASSANDRA-
(I'm assuming you are talking about column values here)
> 1. Does specifying the type of a column affect performance other than the
> cost of validating data as it is stored?
> e.g. does it help compaction, etc?
No.
Validation is normally pretty light weight.
> From my reading of the docs the
Thank you for all the replies. It has been enlightening to read. I think I
now have a better idea of repair, ranges, replicas and how the data is
distributed. It also seems that using -pr would be the best way to go in my
scenario with 1.x+
Thank you for all the feedback. Glad to see such an acti
Ok, thanks for your reply Aaron.
I'll eventually be happy then :)
Cheers,
aaron morton
05/06/2012 20:52
Veuillez répondre à
user@cassandra.apache.org
A
user@cassandra.apache.org
cc
Objet
Re: Recreate LevelCompacted SSTables with a new size
Not that I am aware of. AFAIK the files shou
Hi Wade
I don't know if your scenario matches mine, but I've been struggling with
memory pressure in 1.x as well. I made the jump from 0.7.9 to 1.1.0, along
with enabling compression and levelled compactions, so I don't know which
specifically is the main culprit.
Specifically, all my nodes
Hi,
To verify that the repair was successful, you can look for this kind of
messages in the log :
INFO [AntiEntropyStage:1] 2012-05-19 00:57:52,351 AntiEntropyService.java
(line 762) [repair #e46a0a90-a13c-11e1--596f3d333ab7] UsersCF is fully
synced (3 remaining column family to sync for t
Thank you. I do have some of the same observations. Do you do deletes?
My observation is that without deletes (or column updates I guess) I can run
forever happy. but when I run (what for me is a batch process) operations that
delete and modify column values I run into this.
Reading bug http
On Mon, Jun 4, 2012 at 2:34 PM, aaron morton wrote:
> IIRC index slices work a little differently with consistency, they need to
> have CL level nodes available for all token ranges. If you drop it to CL
> ONE the read is local only for a particular token range.
>
Yes, this is what we observed. W
There is no error in the log about the streaming.
And thanks for the information, we will try 1.1 when we start upgrade.
koji
2012/6/5 aaron morton
> Are their any errors in the logs about failed streaming ?
>
> If you are getting time outs 1.0.8 added a streaming socket timeout
> https://gith
Ok, so I have completely refactored to remove deletes and it still fails. So it
is completely unrelated to deletes.
I guess I need to go back to 1.0.10? When I originally evaluated I ran
1.0.8... perhaps I went a bridge too far with 1.1.
I don't think I am doing anything exotic here.
Here is
But the data is distributed on the nodes ( meaning 50% of data is on one node
and 50% of data is on another node) so I need to specify the node ip address
somewhere in the code. But where do I specify that is what I am clueless about.
Please help me
Prakrati Agrawal | Developer - Big Data(I&D)|
They
1) setup a two node cluster and loaded 500K rows or something
2) add a third node, run nodetool move
3) while moving they pull the plug on the node
Cassandra won't start with the exception below. Now, this is obviously a very
exceptional situation but the question is posed
I don't use hector, don't know much about internals, this may help
* Cluster cluster = HFactory.getOrCreateCluster("
TestCluster","host1:9160,host2:9160,host3:9160")*
If you have 2 node cluster with RF=2, your data will be present in both
node. And if consistency level 2 is used both node must b
Hi Aaron
Thanks for the link, I have gone through it. But this doesn't justify
nodes of exactly same config/specs differing in their flushing
frequency.
The traffic on all node is same as we are using RandomPartitioner
Thanks
Rohit
On Wed, Jun 6, 2012 at 12:24 AM, aaron morton wrote:
> See the
Dear all
I am trying to check the performance of Cassandra on adding or removing nodes.
I want to know what happens to my existing data if I remove a node ? Please
help me
Thanks and Regards
Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com
_
Hi All,
I am using Hector client for cassandra . I wanted to know how to create
keyspace and column family using API's to read and write data.
or i have to create keyspace and column family manually using command line
interface.
Regards
Arshad
In Hector when you create a cluster using the API, you specify an IP address &
cluster name. Thereafter internally which node serves the request or how many
nodes need to be contacted to read/write data depends on the cluster
configuration i.e. Whats your replication strategy, factor, consiste
Thank you for the reply.
Now I have decommissioned a node but now I don't know how to recommission it
.Please help me
Thanks and Regards
Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com
-Original Message-
From: Roshni Rajagopal [mailto:roshni.rajago...@wal-mar
Dear all
I decommissioned a node. Now I want to make that node a part of the ring again.
How do I do it? Please help me
Thanks and Regards
Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | www.mu-sigma.com
This email message may contain proprietary, p
42 matches
Mail list logo