Hi,
it depends on your queries complexity - maybe secondary indexes would be
sufficient for you -
http://www.datastax.com/dev/blog/whats-new-cassandra-07-secondary-indexes
If your queries are too complex then you could use Pig (over Hadoop) -
http://www.slideshare.net/jeromatron/cassandrahadoop-i
Already submitted and fixed! Thanks Jonathan for your help on this. I really
appreciate it!
https://issues.apache.org/jira/browse/CASSANDRA-2158
regardless of increasing RF or not, RR happens based on the
read_repair_chance setting. RR happens after the request has been replied
to though, so it's possible that if you increase the RF and then read that
the read might get stale/missing data. RR would then put the correct value
on all the co
Cassy Andra wrote:
>
> Here is a blog my team is working on at Accenture which is intended to be
> a
> complete beginner's guide to Cassandra. I'm still updating a few posts
> based
> on DataStax's recommendations and I need to add the last three posts (will
> get this done soon), but you can st
When I increase the replication factor does the repair happen automatically
in background when client first tries to access data from the node where
data does not exist.
Or the nodetool repair need to run after increasing the replication factor.
--
View this message in context:
http://cassandra
no, it's actually worse to do that.
1) you're introducing single points of failure (your array).
2) you're introducing complexity and expense
3) you're introducing latency
4) you're introducing bottle necks
5) some other reasons...
You do want your commit log on a separate disk though. The o
On Mon, Feb 14, 2011 at 6:58 PM, Dan Hendry wrote:
> > 1) If I insert a key and want to verify which node it went to then how do
> I
> > do that?
>
> I don't think you can and there should be no reason to care. Cassandra
> abstracts where data is being stored, think in terms of consistency levels
I had actually meant to (and thought I did) type "greater than zero and less
than *or equal* to number of nodes".
That being said, you usually do want it less than the number of nodes in the
cluster because otherwise your cluster essentially has the same performance
as a single node. In general (
The latest official 0.6.x releases, 0.6.10 and 0.6.11, have a very serious
bug/regression when performing some quorum reads (CASSANDRA-2081), which is
fixed in the head of the 0.6 branch. If there aren't any plans to cut 0.6.12
any time soon, as an end user, I request that an official and "blessed"
On Mon, Feb 14, 2011 at 2:54 PM, Robert Coli wrote:
> Regarding very large memtables, it is important to recognize that
> throughput refers only to the size of the COLUMN VALUES, and not, for
> example, their names.
>
That would be a bug in it's own right. There are lots of use cases that
only
Anyone else in London interested in this?
--
From: "Jonathan Ellis"
Sent: Monday, February 14, 2011 10:30 PM
To: "user"
Subject: Fwd: SPA2011 - June 12th-15th - BCS London, UK - Call for Sessions
In case any of the London crowd is interested:
> 1) If I insert a key and want to verify which node it went to then how do
I
> do that?
I don't think you can and there should be no reason to care. Cassandra
abstracts where data is being stored, think in terms of consistency levels
not actual nodes.
> 2) How can I verify if the replication is
He probably meant in production. When playing around, and if you only have 2 nodes, you can set it to 2. From memory RF of 2 means the Quorum is also 2, so you cannot afford to lose one. Thats fine for playing. Aaron On 15 Feb, 2011,at 01:51 PM, mcasandra wrote:
mcasandra wrote:
>
> In earlier p
Anyone else in London interested in this?
--
From: "Jonathan Ellis"
Sent: Monday, February 14, 2011 10:30 PM
To: "user"
Subject: Fwd: SPA2011 - June 12th-15th - BCS London, UK - Call for Sessions
In case any of the London crowd is interested:
mcasandra wrote:
>
> In earlier post same thread you mentioned that replication factor should
> be set to less than N.
>
> Currently I am testing on 2 node cluster and I was able to set
> replication_factor to 2 (=N) and also when I did cfstats (I don't quite
> understand cfstats in detail) and
In earlier post same thread you mentioned that replication factor should be
set to less than N.
Currently I am testing on 2 node cluster and I was able to set
replication_factor to 2 (=N) and also when I did cfstats (I don't quite
understand cfstats in detail) and see some activity on both nodes
Couple of questions:
1) If I insert a key and want to verify which node it went to then how do I
do that?
2) How can I verify if the replication is working. That is how do I check
that CF row got inserted in 2 nodes if replication factor is set to 2.
3) What happens if I just update the keyspace
On Mon, Feb 14, 2011 at 6:28 PM, Aaron Morton wrote:
> Will take a closer look at the code tonight, perhaps we should return an
> error if you try to using Network Topology it cannot detect any DC's .
>
>
+1
For now, I have committed a change in the misleading documentation,
substituting SimpleStrategy for NTS.
Sorry you ran into trouble due to that, mcasandra.
On Mon, Feb 14, 2011 at 4:28 PM, Aaron Morton wrote:
> Will take a closer look at the code tonight, perhaps we should return an
> error if y
Will take a closer look at the code tonight, perhaps we should return an error if you try to using Network Topology it cannot detect any DC's . CheersAaron On 15 Feb, 2011,at 01:22 PM, mcasandra wrote:
That's what I thought might be happening since network topology will try to
find one node on the
That's what I thought might be happening since network topology will try to
find one node on the other data center. Message is little confusing though.
[default@unknown] update keyspace twissandra
placement_strategy='org.apache.cassandra.locator.SimpleStrategy';
Syntax error at position 28: miss
Not sure why the docs suggest to use the NetworkTopologyStrategy, if their are no data centres configured the NetworkTopologyStrategy will say the replication factor is 0. I think this is the source of the "invalid response count 1 for replication factor 0" message. Can you try with the SimpleStra
No it's not set to 0. I am just following the example on datastax getting
started site. Here are all the commands:
[default@unknown] create keyspace twissandra with replication_factor=1
... and
placement_strategy='org.apache.cassandra.locator.NetworkTopologyStrategy';
22dea790-3893-11e0-9174-
Is your ReplicationFactor (RF) really set to 0? Don't do that, it needs to
be at least 1 and probably needs to be 3 in production if you care about
your data. It must be greater than 0 and less than the number of nodes in
your ring. It represents the number of nodes to copy/replicate data to.
An
I am following example posted on
http://www.datastax.com/dev/tutorials/getting_started_0.7/using_cli#cassandra-cli
cli
I am seeing:
$ set users['jsmith']['password']='ch@ngem3';
Internal error processing insert
In the logs I see:
java.lang.AssertionError: invalid response count 1 for replica
nodes contain data for (prevTokenInRing, nodesOwnToken] (i.e. exclusive from
previous token to inclusive of the nodes token). So .179 will contain
things that hash in the range (152896308109140433971537345591636551711,0]
and .12 will contain things that hash in range
(0,152896308109140433971537345
I installed cassandra and started it in multi-node. I set the InitialToken to
0. I ran nodetool and see:
$ nodetool -h localhost ring
Address Status State LoadOwnsToken
152896308109140433971537345591636551711
172.16.
In case any of the London crowd is interested:
-- Forwarded message --
From: Mike Hill
Date: Mon, Feb 14, 2011 at 4:14 PM
Subject: SPA2011 - June 12th-15th - BCS London, UK - Call for Sessions
To: "mikewh...@gmail.com"
SPA2011 - June 12th-15th - BCS London, UK - Call for Sessi
Dan Kuebrich wrote:
>
> You may find this part of the wiki helpful:
> http://wiki.apache.org/cassandra/Operations#Range_changes
>
> "If you explicitly specify an InitialToken in the configuration, the new
> node will bootstrap to that position on the ring. Otherwise, it will pick
> a
> Token th
On Mon, 2011-02-14 at 16:50 -0500, Jake Luciani wrote:
> It can take some time for the files to propagate to the mirrors. It's
> Eventually Consistent though :)
Preferably they'd 404 when that happens though. :)
--
Eric Evans
eev...@rackspace.com
On Mon, 2011-02-14 at 14:20 -0700, Frank LoVecchio wrote:
> Ah, I meant quite a few of the mirror links keep showing up as links
> to gossip sites and whatnot.
I suspect those mirrors are broken. Can you submit a ticket to
https://issues.apache.org/jira/browse/INFRA for any like that you find?
It can take some time for the files to propagate to the mirrors. It's
Eventually Consistent though :)
On Mon, Feb 14, 2011 at 4:20 PM, Frank LoVecchio wrote:
> Ah, I meant quite a few of the mirror links keep showing up as links to
> gossip sites and whatnot.
> On Feb 14, 2011 2:17 PM, "Norman
You may find this part of the wiki helpful:
http://wiki.apache.org/cassandra/Operations#Range_changes
"If you explicitly specify an InitialToken in the configuration, the new
node will bootstrap to that position on the ring. Otherwise, it will pick a
Token that will give it half the keys from the
Ah, I meant quite a few of the mirror links keep showing up as links to
gossip sites and whatnot.
On Feb 14, 2011 2:17 PM, "Norman Maurer" wrote:
Hi all,
I've build a web application using Cassandra.
Data are stored in order to be quickly red/sorted due to my web-app needs.
Everything is working quite good.
Now the big "problem" is that the "other side" of my company needs to create
reports over these data and the query they need to do wou
Huh,
isn't that what mirrors are supposed to be for ?
Bye,
Norman
2011/2/14 Frank LoVecchio :
> Did the site get hacked?
> http://www.apache.org/dyn/closer.cgi?path=/cassandra/0.7.1/apache-cassandra-0.7.1-bin.tar.gz
> Sources keep changing...
>
> On Mon, Feb 14, 2011 at 1:13 PM, Eric Evans wrot
Hi,
I don't know how data on at least one of our five nodes in our cluster got
bad but what I'm seeing is that the key value output from sstable2json
doesn't match the key value that I specify on the command line.
For example, I do the following on one of the nodes:
bash$:/root Mon Feb 14 12:47:
I am trying to understand atleast to some level of detail about how random
partitioner works. With the text I have seen on the website I am not able to
clearly understand. Is there a place where it's described with an example,
for eg how nodes are assigned random tokens? Is the range picked random
Did the site get hacked?
http://www.apache.org/dyn/closer.cgi?path=/cassandra/0.7.1/apache-cassandra-0.7.1-bin.tar.gz
Sources keep changing...
On Mon, Feb 14, 2011 at 1:13 PM, Eric Evans wrote:
>
> Today is Valentine's Day[1] in many parts of the world, an annual
> commemoration of love and af
Ouch, that redesign is a bit nasty. jna.jar in this folder is the same as the one I last got 3.2.7http://java.net/projects/jna/sources/svn/show/trunk/jnalib/dist?rev=1182AaronOn 15 Feb, 2011,at 09:48 AM, mcasandra wrote:
In Cassandra documentation it recommends downloading jna.jar. However I am
un
On Sat, Feb 12, 2011 at 11:17 PM, E S wrote:
> While experimenting with this, I found a bug where you can't have memtable
> throughput configured past 2 gigs without an integer overflow screwing up the
> flushes. That makes me feel like I'm in uncharted territory :).
I am sure the project would
In Cassandra documentation it recommends downloading jna.jar. However I am
unable to see any jar files on the mentioned website
http://java.net/projects/jna/ http://java.net/projects/jna/
Do I need to download the source instead and then compile it?
--
View this message in context:
http://cas
Today is Valentine's Day[1] in many parts of the world, an annual
commemoration of love and affection typically celebrated with candy,
stuffed animals, and floral arrangements.
She may seem a bit a fickle at times, but Cassandra loves you, and since
most people would rather receive a gift of the
hi,
a few weeks back this topic had some discussion (cassandra as a session
store). subsequently, i threw together a quick hack to have PHP use
Cassandra as a session store. A benefit I quickly found is that I could
rely on Cassandra to expire the sessions and not PHP session garbage
collection.
Here is a blog my team is working on at Accenture which is intended to be a
complete beginner's guide to Cassandra. I'm still updating a few posts based
on DataStax's recommendations and I need to add the last three posts (will
get this done soon), but you can start checking it out via this link:
Huh... I usually insert, compact, then flush. Apparently I've been
doing it wrong my whole life. So it needs like a courtesy flush. Let
me try that :)
-Kal
On Thu, Feb 10, 2011 at 3:06 AM, Sylvain Lebresne wrote:
> Kal, you may have to flush before compacting.
> If you insert then compact, then
Well, we have had some problems when a compactation was kicking into the
system under heavy load on version 0.6.3. This caused some nodes to respond
slow and caused some application instability.
What we do, playing with compactation strategy, was to move the major
compactation to a fixed hour of t
On Sun, Feb 13, 2011 at 2:28 AM, Sylvain Lebresne wrote:
>
>
> On Sat, Feb 12, 2011 at 2:52 AM, Xiaobo Gu wrote:
>>
>> On Fri, Feb 11, 2011 at 11:54 PM, Sylvain Lebresne
>> wrote:
>> >
>> >
>> > On Fri, Feb 11, 2011 at 4:27 PM, Xiaobo Gu
>> > wrote:
>> >>
>> >> On Fri, Feb 11, 2011 at 11:21 PM,
Thanks for the clarifications..
On Mon, Feb 14, 2011 at 6:13 PM, Sylvain Lebresne wrote:
> On Mon, Feb 14, 2011 at 11:27 AM, Aditya Narayan wrote:
>
>> Thanks Sylvain,
>>
>> I guess I might have misunderstood the meaning of column_index_size_in_kb,
>> My previous understanding about that was: it
On Mon, Feb 14, 2011 at 11:27 AM, Aditya Narayan wrote:
> Thanks Sylvain,
>
> I guess I might have misunderstood the meaning of column_index_size_in_kb,
> My previous understanding about that was: it is the threshold size for a row
> to pass, after which its columns will be indexed.
>
It is the
Thanks.
On Mon, Feb 14, 2011 at 7:59 PM, nicolas lattuada
wrote:
> Hi
>
> I have a two nodes cluster running and have both of them in the seeds list.
>
> regards
>
>> Date: Sun, 13 Feb 2011 16:04:41 +0800
>> Subject: Question about seeds in tow node cluster.
>> From: guxiaobo1...@gmail.com
>> To:
Hi
I have a two nodes cluster running and have both of them in the seeds list.
regards
> Date: Sun, 13 Feb 2011 16:04:41 +0800
> Subject: Question about seeds in tow node cluster.
> From: guxiaobo1...@gmail.com
> To: user@cassandra.apache.org
>
> Hi,
> If the cluster only have tow nodes, shou
Thanks Sylvain,
I guess I might have misunderstood the meaning of column_index_size_in_kb,
My previous understanding about that was: it is the threshold size for a row
to pass, after which its columns will be indexed.
If I have understood it correctly, it implies the size of the "blocks
(containi
As said by aaron, if the whole row is under 64k, it won't matter. But since
you spoke of very wide row, I'll assume the whole will be much more than
64k.
If so, the row is indexed by block (of 64k, configurable). Then the read
performance depends on how many of those block are needed for the query
54 matches
Mail list logo