Well, Amazon is expensive. Hetzner will sell you dedicated SSD RAID-1 servers
with 32GB RAM and 4 cores with HT for €59/mth. However, if pricing is an
issue, you could start with:
1 server : read at ONE, write at ONE, RF=1. You will have consistency, but not
high availability. This is the sam
Amazon seems to much overprice its services. If you look out for a similar
size deployment elsewhere like linode or digital ocean(very competitive
pricing), you'll notice huge differences. Ok, some services & features are
extra but may we all don't need them necessarily & when you can host on
non-d
3 node EC2 m1.xlarge is ~ $1000/k month + any incidental costs ( s3
backups, transfer out of the AZ ), etc ) or ~$300/month after a ~$1400
upfront 1 year reservation fee.
There are some uncomfortable spots when compaction kicks on concurrently
for several large CF's but otherwise its been performa
>From what I understood "tons" of people are running things on ec2, but it
could be the instance size is pretty large that it compares to a dedicated
server (especially if you go with SSD, it is like 1K/month!)
On Tue, Aug 6, 2013 at 3:54 AM, Aaron Morton wrote:
> how many nodes to start with(2
I'm running cassandra-1.2.8 in a cluster with 45 nodes across three racks. All
nodes are well behaved except one. Whenever I start this node, it starts
churning CPU. Running nodetool tpstats, I notice that the number of pending
gossip stage tasks is constantly increasing [1]. When looking at nodeto
@Aaron
This problem happens when you drop and recreate a keyspace with the same
name and you do it very quickly. I have also filed a JIRA for it
https://issues.apache.org/jira/browse/CASSANDRA-5843
On Tue, Aug 6, 2013 at 10:31 AM, Keith Wright wrote:
> The file does not appear on disk and the
The file does not appear on disk and the permissions are definitely correct.
We have seen the file in snapshots. This is completely blocking us from
adding the new node. How can we recover? Just run repairs?
Thanks
From: Aaron Morton mailto:aa...@thelastpickle.com>>
Reply-To: "user@cassand
http://cassandra.apache.org/doc/cql3/CQL.html#selectStmt
try `and t > 111 and t < 222' or >= and <= if you want inclusive.
On Tue, Aug 6, 2013 at 10:35 AM, Keith Freeman <8fo...@gmail.com> wrote:
> I've been looking at examples about modeling series data in Cassandra, and
> in one experiment cre
I've been looking at examples about modeling series data in Cassandra,
and in one experiment created a table like this:
create table vvv (k text, t bigint, value text, primary key (k, t));
After inserting some data with identical k values and differing t
values, I tried this query (which is nea
On 6 August 2013 16:56, Keith Freeman <8fo...@gmail.com> wrote:
Your description makes me think that if new rows are added during the
> paging (i.e. between one select with token()'s and another), they might
> show up in the query results, right? (because the hash of the new row keys
> might fal
Ok, I get that, I'll have to find another way to sort out new rows.
Your description makes me think that if new rows are added during the
paging (i.e. between one select with token()'s and another), they might
show up in the query results, right? (because the hash of the new row
keys might fa
Hi Richard,
Thanks for your reply.
The uid value is a generated guid and should distribute nicely.
I've just checked the data yesterday there are only 3 uids out of millions
for which there would have been more than 1000 increments.
We started with 256 num_tokens.
Client and server side I c
On 6 August 2013 15:12, Keith Freeman <8fo...@gmail.com> wrote:
> I've seen in several places the advice to use queries like to this page
> through lots of rows:
>
>
> select id from mytable where token(id) > token(last_id)
>
>
> But it's hard to find detailed information about how this works (at
I've seen in several places the advice to use queries like to this page
through lots of rows:
select id from mytable where token(id) > token(last_id)
But it's hard to find detailed information about how this works (at
least that I can understand -- the description in the Cassandra manual
is p
Thanks Aaron. I found that before I asked the question and Helenos seems the
closest but it does not allow you to easily use CRUD like say SQL Server
Management tools where you can get a list of say 1,000 records in a grid
control and select rows for deletion or insert or update.
I will look c
Counters can be atomically incremented (
http://wiki.apache.org/cassandra/Counters). Pick a UUID for the counter,
and use that: c=map.get(k); c.incr()
On 6 August 2013 11:01, Jan Algermissen wrote:
>
> On 06.08.2013, at 11:36, Andy Twigg wrote:
>
> > Store pointers to counters as map values?
>
On 06.08.2013, at 11:36, Andy Twigg wrote:
> Store pointers to counters as map values?
Sorry, but this fits into nothing I know about C* so far - can you explain?
Jan
Store pointers to counters as map values?
Alain,
On 06.08.2013, at 11:17, Alain RODRIGUEZ wrote:
> Once again, this should answer your question :
> http://cassandra.apache.org/doc/cql3/CQL.html#collections
yup, I understand the hint :-) However, since I am about to base application
architecture on these capabilities, I wanted to make
Once again, this should answer your question :
http://cassandra.apache.org/doc/cql3/CQL.html#collections
Alain
2013/8/6 Jan Algermissen
> Hi,
>
> I think it does not fit the model of how C* does writes, but just to
> verify:
>
> Is there an update-in-place possibility on maps? That is, could I
Hi Jan
"TTLs if used only apply to the newly inserted/updated values", from :
http://cassandra.apache.org/doc/cql3/CQL.html#collections
This manual is updated often enough to be up to date, and so, useful, you
should keep it bookmarked.
Alain
2013/8/6 Jan Algermissen
> Hi,
>
> after seeing
On 6 August 2013 08:40, Aaron Morton wrote:
> The reason for me looking at virtual nodes is because of terrible
> experiences we had with 0.8 repairs and as per documentation (an logically)
> the virtual nodes seems like it will help repairs being smoother. Is this
> true?
>
> I've not thought to
Hi,
after seeing Patrick's truly excellent 3-part series on modeling, this question
pops up:
When I do an update on a collection, using a TTL in the update statement (like
Patrick does in the example with the login-location time series example), does
the TTL apply to the update only, or to the
Hi,
I think it does not fit the model of how C* does writes, but just to verify:
Is there an update-in-place possibility on maps? That is, could I do an atomic
increment on a value in a map?
Jan
On Tue, Aug 6, 2013 at 6:10 PM, Aaron Morton wrote:
> Yes. If you overwrite much older data with new data both "versions" of the
> column will remain on disk until compaction get's to work on both fragments
> of the row.
>
thanks
>
> Cheers
>
> -
> Aaron Morton
> Cassandra Cons
Yes. If you overwrite much older data with new data both "versions" of the
column will remain on disk until compaction get's to work on both fragments of
the row.
Cheers
-
Aaron Morton
Cassandra Consultant
New Zealand
@aaronmorton
http://www.thelastpickle.com
On 6/08/2013, at
> Caused by: java.io.FileNotFoundException:
> /data/1/cassandra/data/rts/40301_feedProducts/rts-40301_feedProducts-ib-1-Data.db
> (No such file or directory)
> at java.io.RandomAccessFile.open(Native Method)
> at java.io.RandomAccessFile.(RandomAccessFile.java:233)
> at
>
Repair runs in two phases, first it works out the differences then it streams
the data. The length of the first depends on the size of the data and the
second on the level of inconsistency.
To track the first use nodetool compaction stats or look in the logs for the
messages about requesting o
> how many nodes to start with(2 ok?) ?
I'd recommend 3, that will give you some redundancy see
http://thelastpickle.com/2011/06/13/Down-For-Me/
Cheers
-
Aaron Morton
Cassandra Consultant
New Zealand
@aaronmorton
http://www.thelastpickle.com
On 5/08/2013, at 1:41 AM, Rajkumar G
> So from anyones experience, is it better to use a low cardinality
> partition key or a high cardinality.
IMHO go with whatever best supports the read paths. They all get
If you have lots (e.g. north of 1 billion) rows per node there are extra
considerations that come into play. Cassandra 1.2 he
> The reason for me looking at virtual nodes is because of terrible experiences
> we had with 0.8 repairs and as per documentation (an logically) the virtual
> nodes seems like it will help repairs being smoother. Is this true?
I've not thought too much about how they help repair run smoother, wh
There is a list here.
http://wiki.apache.org/cassandra/Administration%20Tools
Cheers
-
Aaron Morton
Cassandra Consultant
New Zealand
@aaronmorton
http://www.thelastpickle.com
On 3/08/2013, at 6:19 AM, Tony Anecito wrote:
> Hi All,
>
> Is there a GUI tool for managing data i
32 matches
Mail list logo