I'll also add that you are probably running into some memory issues; 2.5 GB is
a low heap size.
> -Xms2500M -Xmx2500M -Xmn400M
If you really do have a cache and want to reduce the disk activity, disable
durable_writes on the KS. That will stop the writes from going to the commit
log which is on
> So are hints to a given UUID discarded after some period of time with
> that UUID not present in the cluster?
From memory they are TTL'd at the gc_grace_seconds for the CF.
Cheers
-
Aaron Morton
Freelance Cassandra Consultant
New Zealand
@aaronmorton
http://www.thelastpickle.co
> When you say aggregates, do you mean converting 1 minute data to 15 minute
> data
That one.
Essentially replicating a multi dimensional cube; a fact table with dimensions
and aggregates of the fact properties calculated for the dimension values,
including hierarchical dimensions such as time
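
The roll-up mentioned above (1 minute data to 15 minute data) can be sketched roughly as follows. This is only an in-memory illustration, not anything taken from the thread: the names `bucket_start` and `roll_up` are hypothetical, samples are assumed to be `(timestamp, value)` pairs, and the bucket size is assumed to divide 60 evenly.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def bucket_start(ts, minutes=15):
    """Round a timestamp down to the start of its N-minute bucket.

    Assumes `minutes` divides 60 evenly (hypothetical helper).
    """
    return ts - timedelta(
        minutes=ts.minute % minutes,
        seconds=ts.second,
        microseconds=ts.microsecond,
    )


def roll_up(samples, minutes=15):
    """Aggregate (timestamp, value) samples into per-bucket count/sum/min/max."""
    buckets = defaultdict(
        lambda: {"count": 0, "sum": 0.0, "min": None, "max": None}
    )
    for ts, value in samples:
        agg = buckets[bucket_start(ts, minutes)]
        agg["count"] += 1
        agg["sum"] += value
        agg["min"] = value if agg["min"] is None else min(agg["min"], value)
        agg["max"] = value if agg["max"] is None else max(agg["max"], value)
    return dict(buckets)
```

Storing count and sum rather than an average keeps the 15 minute rows re-aggregatable into coarser levels of a time hierarchy (hour, day) without going back to the 1 minute data.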
Appreciate your thoughts Tony,
in our DW there are composite keys, 500K of them say per customer to
produce a report for which the client program needs to page through the
entire set collecting data as it pages through yet to probably another
desktop db.
At this point the purpose of having a
According to the OpsCenter records, yes, the compaction was running then,
8.5mb /s
On Thu, Jun 27, 2013 at 9:54 AM, sulong wrote:
> version: 1.2.2
> cluster read requests 800/s, write request 22/s
> Sorry, I don't know whether the compaction was running then.
>
>
> On Thu, Jun 27, 2013 at 1
version: 1.2.2
cluster read requests 800/s, write request 22/s
Sorry, I don't know whether the compaction was running then.
On Thu, Jun 27, 2013 at 1:02 AM, Robert Coli wrote:
> On Tue, Jun 25, 2013 at 10:13 PM, sulong wrote:
> > I have 4 nodes cassandra cluster. Every node has 32G memory, a
env
--
3 nodes, every node has an 8-core CPU, 32GB memory and 2*1TB disk
CentOS 5 64-bit, Cassandra 1.2.5, OpsCenter 3.1
my scenario
--
each node has a client program, and each client starts 10 threads for increasing
counters.
the replica=2, and the other settings are default values.
my data model
-
This means there were 12 times the code wanted to put a memtable in the queue
to be flushed to disk but the queue was full.
---
That does not seem to be a serious problem. What's the difference between 'Blocked' and 'All
time blocked'?
thx
-heipark
-Original Message-
From: aaron morton [mailto:aa...@thelastp
Thanks Arthur.
Interesting that you think NoSQL does not fit large volumes of data. That is
what it is touted to do.
I have heard PKs are needed, but I thought that is what the "key" column is
for, and composite key support is there also.
The only issue I see is the all that duplicate d
Tony hi,
Yes, in some scenarios (e.g. a DW), e.g. absence of proper PKs or indexes (just
too hard to envision, you need to think of future queries 1st) getting thru
large volumes of data makes NoSQL IMHO hard to fit in.
But you have other choices:
1) pagination or
2) slice queries.
Both of th
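
As a rough illustration of the pagination option, here is a minimal in-memory sketch of paging through a large, sorted key set by always asking for the page that starts after the last key seen. The functions `fetch_page` and `iterate_all` are hypothetical stand-ins; a real client would replace `fetch_page` with a range or slice query against the cluster.

```python
from bisect import bisect_right


def fetch_page(sorted_keys, start_after=None, page_size=1000):
    """Return up to page_size keys strictly greater than start_after.

    Hypothetical stand-in for a server-side range/slice query.
    """
    begin = 0 if start_after is None else bisect_right(sorted_keys, start_after)
    return sorted_keys[begin:begin + page_size]


def iterate_all(sorted_keys, page_size=1000):
    """Yield every key by repeatedly fetching the page after the last key seen."""
    last = None
    while True:
        page = fetch_page(sorted_keys, last, page_size)
        if not page:
            return
        yield from page
        last = page[-1]
```

Resuming from the last key rather than from a numeric offset means each request only touches one page worth of data, however deep into the set the client already is.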
Thanks for your response.
Are there any other general comments on the steps we are taking to decommission
and join back the node? I'm assuming if we do specify a token, we should
specify exactly the same token when we add that node back.
-Original Message-
From: Robert Coli [mailto:rc.
On Fri, Jun 21, 2013 at 5:25 AM, Eric Stevens wrote:
> Is there a way to replace a failed server using vnodes? I only had occasion
> to do this once, on a relatively small cluster. ...
> Of course that caused a bunch of key reassignments,
> so I'm sure it would be less work for the cluster if I
Hello,
We are excited to announce the release of our tool, D-Tunes, which
helps application developers configure replication in geo-distributed
datastores like Cassandra. D-Tunes is a research prototype that we
have developed at the Internet System Laboratory (ISL) at Purdue
University, and is par
do I understand it correctly if I think that collection modifications are
done by reading the collection, writing a range tombstone that would cover
the collection and then re-writing the whole collection again? or is it
just the modified parts of the collection that are covered by the range
tombst
Hi Robert,
Actually that is what I did. I did that in my RDBMS data model. In Cassandra or
NoSQL, without joins or nested selects, I have to do two queries. Also, batching
is not supported on the server side, which makes the performance worse.
I just started learning Cassandra but I am learnin
On Wed, Jun 26, 2013 at 10:20 AM, Tony Anecito wrote:
> Never mind I figured it out. I found it via a search for Secondary indexes.
In general unless you actually need atomic update of the row and its
secondary index, you are probably better off creating your own pseudo
secondary index column fam
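
The idea of maintaining your own index can be modelled with a minimal in-memory sketch. It assumes two plain dictionaries standing in for the data column family and the index column family; the class `IndexedTable` and its methods are hypothetical and not part of any Cassandra API.

```python
class IndexedTable:
    """Toy model of a data column family plus a hand-maintained index."""

    def __init__(self):
        self.rows = {}    # row key -> indexed value (stand-in for the data CF)
        self.index = {}   # indexed value -> set of row keys (stand-in for the index CF)

    def put(self, key, value):
        # Remove the stale index entry first if the row's value is changing.
        if key in self.rows and self.rows[key] != value:
            self._unindex(self.rows[key], key)
        self.rows[key] = value
        self.index.setdefault(value, set()).add(key)

    def _unindex(self, value, key):
        keys = self.index.get(value)
        if keys is not None:
            keys.discard(key)
            if not keys:
                del self.index[value]

    def lookup(self, value):
        """Return the row keys currently indexed under value, sorted."""
        return sorted(self.index.get(value, ()))
```

In this model the row write and the index write are two separate steps, which is the trade-off described above: the application owns the index layout, but the two updates are not atomic.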
On Tue, Jun 25, 2013 at 5:30 AM, Blair Zajac wrote:
> But if I want to delete it regardless of v1, then this doesn't work:
>
> DELETE FROM test WHERE k = 0 IF EXISTS
>
That's correct, though we should probably fix that at some point. I've
opened https://issues.apache.org/jira/browse/CASSANDRA-
Hi All,
I noticed the JDBC driver supports batching of preparedStatements, but when I
checked into it via Google there were comments, as recent as early this month,
that it was not supported on the server side.
So I assume that is still the status but is there some idea when batch
Never mind I figured it out. I found it via a search for Secondary indexes.
Regards,
-Tony
From: Tony Anecito
To: Users-Cassandra
Sent: Wednesday, June 26, 2013 10:39 AM
Subject: Creating an "Index" column...
Hi All,
I have a column family with multipl
On Tue, Jun 25, 2013 at 10:13 PM, sulong wrote:
> I have 4 nodes cassandra cluster. Every node has 32G memory, and the
> cassandra jvm uses 8G. The cluster is suffering from gc. Looks like
> CompactionExecutor thread holds too many SSTableReader. See the attachment.
What version of Cassandra?
Wh
You may have to wait a bit for some of the apache mirrors to catch up. But
if you try a few of the mirrors, some of them are up to date (I just
checked).
As for the release notes, there are links to them in the email you just
responded to.
--
Sylvain
On Wed, Jun 26, 2013 at 6:26 PM, Tony Anecito
On Wed, Jun 26, 2013 at 9:26 AM, Tony Anecito wrote:
> I get http status 404 when I try to download.
>
> Where can I find the release notes for this version?
>
> From: Sylvain Lebresne
> please pay attention to the release notes[2] and Let us know[3] if you were
>
Hi All,
I have a column family with multiple columns and when I try a where clause with
one of the columns that is not the "key" column Cassandra gives me an error
about it not being an index column.
So where can I find an example of creating a column family with one or more
columns being "Inde
I get http status 404 when I try to download.
Where can I find the release notes for this version?
Thanks,
-Tony
From: Sylvain Lebresne
To: "user@cassandra.apache.org"
Sent: Wednesday, June 26, 2013 8:53 AM
Subject: [RELEASE] Apache Cassandra 1.2.6 release
The Cassandra team is pleased to announce the release of Apache Cassandra
version 1.2.6.
Cassandra is a highly scalable second-generation distributed database,
bringing together Dynamo's fully distributed design and Bigtable's
ColumnFamily-based data model. You can read more here:
http://cassand
On Wed, Jun 26, 2013 at 12:16 AM, aaron morton wrote:
> bloom_filter_fp_chance value that was changed from default to 0.1, looked
>>> at the filters and they are about 2.5G on disk and I have around 8G of heap.
>>> I will try increasing the value to 0.7 and report my results.
>>>
>> You need to re
> Does nodetool cleanup run synchronously or asynchronously ?
Async on the server.
> If it is running asynchronously is there any way to monitor the progress ?
Nodetool compactionstats
Cheers
-
Aaron Morton
Freelance Cassandra Consultant
New Zealand
@aaronmorton
http://www.thel
It's an inter node timeout waiting for the read to complete. Normally means the
cluster is overloaded in some fashion, check for GC activity and/or overloaded
IOPs.
If you reduce the batch_size it should help.
Cheers
-
Aaron Morton
Freelance Cassandra Consultant
New Zealand
Hi,
I'm pretty sure that it's related to this ticket :
https://issues.apache.org/jira/browse/CASSANDRA-5677
I'd be happy if someone tests this patch.
It should apply easily on 1.2.5 & 1.2.6
After applying the patch, by default, the current implementation is still
used, but modify your cassandra.
> I had seed nodes ip1,ip2,ip3 as the seeds but what I didn't realize was then
> that these nodes had themselves as seeds. I am assuming that should never be
> done, is that correct?
The only reason nodes listing themselves as seeds can be a pain is during
bootstrap. Seed nodes will not str
> It is made sure that the schema is same across the two clusters.
Just because the schema is the same does not make it the same. Schema is
essentially a versioned document, so the schema on both clusters will be
different.
> ERROR [main] 2013-06-24 04:48:34,746 CassandraDaemon.java (line
Hi,
I've seen a couple of people on Stack Overflow having problems with
performance when they have maps that they continuously update, and in
hindsight I think I might have run into the same problem myself (but I
didn't suspect it as the reason and designed differently and by accident
didn't use m