what others use.
Thanks,
Adam
On Mon, Feb 28, 2022 at 1:59 PM Joe Obernberger <
joseph.obernber...@gmail.com> wrote:
> Hi all - curious what tools are folks using to manage large Cassandra
> clusters? For example, to do tasks such as nodetool cleanup after a
> node or nodes are adde
Thank you Scott!
We installed over on top of 3.11 and it's working fine. Easy non-upgrade path
:)
Thanks again,
Adam
On 2021/07/28 22:49:11, Scott Andreas wrote:
> If you're running Cassandra 3.x, the only data file requirement is that all
> SSTables present on your clu
Thanks Brandon!
Anyone know the upgrade path from 3.x?
https://cassandra.apache.org/doc/latest/cassandra/getting_started/installing.html
Doesn't look like it has specific upgrade instructions.
For instance do we need to run nodetool upgradesstables?
TIA
On 2021/07/26 20:03:59, Brandon Wil
dra again on the new node
> , will it resume bootstrap or will it start over?
>
> On Thu, 7 May 2020 at 13:32, Adam Scott wrote:
>
>> I recommend it on all nodes. This will eliminate that as a source of
>> trouble further on down the road.
>>
>>
>> On Thu,
>
>
> On Thu, 7 May 2020 at 13:23, Adam Scott wrote:
>
>>
>> *edit
>> /etc/sysctl.confnet.ipv4.tcp_keepalive_time=60
>> net.ipv4.tcp_keepalive_probes=3net.ipv4.tcp_keepalive_intvl=10*
>> then run sysctl -p to cause the kernel to reload the settings
abc cassandra]# cat /proc/sys/net/ipv4/tcp_keepalive_time
>
> 300
>
> [root@abc cassandra]# cat /proc/sys/net/ipv4/tcp_keepalive_intvl
>
> 30
>
> [root@abc cassandra]# cat /proc/sys/net/ipv4/tcp_keepalive_probes
>
> 9
>
> On Thu, 7 May 2020 at 12:32, Adam Scott
Maybe a firewall killing a connection?
What does the following show?
cat /proc/sys/net/ipv4/tcp_keepalive_time
cat /proc/sys/net/ipv4/tcp_keepalive_intvl
cat /proc/sys/net/ipv4/tcp_keepalive_probes
On Thu, May 7, 2020 at 10:31 AM Surbhi Gupta
wrote:
> Hi,
>
> We are trying to expand a datacente
ch this
>> mailing list (I'm happy to be corrected 🙂 ) so I'd recommend you
>> cross-post in the Java driver channels as well. Cheers!
>>
>
--
Adam Holmberg
e. adam.holmb...@datastax.com
w. www.datastax.com
Thanks for the excellent advice, this was extremely helpful! Did not know
about TWCS... curing a lot of headache.
Adam
Am Mi., 28. Nov. 2018 um 20:47 Uhr schrieb Jeff Jirsa :
> Probably fine as long as there’s some concept of time in the partition key
> to keep them from growing unb
worries?
4) Any optimization strategies like setting the RF to 1? Which compactation
strategy is advised?
5) Are there any recent performance benchmarks for one of the scenarios?
What else could I do?
Thanks a lot!
Adam
Hi Abdul,
Can you post the timeout error and also the trace output?
The trace output is detailed, but often refers to Cassandra internals. If
we can see the error and the trace it'd be easier to pin-point the problem.
- Adam
On Thu, 11 Oct 2018 at 08:53, Abdul Patel wrote:
> Hi,
>
ould certainly appreciate
any feedback or contributions.
Regards,
Adam
On Mon, 24 Sep 2018 at 14:32, James Shaw wrote:
> Hi, there:
>What are latest good tools for monitoring open source cassandra ?
> I was used to Datastax opscenter tool, felt all tasks quite easy. Now on
>
and a spark-cassandra-connector
job counts the columns and and increases/doubles the number of splits on
demand. This means then that I would have to move e.g. (URL1,0) -> (URL1,1)
when splitnumber would be 2.
Would you do the same? Is there a better way?
Thanks!
Adam
1'? I've looked through the
> configuration files in /etc/cassandra , and can't find where this value is
> being defined.
>
--
Adam Hutson
Data Architect | DataScale
+1 (417) 224-5212
a...@datascale.io
l and permanently delete this message
> and any attachments. KRON makes no warranty that this e-mail is error or
> virus free.
>
--
Adam Hutson
Data Architect | DataScale
+1 (417) 224-5212
a...@datascale.io
OK so good news, I'm running with the patched jar file in my cluster and
haven't seen any issues. The bloom filter off-heap memory usage is between
1.5GB and 2GB per node, which is much more in-line with what I'm expecting!
(thumbsup)
On Mon, Mar 14, 2016 at 9:42 AM, Ad
affected nodes with the fixed jar.
>
> 2016-03-13 19:51 GMT-03:00 Adam Plumb :
>
>> So it's looking like the bloom filter off heap memory usage is ramping up
>> and up until the OOM killer kills the java process. I relaunched on
>> instances with 60GB of memory a
node will start using more and more until it is also killed.
Is this the expected behavior? It doesn't seem ideal to me. Is there
anything obvious that I'm doing wrong?
On Fri, Mar 11, 2016 at 11:31 AM, Adam Plumb wrote:
> Here is the creation syntax for the entire schema. The xyz
{'class': 'LZ4Compressor'};
> CREATE INDEX secondary_id_index_def ON abc.def (secondary_id);
On Fri, Mar 11, 2016 at 11:24 AM, Jack Krupansky
wrote:
> What is your schema and data like - in particular, how wide are your
> partitions (number of rows and typical row size)?
&g
is box?
>
> If so, does it have a memory leak?
>
> all the best,
>
> Sebastián
> On Mar 11, 2016 11:14 AM, "Adam Plumb" wrote:
>
>> I've got a new cluster of 18 nodes running Cassandra 3.4 that I just
>> launched and loaded data into yesterday
I've got a new cluster of 18 nodes running Cassandra 3.4 that I just
launched and loaded data into yesterday (roughly 2TB of total storage) and
am seeing runaway memory usage. These nodes are EC2 c3.4xlarges with 30GB
RAM and the heap size is set to 8G with a new heap size of 1.6G.
Last night I f
The referenced article is accurate as far as NULL is concerned, but please
also note that there is now the ability to specify UNSET to avoid
unnecessary tombstones (as of Cassandra 2.2.0):
https://issues.apache.org/jira/browse/CASSANDRA-7304
Adam
On Tue, Mar 8, 2016 at 12:15 PM, Henry M wrote
e UTC - 06:00, while the example is showing UTC +
08:00. You might want to check the date settings on your local machine and
the database.
Regards,
Adam Holmberg
On Thu, Jan 7, 2016 at 1:58 AM, 土卜皿 wrote:
> Hi, all
>
> When I run the command date:
>
> [root@localhost ~]# date
>
. I've created a ticket to
improve the error message in these circumstances:
https://datastax-oss.atlassian.net/browse/PYTHON-451
Regards,
Adam Holmberg
Michael,
Thanks for pointing that out. It is a driver issue affecting CQL export
(but not the execution API).
I created a ticket to track and resolve:
https://datastax-oss.atlassian.net/browse/PYTHON-447
Adam
On Sat, Nov 21, 2015 at 8:38 AM, Laing, Michael
wrote:
> Quickly reviewing t
It's timing out at the default of two seconds while trying to query and
build the schema metadata. You can raise this timeout:
http://datastax.github.io/python-driver/api/cassandra/cluster.html#cassandra.cluster.Cluster.control_connection_timeout
Adam
On Thu, Oct 29, 2015 at 10:06 AM, Ed
You need to quote the "OpsCenter" identifier to distinguish capital letters:
https://cassandra.apache.org/doc/cql3/CQL.html#identifiers
Adam
On Mon, Oct 26, 2015 at 4:25 PM, Kai Wang wrote:
> Hi,
>
> My understanding is that if I want to enable internal authentication and
&
/Metadata.html
You may not have to construct this yourself.
Adam Holmberg
On Thu, Mar 26, 2015 at 3:53 PM, Roman Tkachenko
wrote:
> Hi Dan,
>
> Have you tried using "nodetool getendpoints"? It shows you nodes that
> currently own the specific key.
>
> Roman
>
> On
Since I don't know what's in your keystore, or how it was generated, I
don't know how much help I can be.
You probably need "-alias " on the command line, and make sure a
cert by the name "" exists in your keystore. You can use
"keytool -list ..." to
the schema change. If this
yields further information, please raise the issue on the driver's user
mailing list.
Adam Holmberg
On Wed, Jan 28, 2015 at 8:19 PM, Saurabh Sethi
wrote:
> I have a 3 node Cassandra 2.1.0 cluster and I am using datastax 2.1.4
> driver to create a keyspace
:
How cqlsh picks up ssl options
<https://github.com/apache/cassandra/blob/cassandra-2.1/pylib/cqlshlib/sslhandling.py>
Example cqlshrc file
<https://github.com/apache/cassandra/blob/cassandra-2.1/conf/cqlshrc.sample>
Adam Holmberg
On Wed, Jan 28, 2015 at 1:08 AM, Lu, Boying
Stephen,
This topic is more appropriate for the python-driver-user list:
https://groups.google.com/a/lists.datastax.com/forum/#!forum/python-driver-user
Can we pick this up there, with a little further information including your
table definition?
Adam
On Fri, Dec 12, 2014 at 1:04 PM, Stephen
This is something that could easily be improved in cqlsh. I'll get a ticket
open today.
Adam
On Wed, Dec 17, 2014 at 8:38 AM, nitin padalia
wrote:
> Thanks! Michael.
> On Dec 17, 2014 8:02 PM, "Laing, Michael"
> wrote:
>
>> http://datastax.github.io/python-d
'null' is how cqlsh displays empty cells:
https://github.com/apache/cassandra/blob/trunk/pylib/cqlshlib/formatting.py#L47-L58
On Thu, Oct 23, 2014 at 9:36 AM, DuyHai Doan wrote:
> Hello Jens
>
> What do you mean by "cqlsh explicitely writes 'null' in those cells" ?
> Are you seing textual value
-2.1.0/bin/cqlsh for how everything gets
setup -- it's possible your wrapper or environment are not playing well
with that.
Also note that "9160" will not apply anymore since this driver uses the
native protocol (9042).
Adam
On Sun, Sep 21, 2014 at 7:53 PM, Tim Dunphy wrote:
There is now a ticket open to look into this and produce a more informative
error message:
https://datastax-oss.atlassian.net/browse/PYTHON-157
Adam
On Wed, Sep 17, 2014 at 4:47 PM, Adam Holmberg
wrote:
> This is not really supported. Presently cqlsh hard-codes CQL and protocol
> to ve
This is not really supported. Presently cqlsh hard-codes CQL and protocol
to versions only supported in 2.1:
https://github.com/apache/cassandra/blob/cassandra-2.1.0/bin/cqlsh#L144-L147
Your best bet is probably downloading a 2.0.x tarball and running out of
there.
Adam Holmberg
On Wed, Sep 17
I think you just need to quote the "Users" identifier. Without quotes,
identifiers are treated as case-insensitive.
https://cassandra.apache.org/doc/cql3/CQL.html#identifiers
Adam
On Wed, Aug 13, 2014 at 9:27 AM, Tim Dunphy wrote:
> Hello,
>
> I am trying to figure out
Hi All,
I'm having some major issues bootstrapping a new node to my cluster. We
are running 1.2.16, with vnodes enabled.
When a new node starts up (with auto_bootstrap), it selects a host ID and
finds the ring successfully:
INFO 18:42:29,559 JOINING: waiting for ring information
It successfull
uot; which contains cells that have never been deleted such as:
select * from chatmessages where username='adam' and
spotname='adam:cherie' and id =1860;
Am I seeing in the cqlsh trace that 1 tombstoned cell has been read:
"Read 1 live and 1 tombstoned cells"
e)
Looking at the code, it looks like the Session object was made
unserializable as of this
change<https://issues.apache.org/jira/browse/CASSANDRA-4239>
.
Is this daemon no longer a supported mode of operation, or am I completely
missing something in how I'm using it?
Any insight would be appreciated.
Regards,
Adam
7;ve been witnessing. In which case, how
do I limit the columns being scanned?
N.B. I cant set the 'widerow' flag to false as it breaks Cassandra (too
many columns are loaded at once, causing an outofmemory style exception).
Many thanks,
Adam
18 PM, Adam Venturella wrote:
> So, it looks like it supports multipart user-data:
> Line 86 here:
> https://github.com/riptano/ComboAMI/blob/2.4/ds2_configure.py
>
>
> I make my multipart user data,
>
> text/plaintext
> text/cloud-config
>
> I need to do some confi
So, it looks like it supports multipart user-data:
Line 86 here:
https://github.com/riptano/ComboAMI/blob/2.4/ds2_configure.py
I make my multipart user data,
text/plaintext
text/cloud-config
I need to do some configuration and hook the cluster up to my puppet master.
The cluster get's configur
I am currently running 4 nodes, @ 1.2.2.
I was curious if it mattered what node I have my clients connect to. Using
the python cql driver :
https://pypi.python.org/pypi/cql/
It doesn't give me the option to specify multiple client addresses, just
one. Will this be an issue?
My assumption is that
TL;DR:
Is it possible to use WHERE IN on wide rows but only have it return the 1st
column of each of the rows in the IN()?
First, I am aware that WHERE IN (id1, id2, id3...N) is not the most
performant, and should not be used on large sets.
Assuming there is also little difference from just issui
Perfect! Thanks for the response Sylvain!
On Friday, March 1, 2013, Sylvain Lebresne wrote:
> On Fri, Mar 1, 2013 at 5:16 PM, Adam Venturella
>
> > wrote:
>
>> My ColumnFamily is defined as follows:
>>
>>
>> CREATE TABLE UserProfileHistory(
>&g
My ColumnFamily is defined as follows:
CREATE TABLE UserProfileHistory(
username text,
timestamp bigint, -- millis since epoch
data text, -- JSON
PRIMARY KEY (username, timestamp)
) WITH CLUSTERING ORDER BY (timestamp DESC);
Each insert on the username adds to the wide row. The
> Aaron Morton
> Freelance Cassandra Developer
> New Zealand
> @aaronmorton
> http://www.thelastpickle.com
> On 21/02/2013, at 7:43 AM, Adam Venturella wrote:
>> My data needs only require me to store JSON, and I can handle this in 1
>> column family by
My data needs only require me to store JSON, and I can handle this in 1
column family by prefixing row keys with a type, for example:
comments:{message_id}
Where comments: represents the prefix and {message_id} represents some row
key to a message object in the same column family.
In this case c
t;
>
> On 02/07/2013 11:04 PM, Adam Venturella wrote:
>
>> Has anyone encountered this before?
>> What did I most likely break or how do I fix it?
>>
>
>
Has anyone encountered this before?
What did I most likely break or how do I fix it?
In my experience, if you foresee needing to do a lot of updates where a
"master" record would need to propagate its changes to other
records, then in general a non-sql based data store may be the wrong fit
for your data.
If you have a lot of data that doesn't really change or is not linked in
some
Jared, how do you guys handle data backups for your ephemeral based cluster?
I'm trying to move to ephemeral drives myself, and that was my last
sticking point; asking how others in the community deal with backup in case
the VM explodes.
On Wed, Jan 16, 2013 at 1:21 PM, Jared Biel
wrote:
> We'
I have been using this successfully so far:
http://pypi.python.org/pypi/cql
On Sun, Jan 6, 2013 at 11:18 AM, Andreas Jung wrote:
> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA1
>
> Are there any up-to-date Python bindings available that
> work with Cassandra 1.2?
>
> - -aj
> -BEGIN PGP S
Nevermind...
help CREATE_KEYSPACE; works wonders..
CREATE KEYSPACE
WITH replication = {'class':'SimpleStrategy',
'replication_factor':3};
=)
On Sat, Dec 29, 2012 at 1:27 PM, Adam Venturella wrote:
> When I create a keyspace with a SimpleS
When I create a keyspace with a SimpleStrategy as outlined here:
https://cassandra.apache.org/doc/cql3/CQL.html#createKeyspaceStmt
CREATE KEYSPACE Test
WITH strategy_class = SimpleStrategy
AND strategy_options:replication_factor = 1;
I receive the following error:
Bad Re
One more link that might be helpful. It's a similar system to photo's but
instead of Photos/Albums it's Songs/Playlists:
http://www.datastax.com/dev/blog/cql3-for-cassandra-experts.
It's not exactly 1:1 but it covers related concepts in making it work.
On Fri, Dec 21, 2
part of the PK.
Admittedly, I don't know how to get around that and just delete everything
where the first 2 components of the PK are true.
You would already possess the list of album names though, so it could be
BATCH that you need to perform for the deletes, specifying 1 delete per
album_n
I am pretty new to cassandra as well. But here goes nothing:
Assumptions:
- You are using a CQL3 client
- Remember I am a n00bsauce at this as well, so another member of the list
may, and probably does, have a better more enlightened answer than I.
Everyone was new to this a one time though, and
Trying to better grasp compound primary keys and what they are conceptually
doing under the hood. When you create a table with a compound primary key
in cql3 (http://www.datastax.com/dev/blog/schema-in-cassandra-1-1) the
first part of the key is the partition key. I get that and the subsequent
part
e columns)
>
> You can do something similar for the annotations.
>
> Depending on your use case I would use UNIX epoch time if possible rather
> than a time uuid.
>
> Hope that helps.
>
> -
> Aaron Morton
> Freelance Cassandra Developer
> New Zealand
My use case is capturing some information about Instagram photos from the
API. I have 2 use cases. One, I need to capture all of the media data for
an account and two I need to be able to privately annotate that data. There
is some nuance in this, multiple http queries for example, but ignoring
tha
what you meant.
I'm aware of CQL3 collections, but I don't think they quite suite my needs
in this case.
Thanks for the suggestions!
Adam
On Fri, Sep 14, 2012 at 1:56 AM, aaron morton wrote:
> You _could_ use one wide row and do a multiget against the same row for
> different co
ques would be appreciated.
Regards,
Adam Holmberg
P.S./Sidebar:
What this seems like to me is a desire for 'multiget' at the second key
level analogous to multiget at the row key level. Is this something that
could be implemented in the server using SlicePred
I've been using a combination of MySQL and Cassandra for about a year now
on a project that now serves about 20k users. We use Cassandra for storing
large entities and MySQL to store meta data that allows us to do better ad
hoc querying. It's worked quite well for us. During this time we have also
We've been introducing Cassandra into our stack at Gowalla over the past year
or so. We recently released the fruits of some of that work. Chronologic is a
service for publishing and aggregating activity feeds in social applications.
It exposes a REST/JSON API via Ruby and stores all the feeds,
I've been working on Chronologic for the past several months. I'm doing a
presentation on it this weekend at Lone Star Ruby Conference. It will be open
source by Sunday 8/14 :).
More info on the talk: http://speakerrate.com/talks/7808
On Aug 6, 2011, at 3:57 AM, Sal Fuentes wrote:
> Hello Scot
rwise, force a heap dump after a full GC and take a look to see
> what's referencing all the memory.
>
> On Fri, May 6, 2011 at 12:25 PM, Serediuk, Adam
> wrote:
>> We're troubleshooting a memory usage problem during batch reads. We've spent
>> the last few day
;re on 0.7.5, mmap, jna and jdk 1.6.0_24
We've somewhat hit the wall in troubleshooting and any advice is greatly
appreciated.
--
Adam
Having a well known node configuration that is trivial (one step) to create is
your best maintenance bet. We are using 4 disk nodes in the following
configuration:
disk1: boot_raid1 os_raid1 cassandra_commit_log
disk2: boot_raid1 os_raid1 cassandra_data_dir_raid0
disk3: cassandra_data_dir_raid0
; more heavily loaded than the others, and are correctly pushing queries
> to other replicas.
>
> On Tue, May 3, 2011 at 12:47 PM, Serediuk, Adam
> wrote:
>> I just ran a test and we do not see that behavior with dynamic snitch
>> disabled. All nodes appear to be
12:31 PM, Serediuk, Adam
> wrote:
>> We appear to have encountered an issue with cassandra 0.7.5 after upgrading
>> from 0.7.2. While doing a batch read using a get_range_slice against the
>> ranges an individual node is master for we are able to reproduce
>> consistently
get evenly sized splits using Cassandra.Client.describe_splits().
Adam
they're being
created faster than they can be addressed)
Can anyone suggest anywhere we might be going wrong? As I say, at the
present we're just looking to do a bulk insert, no read activity until
the writes have completed.
Thanks in advance,
Adam
cassandra.yaml
Description: Binary data
this? Usually the latency is
quite low, but once in every 10 queries or so it's completely of the
chart.
thanks,
Adam
this problem? I am curious about what might
trigger this in one cluster and not on the others (which operate in the same
environment, and are configured similarly).
Any insight would be appreciated.
Thanks,
Adam
Thanks, Jonathan.
Does any know if this is documented anywhere? I didn't turn up anything
searching JIRA or the web.
Adam
On Thu, Sep 23, 2010 at 12:58 PM, Jonathan Ellis wrote:
> Yes, that is a problem with 0.5.1. Unfortunately we are not planning
> any more releases to 0.5.
>
ndering if this is/was a known issue with this version, or if I am
doing something wrong. My searches thus far have yielded nothing on the
matter.
Any insights would be appreciated.
Regards,
Adam
What are the
defaults for cassandra? Where are they specified?
Or is this indicative of another problem?
thanks,
Adam
David,
This much like the behavior I saw... I thought that I might be doing something
wrong, but I haven't had the time to check out other clients iteration
implementations. What client are you using?
-Adam
-Original Message-
From: David McIntosh [mailto:da...@radiotime.com]
was not
throwing out good keys only duplicate ones. That means 1 of 2 things:
1) I'm somehow using the API incorrectly
2) I am the only one encountering a bug
My money is on 1) of course. I can check the thrift API against what my Scala
client is calling under the hood.
-Adam
-Ori
t] key25
[junit] Query w/ Range(key25,,10) result size: 3
[junit] key25
[junit] key14
[junit] key2
[junit] Query w/ Range(key2,,10), result size: 1
[junit] key2
-Adam
-Original Message-
From: sc...@scode.org on behalf of Peter Schuller
Sent: Fri 8/6/2010 6:43 PM
To: user@cassandra.apac
I ran against the 0.6 branch I still see similarly odd results. My test cases
prove that set of keys have been successfully inserted, but usually I never see
the first key again or I reach the first key before having seen all of the keys.
-Adam
-Original Message-
From: Jeremy Hanna
see all of the
keys.
-Adam
-Original Message-
From: Jeremy Hanna [mailto:jeremy.hanna1...@gmail.com]
Sent: Fri 8/6/2010 11:45 AM
To: user@cassandra.apache.org
Subject: Re: error using get_range_slice with random partitioner
Sounds like what you're seeing is in the client, but ther
(.
I'll review yours as soon as I get the client fixed that I'm using.
Adam
-Original Message-
From: davevi...@gmail.com on behalf of Dave Viner
Sent: Fri 8/6/2010 11:28 AM
To: user@cassandra.apache.org
Subject: Re: error using get_range_slice with random partitioner
Funny you
consequence of the random partitioner. I really don't care about the order of
the iteration, but only each key once and that I see all keys is important.
-Adam
-Original Message-
From: th.hel...@gmail.com on behalf of Thomas Heller
Sent: Fri 8/6/2010 7:27 AM
To: user@cassandra.apache.org
Su
ified
row, but the get_range_slice fails.
-Adam
-Original Message-
From: Jonathan Ellis [mailto:jbel...@gmail.com]
Sent: Thursday, August 05, 2010 12:22 PM
To: user@cassandra.apache.org
Subject: Re: error using get_range_slice with random partitioner
can you reproduce starting with a fr
I've never changed the partitioner from the default random. Other ideas?
I can insert and do column queries using a single key but not range on CF.
-Adam
-Original Message-
From: Jonathan Ellis [mailto:jbel...@gmail.com]
Sent: Thursday, August 05, 2010 11:33 AM
To:
alled with start and stop set to "".
1) Is batch iteration possible with random partioner?
This isn't clear from the FAQ entry on the subject:
http://wiki.apache.org/cassandra/FAQ#iter_world
2) The FAQ states that start argument should be "". What should the end
argument be?
thanks!
Adam
90 matches
Mail list logo