x27;t guarantee that the order of
insertion is ordering them based on the actual timestamp of the post.
Can someone help me find a solution for this?
Thanks.
Erik
y a hardware issue.
- Erik -
From: Philip Ó Condúin
Sent: Thursday, August 8, 2019 09:58
To: user@cassandra.apache.org
Subject: Re: Datafile Corruption
Hi Jon,
Good question, I'm not sure if we're using NVMe, I don't see /dev/nvme but we
could
I have this situation where a few (like, 3-4 out of 84) nodes misbehave.
Very long GC pauses, dropping out of cluster etc.
This happens while loading data (via CQL), and analyzing metrics it
looks like on these few nodes, a lot of hints are being generated close
to the time when they start to
Hi!
I have a 2.0.13 cluster which I need to do two things with:
* Extend it
* Upgrade to 2.1.14
I'm pondering in what order to do things. Is it a supported operation to
extend a partially upgraded cluster, i.e. a cluster upgraded to 2.0
where not all sstables have been upgraded?
If I do tha
radesstables before you try to
extend the cluster.
On 5/18/16, 11:17 AM, "Erik Forsberg" wrote:
Hi!
I have a 2.0.13 cluster which I need to do two things with:
* Extend it
* Upgrade to 2.1.14
I'm pondering in what order to do things. Is it a supported operation to
extend a partia
Hi!
I have a 2.0.13 cluster which I have just extended, and I'm now looking
into upgrading it to 2.1.
* The cleanup after the extension is partially done.
* I'm also looking into changing a few tables into Leveled Compaction
Strategy.
In the interest of speeding up things by avoiding unnece
You didn't mention which version you're upgrading to, but generally you
shouldn't add or
remove nodes while the nodes your cluster is not all on on the same version. So
if you're
going to add nodes, you should do that before you upgrade the first node, or
after you have
upgraded all of them.
o a patch level upgrade, from 2.2.1 to 2.2.7 (the latest
stable). Besides, we want to adjust both `commitlog_directory` and
`data_file_directories` settings (separate them into two better IOPS
volumes), and we assumed they should be done before or after upgrade?
On Sun, Aug 7, 2016 at 5:38 AM, F
Hi!
How are writes handled while I'm adding a node to a cluster, i.e. while
the new node is in JOINING state?
Are they queued up as hinted handoffs, or are they being written to the
joining node?
In the former case I guess I have to make sure my max_hint_window_in_ms
is long enough for the node
Hi!
I have this problem where 3 of my 84 nodes misbehave with too long GC
times, leading to them being marked as DN.
This happens when I load data to them using CQL from a hadoop job, so
quite a lot of inserts at a time. The CQL loading job is using
TokenAwarePolicy with fallback to DCAwareR
On 2016-04-19 15:54, sai krishnam raju potturi wrote:
hi;
do we see any hung process like Repairs on those 3 nodes? what
does "nodetool netstats" show??
No hung process from what I can see.
root@cssa02-06:~# nodetool tpstats
Pool NameActive Pending Completed B
(key varchar PRIMARY KEY);"
yields:
"Bad Request: No definition found that is not part of the PRIMARY KEY"
So, my question is, how do I create a Dynamic Column Family via the CQLSH v3?
Thanks!
-erik
;m
not sure if compaction will drop data that doesn't fall within the nodes
key range, but if it does this method should have the effect you're after.
- Erik -
On 02/27/2013 08:51 PM, Hiller, Dean wrote:
Okay, we had 6 nodes of 130Gig and it was slowly increasing. Through our
operati
On 07/25/2017 05:13 AM, Junaid Nasir wrote:
|listen_address: 10.128.1.1 rpc_address: 10.128.1.1|
Are these the values on all three nodes?
If so, try with empty values:
|listen_address: rpc_address:|
or make sure each node has its own IP address configured.
Hi!
If I use sstableloader to load data to a cluster, and the source
sstables contain some columns where the TTL has expired, i.e. the
sstable has not yet been compacted - will those entries be properly
removed on the destination side?
Thanks,
\EF
Hi!
I'm looking into moving some data from one Cassandra cluster to another,
both of them running Cassandra 1.2.13 (or maybe some later 1.2 version
if that helps me avoid some fatal bug). Sstableloader will probably be
the right thing for me, and given the size of my tables, I will want to
run the
Hi!
I'm bulkloading via streaming from Hadoop to my Cassandra cluster. This
results in a rather large set of relatively small (~1MiB) sstables as
the number of mappers that generate sstables on the hadoop cluster is high.
With SizeTieredCompactionStrategy, the cassandra cluster would quickly
comp
On 2014-08-18 19:52, Robert Coli wrote:
> On Mon, Aug 18, 2014 at 6:21 AM, Erik Forsberg <mailto:forsb...@opera.com>> wrote:
>
> Is there some configuration knob I can tune to make this happen faster?
> I'm getting a bit confused by the description for min_sstab
Hi!
We have unfortunately managed to put ourselves in a situation where we are
really close to full disks on our existing 27 nodes.
We are now trying to add 15 more nodes, but running into problems with out
of disk space on the new nodes while joining.
We're using vnodes, on Cassandra 1.2.18 (ye
Hi!
On the same subject as before - due to full disk during bootstrap, my
joining nodes are stuck. What's the correct procedure here, will a plain
restart of the node do the right thing, i.e. continue where bootstrap
stopped, or is it better to clean the data directories before new start of
daemon
Hi!
I have some data in a table created using thrift. In cassandra-cli, the
'show schema' output for this table is:
create column family Users
with column_type = 'Standard'
and comparator = 'AsciiType'
and default_validation_class = 'UTF8Type'
and key_validation_class = 'LexicalUUIDType'
On 2014-11-11 19:40, Alex Popescu wrote:
> On Tuesday, November 11, 2014, Erik Forsberg <mailto:forsb...@opera.com>> wrote:
>
>
> You'll have better chances to get an answer about the Python driver on
> its own mailing
> list
> https://groups.google.com
as been around since
2011. So CF was probably created in Cassandra 0.7 or 0.8 via thrift
calls from pycassa, and I don't think there has been any schema changes
to it since.
Thanks,
\EF
>
> On Wed, Nov 12, 2014 at 2:06 AM, Erik Forsberg <mailto:forsb...@opera.com>> wrote:
On 2014-11-17 09:56, Erik Forsberg wrote:
> On 2014-11-15 01:24, Tyler Hobbs wrote:
>> What version of cassandra did you originally create the column family
>> in? Have you made any schema changes to it through cql or
>> cassandra-cli, or has it always been exactly the s
On 2014-11-19 01:37, Robert Coli wrote:
>
> Thanks, I can reproduce the issue with that, and I should be able to
> look into it tomorrow. FWIW, I believe the issue is server-side,
> not in the driver. I may be able to suggest a workaround once I
> figure out what's going on.
>
>
Hi!
Is there such a thing as the anonymous/unauthenticated user in the
cassandra permissions system?
What I would like to do is to grant select, i.e. provide read-only
access, to users which have not presented a username and password.
Then grant update/insert to other users which have presented
Hi!
I was pleased to find out that cassandra 2.0.x has added support for
pluggable metrics export, which even includes a graphite metrics sender.
Question: Will changes to the metricsReporterConfigFile require a
restart of cassandra to take effect?
I.e, if I want to add a new exported metric to
On 2015-02-05 12:39, Carlos Rolo wrote:
> Hello Erik,
>
> It seems possible, refer to the following documentation to see if it
> fits your needs:
> http://www.datastax.com/documentation/cassandra/2.0/cassandra/security/secureInternalAuthenticationTOC.html
> http://www.datastax
To elaborate a bit on what Marcin said:
* Once a node starts to believe that a few other nodes are down, it seems
to stay that way for a very long time (hours). I'm not even sure it will
recover without a restart.
* I've tried to stop then start gossip with nodetool on the node that
thinks several
Hi!
We having problems with one node (out of 56 in total) misbehaving.
Symptoms are:
* High number of full CMS old space collections during early morning
when we're doing bulkloads. Yes, bulkloads, not CQL, and only a few
thrift insertions.
* Really long stop-the-world GC events (I've seen up to
You can add something like this to cassandra-env.sh :
JVM_OPTS="$JVM_OPTS
-Dorg.xerial.snappy.tempdir=/path/that/allows/executables"
- Erik -
On 12/28/2013 08:36 AM, Edward Capriolo wrote:
Check your fstabs settings. On some systems /tmp has noexec set and
unpacking a library
On 01/04/2014 08:04 AM, Ertio Lew wrote:
... my dual boot 4GB(RAM) machine.
... -Xms4G -Xmx4G -
You are allocating all your ram to the java heap. Are you using the
same JVM parameters on the windows side? You can try to lower the heap
size or add ram to your machine.
- Erik -
Hi!
I'm bulkloading from Hadoop to Cassandra. Currently in the process of
moving to new hardware for both Hadoop and Cassandra, and while
testrunning bulkload, I see the following error:
Exception in thread "Streaming to /2001:4c28:1:413:0:1:1:12:1"
java.lang.RuntimeException: java.io.EOFExc
On 2014-01-27 12:56, Erik Forsberg wrote:
This is on Cassandra 1.2.1. I know that's pretty old, but I would like
to avoid upgrading until I have made this migration from old to new
hardware. Upgrading to 1.2.13 might be an option.
Update: Exactly the same behaviour on Cassandra 1
I'm limiting the size of the total amount of
data in the largest CF at compaction to, hmm.. the free space on the
disk with most free space, correct?
Comments welcome!
Thanks,
\EF
--
Erik Forsberg
Developer, Opera Software - http://www.opera.com/
e the fastest way to get the node up to speed with the
rest of the cluster?
Thanks,
\EF
>
> On Tue, May 31, 2011 at 7:47 AM, Erik Forsberg
> wrote:
> > Hi!
> >
> > I'm considering setting up a small (4-6 nodes) Cassandra cluster on
> > machines that eac
On 10/12/2011 11:33 AM, Ashley Martens wrote:
java version "1.6.0_20"
OpenJDK Runtime Environment (IcedTea6 1.9.9) (6b20-1.9.9-0ubuntu1~10.10.2)
OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
This may have been mentioned before, but is it an option to use the
Sun/Oracle JDK?
- Erik -
gs to see that they do exist. For
example:
http://mail-archives.apache.org/mod_mbox/cassandra-user/201107.mbox/%3CCAB-=z42ihihr8svhdrbvpfyjjysspckes1zoxkvcje9axkd...@mail.gmail.com%3E
- Erik -
On 10/12/2011 12:41 PM, Ashley Martens wrote:
I guess it could be an option but I can't puppet
ava's
System.currentTimeMillis() * 1000 for timestamps while the application
code used just System.currentTimeMillis() and those values would always
be lower.
- Erik -
On 11/01/2011 09:02 PM, Jonathan Ellis wrote:
That doesn't make sense to me. CS:147 is
columnFamilyKeyMap.put(row.key, row.cf);
where cFKM is
Map columnFamilyKeyMap = new
HashMap();
So cFKM can't be null, and HashMap accomodates both null key and null
value, so I'm not
Hi!
Can the new BulkOutputFormat
(https://issues.apache.org/jira/browse/CASSANDRA-3045) be used to load
data to servers running cassandra 0.8.7 and/or Cassandra 1.0.6?
I'm thinking of using jar files from the development version to load
data onto a production cluster which I want to keep on
Hi!
We're experimenting with streaming from Hadoop to Cassandra using
BulkoutputFormat, on cassandra-1.1 branch.
Are there any specific settings we should tune on the Cassandra servers
in order to get the best streaming performance?
Our Cassandra hardware are 16 core (including HT cores) wi
Hi!
If I run a hadoop job that uses BulkOutputFormat to write data to
Cassandra, and that hadoop job is aborted, i.e. streaming sessions are
not completed, it seems like the streaming sessions hang around for a
very long time, I've observed at least 12-15h, in output from 'nodetool
netstats'.
Hi!
When setting ttl on columns, is there a maximum value (other than
MAXINT, 2**31-1) that can be used?
I have a very odd behaviour here, where I try to set ttl to 9 622 973
(~111 days) which works, but setting it to 11 824 305 (~137 days) does
not - it seems columns are deleted instantly
On 2012-02-20 21:20, aaron morton wrote:
Nothing obvious.
Samarth (working on same project) found that his patch to CASSANDRA-3754
was cleaned up a bit too much, which caused a negative ttl.
https://issues.apache.org/jira/browse/CASSANDRA-3754?focusedCommentId=13212395&page=com.atlassian.jir
Hi!
We're currently testing Cassandra with a large number of row keys per
node - nodetool cfstats approximated number of keys to something like
700M per node. This seems to have caused a very large heap consumption.
After reading
http://wiki.apache.org/cassandra/LargeDataSetConsiderations I
Hi!
We're using the bulkloader to load data to Cassandra. During and after
bulkloading, the minor compaction process seems to result in larger
sstables being created. An example:
INFO [CompactionExecutor:105] 2012-03-21 15:18:46,608
CompactionTask.java (line 115) Compacting [SSTableReader(p
On 2012-03-21 16:36, Erik Forsberg wrote:
Hi!
We're using the bulkloader to load data to Cassandra. During and after
bulkloading, the minor compaction process seems to result in larger
sstables being created. An example:
This is on Cassandra 1.1, btw.
\EF
Hi!
I was trying out the "truncate" command in cassandra-cli.
http://wiki.apache.org/cassandra/CassandraCli08 says "A snapshot of the
data is created, which is deleted asyncronously during a 'graveyard'
compaction."
When do "graveyard" compactions happen? Do I have to trigger them somehow?
error.
Which client are you using? With Hector or straight thrift, your should
be able to store byte[] directly.
- Erik -
I may be mistaken, but in any case it's a
corner case as overwriting columns with shorter ttls would be unusual.
- Erik -
e
shackled the new generation collector, the bad problems went away.
Sorry for the long tirade. This was originally going to be a blog post but I
though it would have more value in context here. I hope ultimately it helps
someone else.
-erik
On Thu, Jan 13, 2011 at 5:26 PM, Mike Malone wrot
Forgot one critical point, we use zero swap on any of these hosts.
ersions of the AMIs used for
provisioning.
On Thu, Jan 13, 2011 at 11:31 PM, Mike Malone wrote:
> Erik, the scenario you're describing is almost identical to what we've been
> experiencing. Sounds like you've been pulling your hair out too! You're also
> running the sa
ed we need to better lock
down how we source AMIs for our EC2 nodes. That will give us a more reliable
system, but apparently we have no control over the actual architecture that
AMZN lets us have.
-erik
On Fri, Jan 14, 2011 at 10:09 AM, Mike Malone wrote:
> That's interesting. F
One of the developers will have to confirm but this looks like a bug
to me. MessagingService is a singleton and there's a Multimap used for
targets that isn't accessed in a thread safe manner.
The thread dump would seem to confirm this, when you hammer what is
ultimately a standard HashMap with mu
Filed as https://issues.apache.org/jira/browse/CASSANDRA-2037
I can't see how the code would be correct as written but I'm usually
wrong about most things.
On Sun, Jan 23, 2011 at 12:14 PM, Erik Onnen wrote:
> One of the developers will have to confirm but this looks like
show activity by the StreamStage thread but after the
failure on the bootstrapping node not much else relative to the
stream.
Lastly, the behavior in both cases seems to have issue with the third
data file. Files f-1,f-2 and f-4 are present but f-3 is not.
Any help would be appreciated.
-erik
Thanks Jonathan.
Filed: https://issues.apache.org/jira/browse/CASSANDRA-2283
We'll start the scrub during our normal compaction cycle and update
this thread and the bug with the results.
-erik
On Mon, Mar 7, 2011 at 11:27 AM, Jonathan Ellis wrote:
> It sounds like it doesn't rea
nt of over
committing your hardware.
If you want protection from AZ failure, split you ring across AZs
(Cassandra is quite good at this) or copy snapshots to EBS volumes.
-erik
There are a lot of benefits to EBS volumes, I/O throughput and
reliability are not among those benefits.
On Wed, Mar
The only difference in the effective command line between the
laptop and server is "-Xms3932M -Xmx3932M -Xmn400M" on the laptop and
"-Xms48334M -Xmx48334M -Xmn1600M" on the server.
--
Erik Forkalsrud
Commission Junction
On 03/10/2011 09:18 AM, Ryan King wrote:
Why use s
375/3010/1440") (all the other output from "describe
keyspace" remains the same)
So it looks like something goes wrong when cassandra gets too much memory.
--
Erik Forkalsrud
Commission Junstion
On 03/11/2011 12:13 PM, Jonathan Ellis wrote:
https://issues.apache.org/jira/browse/CASSANDRA-2158, fixed in 0.7.3
you could have saved a lot of time just by upgrading first. :)
Hmm, I'm testing with 0.7.3 ...
but now I know at least which knob to turn.
- Erik -
long thresholdA = memtableThroughputInMB * 1024 * 1024;
long thresholdB = memtableThroughputInMB * 1024L * 1024L;
System.out.println("a=" + thresholdA + " b=" + thresholdB);
}
- Erik -
On Fri, Mar 11, 2011 at 2:02 PM, Erik Forkalsrud wrote:
On 03/11/2011
to repair the tables?
Thanks,
-erik
Thanks, so is it the "[Index.db, Statistics.db, Data.db, Filter.db];
skipped" that indicates it's in Statistics? Basically I need a way to
know if the same is true of all the other tables showing this issue.
-erik
It's been about 7 months now but at the time G1 would regularly
segfault for me under load on Linux x64. I'd advise extra precautions
in testing and make sure you test with representative load.
I'll capture what I we're seeing here for anyone else who may look
into this in more detail later.
Our standard heap growth is ~300K in between collections with regular
ParNew collections happening on average about every 4 seconds. All
very healthy.
The memtable flush (where we see almost all our
ather than repeatedly
trying a node that isn't responding?
Thanks,
-erik
s what it wants, since by the time we timeout once then FD and/or
> dynamic snitch should route the request to another node for the retry
> without adding additional complexity to StorageProxy. (If that's not
> what you see in practice, then we probably have a dynamic snitch bug.)
&
', so just
wanted to check
if it is possible to get a jar where that bug has been fixed that works with
0.5, so we can
just replace it?
--
Regards Erik
Thanks Jonathan!
Yeah, I will just wait until we are ready for upgrade and hold of on that
project for now.
Erik
mn {column =
#column {name = "property",
value="value", timestamp = 2}}}
], dict:new()),
dict:new()),
Erik
On 30 apr 2010, at 15.58, Zubair Quraishi wrote:
> I have the following code in Erl
On Thu, Jul 29, 2010 at 9:57 PM, Ryan Daum wrote:
>
> Barring this we (place where I work, Chango) will probably eventually fork
> Cassandra to have a RESTful interface and use the Jetty async HTTP client to
> connect to it. It's just ridiculous for us to have threads and associated
> resources t
...or perhaps vice versa: how would I tweak a model to suit Cassandra?
I have in mind data that could be _almost_ shoehorned into the (S)CF
structure, and I'd love to hammer this nail with something hadoopy, but I
have a niggling suspicion I'm setting myself up for frustration.
I have
* a rela
or the data that has not
been migrated to the correct replicas.
Thanks!
-erik
g users to the values. Depending on
> your use case though, if this is time-based, you can name the rows by the
> date and just create new rows as time goes on.
>
> -Brandon
>
--
Regards Erik
tamp, would i still be able to
> > retrieve both versions?
> > >>
> > >> Looking at the API I can not find such method, but perhaps I am
> > >> missing something as I am new to cassandra.
> > >>
> > >> Best regards,
> > >> Bram
> > >>
> > >> This e-mail and its contents are subject to the DISCLAIMER at
> > >> http://www.tno.nl/disclaimer/email.html
> > >
> > > This e-mail and its contents are subject to the DISCLAIMER at
> > > http://www.tno.nl/disclaimer/email.html
> > >
> >
> This e-mail and its contents are subject to the DISCLAIMER at
> http://www.tno.nl/disclaimer/email.html
>
--
Regards Erik
lumns that represent the autoincrement value are the ones named
> as: n, n-1, ... , 2, 1.
>
> Hope it helps!
>
> Jesus.
>
>
>
> 2010/3/24 Sylvain Lebresne
>
> > How can I replace the "auto increament" attribute in MySQL
>> > with Cassandra?
>>
>> You can't. Not easily at least.
>>
>> > If I can't, how can I generate an ID which is globally
>> > unique for each of columns?
>>
>> Check UUIDs:
>> http://en.wikipedia.org/wiki/Universally_Unique_Identifier
>>
>> >
>> > Thanks,
>> >
>> > Sent from my iPhone
>> >
>>
>
>
--
Regards Erik
just how well it would
work is this kind of environment.
--
Regards Erik
Thanks David and Jonathan for the info.
Those two links were pretty much the only thing that I did find about this
issue, but is wasn't
sure that only because it works for different zones it would also work for
different regions.
--
Regards Erik
intended behavior or is it a bug?
--
Regards Erik
Thanks Jonathan!
mes?
>>
> Yes. The option is to store one insert per row, you are not going to be
able to do backwards slices this way, without extra index, but you can
scale mush better.
>
>> In considering the limits of cassandra and the best way to model this, we
>> would be adding 3.9 billion rows per year (assuming 125 Hz @ 24/7).
>> However, I can't really think of a better way to model this... So, am I
>> thinking about this all wrong or am I on the right track?
>>
>> Thanks,
>> Andrew
>>
>
--
Regards Erik
I would say that HBase is a little bit more focused on reads and Cassandra
on writes.
HBase has better scans and Cassandra better multi datacenter functionality.
Erik
netty-handler
4.1.43.Final
We will be addressing this issue and likely release a fixed version very
soon.
Many thanks again,
Erik
On Mon, May 4, 2020 at 6:58 AM Matthias Pfau
wrote:
> Hi Chris and Adam,
> thanks for looking into this!
>
> You can find my t
s maybe?
>
> thanks
> --
> Attila Wind
>
> http://www.linkedin.com/in/attilaw
> <https://urldefense.proofpoint.com/v2/url?u=http-3A__www.linkedin.com_in_attilaw&d=DwMCBA&c=adz96Xi0w1RHqtPMowiL2g&r=uHjHq8qzJoJORfwNE9cgGQeHQBiMQtuQd1uTkDPFJP0&m=cyTtaxvGkKVI7sE73eON05g5XWqhk
87 matches
Mail list logo