Re: Why data is not even distributed.

2012-10-04 Thread Tom
from several components (related to dimensions such as time and version), so you can not expect a random distribution over the whole space. Cheers Tom On Wed, Oct 3, 2012 at 5:39 PM, Andrey Ilinykh wrote: > Hello, everybody! > > I'm observing very strange behavior. I have 3 nod

about "insert into table with IF NOT EXISTS" error

2015-02-09 Thread tom
still see that error. If I remove "IF NOT EXISTS" from the CQL, the insert succeeds. Please advise. Thanks. Best regards, Tom

Re: RESTful API for GET

2012-06-11 Thread Tom
nt to access your Cassandra data (for example: astyanax) Good luck, Tom On 06/11/2012 11:15 PM, James Pirz wrote: Hi, Thanks for the reply, But can you tell me how do you form your request URLs, I mean does Cassandra support a native RESTful api for talking to the system, and if yes, on

Re: Using K8s to Manage Cassandra in Production

2018-05-23 Thread Tom Petracca
Using a statefulset should get you pretty far, though will likely be less effective than a coreos-style “operator”. Some random points: For scale-up: a node shouldn’t report “ready” until it’s in the NORMAL state; this will prevent multiple nodes from bootstrapping at once. For scale-down: as of
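The scale-up point above can be sketched as a readiness check. This is a minimal sketch in Python, assuming the probe already has the text output of `nodetool netstats`, which reports the node's mode on a line such as `Mode: NORMAL`. The function name `is_ready` and the sample strings are hypothetical, and how the output is captured inside the pod is left out.

```python
def is_ready(netstats_output: str) -> bool:
    """Return True only when the reported mode is NORMAL.

    Assumes the input is the text printed by `nodetool netstats`,
    which contains a line of the form "Mode: <STATE>".
    """
    for line in netstats_output.splitlines():
        line = line.strip()
        if line.startswith("Mode:"):
            # Everything after the first colon is the state name.
            return line.split(":", 1)[1].strip() == "NORMAL"
    # No mode line found: treat the node as not ready.
    return False
```

A readiness probe built on this would keep a bootstrapping node (mode `JOINING`) out of the ready set, so the StatefulSet does not start the next pod early.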

High CPU usage on reading single row with Set column with short TTL

2019-01-28 Thread Tom Wollert
e than 20 odd guids in the set. But it appears that cassandra maintains the tombstones in memory? We are running 2.1.20 I'd appreciate any pointers! Cheers, Tom -- Development Director | T: 0800 021 0888 | M: 0790 4489797 | www.codeweavers.net | | Codeweavers Limited | Barn 4 | Duns

Re: coordinator failure handling

2019-02-05 Thread Tom Wollert
All below AFAIK a) The query will only be retried after half the timeout has passed, if the query is idempotent (you have to set that on prepare statement, otherwise it will assume it isn't) b) Querytimeout can be set globally in the Cluster.Builder().WithQueryTimeout c) The LoadBalancingPolicy sh

Connection status on cluster exposed anywhere?

2019-02-27 Thread Tom Wollert
n(per keyspace, as the connectionpool is not shared) - the number of current read/writes (per keyspace/host ideally) Which seems to be internal state that is never exposed. Any ideas before I use reflection to look at the internal state? Cheers, Tom -- <https://codeweavers.net> *Codeweave

Re: Connection status on cluster exposed anywhere?

2019-02-27 Thread Tom Wollert
astax-cassandra-spark-solr-performance/ > > > > On Wed, Feb 27, 2019 at 4:36 AM Tom Wollert > wrote: > >> Is it possible to get the current connection status out of Cassandra C# >> driver? >> >> In particular I'm looking at getting >> - connected hosts >

What happens to empty partitions?

2019-05-17 Thread Tom Vernon
that TTL. What then happens to that empty partition and key that had no TTL? (assuming no more writes will happen to that unique partition key). Will they remain in the keyspace indefinitely? Does this pose any challenges in terms of performance/housekeeping? Thanks Tom

Re: Commit log + Data directory on same partition (software raid)

2012-08-11 Thread Tom Duffield
Having both the commit log and data directory on the same volume is generally not recommended. You would actually see a performance decrease unless you can have most of your reads be cache hits. On Friday, August 10, 2012, Thibaut Britz wrote: > Hi, > > Has anyone of you made some experience wit

Re: Poor read latency

2013-03-04 Thread Tom Martin
y logs have the hit 0.75, running compaction OR worse hit 0.85 > running compaction….you get that if the above is the case typically. > > Dean > > From: Tom Martin mailto:tompo...@gmail.com>> > Reply-To: "user@cassandra.apache.org<mailto:user@cassandra.apache.

RE: sstableloader throws storage_port error

2011-08-11 Thread Tom Davidson
I am trying to run sstableloader and I do not want to access Cassandra on the same node. I have edited my cassandra.yaml with appropriate values for listen_address and rpc_address, but I keep getting the error below. The cassandra-cli tool, nodetool, etc. work fine when trying to connect to m

RE: sstableloader throws storage_port error

2011-08-11 Thread Tom Davidson
nadevsan04/10.168.121.57:7000" means something else is using that address/port. netstat can tell you what process that is, if you're not sure. On Thu, Aug 11, 2011 at 4:24 PM, Tom Davidson wrote: > I am trying to sstableloader and I do not want to access Cassandra on the > sa

Re: Sample Data

2012-03-26 Thread Tom Melendez
>> > I wish to test certain things in Cassandra so can someone help me with >> > sample database or sample database data generator which can help me >> > flood Cassandra nodes with large amount of data. I would recommend YCSB: https://github.com/brianfrankcooper/YCSB/wiki/ Thanks, Tom

Using CNAMEs for broadcast_address

2012-05-01 Thread Tom Duffield
Hey, There is a push to use Akamai IPA to accelerate traffic between our Cassandra nodes. Ignoring all other complexities this introduces, is it possible to use CNAMEs for broadcast addresses? I'm also assuming this restricts us to using only the PropertyFileSnitch (since we are not strictly in the

Re: Coke Products at Digg?

2010-07-07 Thread Tom Melendez
> > My near-term beverage consumption strategy is based largely on my > understanding of Digg's, so if there has been a change, I may need to > reevaluate. > Strategy? Care to elaborate? Thanks, Tom

Re: Using mySQL to emulate Cassandra

2010-11-28 Thread Tom Melendez
should you get slashdotted (or "facebooked", "tweeted", "oprahed", "techcrunched" or whatever we call it these days). Good luck with your launch. Thanks, Tom

Re: Embedded Cassandra for Integration testing

2010-03-14 Thread Tom Chen
e jar file required for my testing. Note hector has not been upgraded to v6. For version 6, I suspect that we can yank the dependences out the ant/pom.xml file to determine the runtime jars needed. Tom <http://github.com/rantav/hector> On Sun, Mar 14, 2010 at 11:08 AM, Ryan Daum wrote: >

Re: cassandra not responding

2010-03-16 Thread Tom Chen
speed improvements if I hit the cache (obviously memory access versus random disk.) Write performance in either version is pretty damn good. Tom On Tue, Mar 16, 2010 at 1:40 PM, B. Todd Burruss wrote: > i only anticipate about 2,000,000 hot rows, each with about 4k of data. > however, w

Re: Dividing the client load between machines in Cassandra

2010-03-16 Thread Tom Chen
Try using the cassandra hector client. It has failover and load balancing built in. http://github.com/rantav/hector Tom On Tue, Mar 16, 2010 at 2:30 PM, Sonny Heer wrote: > How can I accomplish this? > > The way I'm doing it now it i

Re: write performance thrift interfaces

2010-03-18 Thread Tom Chen
Hi Martin, Are you using a connection pool? I have been able to get about a 1000+ inserts with java code on one cassandra node with small values(100 bytes). Tom On Thu, Mar 18, 2010 at 11:08 AM, Roger Schildmeijer wrote: > Yes, 30 writes / s sounds a little bit poor. > > Maybe you c

Cassandra 0.6.0 Beta3 is missing a few jars

2010-03-21 Thread Tom Chen
6.0-beta3-bin.tar.gz> -- Tom Chen

Re: Cassandra 0.6.0 Beta3 is missing a few jars

2010-03-21 Thread Tom Chen
Thanks Roger, I totally missed that note. I have been compiling off the trunk and switching between v5. Tom On Sun, Mar 21, 2010 at 7:51 AM, Roger Schildmeijer wrote: > README.txt tells you everything you need to know in order to run Cassandra. > > hint: ant ivy-retrieve > > On

Schema collision results in multiple data directories per table

2021-10-01 Thread Tom Offermann
in the Datastax article with great success. ## Questions * My understanding is that running concurrent schema updates should always be avoided, since that can result in schema collisions. But, in this case, I wasn't performing multiple schema updates. I was just running a single `ALTER KEYSPACE` statement. Any idea why a single schema update would result in a schema collision and two data directories per table? * Should I have waited longer before restarting nodes? Perhaps, given enough time, the Cassandra nodes would have all converged on the correct schema version, and this would have resolved on its own? * Any suggestions for how I can avoid this problem in the future? -- Tom Offermann Lead Software Engineer http://newrelic.com

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
Stefan, Yes, this is probably one of many good reasons to upgrade! Upgrading to Cassandra 4.0 is definitely on our roadmap, but we're hoping to do these migrations first before we upgrade. However, if we keep running into this problem, we may have to rethink that ordering. --Tom On Wed

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
Vytenis, I ran the `ALTER KEYSPACE` command on one of the original `dc1` nodes. Should it make any difference? My understanding was that it could be run from any node in either datacenter. But, if there's a reason to prefer running it on a new datacenter node, I'm happy to do it that w

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
n unchanged. But, when we saw the schema collision that I described in this thread, that resulted in new table IDs in `system_schema.tables`. Why do these table IDs normally remain unchanged? What caused new ones to be generated in the error case I described? --Tom On Wed, Oct 13, 2021 at 1

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
> > > On Fri, Oct 15, 2021 at 3:02 PM Tom Offermann > wrote: > >> Jeff, >> >> Thanks for describing the race condition. >> >> I understand that performing concurrent schema changes is dangerous, and >> that running an `ALTER KEYSPACE` on one n

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
So, if I were to do `CONSISTENCY ALL; select *` from each of the system_schema tables, then on-disk and in-memory should be in sync? On Fri, Oct 15, 2021 at 3:38 PM Jeff Jirsa wrote: > Heap dumps + filesystem inspection + SELECT from schema tables. > > > On Fri, Oct 15, 2021 at

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
e /path/to/data/keyspace/table-(id)/ on disk > > If any of those dont match, you've got a problem waiting to bite you on > next restart. > > > > On Fri, Oct 15, 2021 at 3:48 PM Tom Offermann > wrote: > >> So, if I were to do `CONSISTENCY ALL; select *` from each
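The comparison described in the quoted advice (table IDs from the schema tables against the `table-(id)` directories on disk) can be sketched as follows. This is a minimal sketch in Python under stated assumptions: the directory name is the table name, a hyphen, and the table ID as 32 hex digits without dashes; the mapping `schema_ids` has already been read from `system_schema.tables`. The helper names and sample values are hypothetical.

```python
import uuid
from typing import Dict, List, Tuple


def split_table_dir(dir_name: str) -> Tuple[str, uuid.UUID]:
    """Split 'tablename-<32 hex digits>' into (table name, table id)."""
    name, _, hex_id = dir_name.rpartition("-")
    return name, uuid.UUID(hex=hex_id)


def mismatched_dirs(schema_ids: Dict[str, uuid.UUID],
                    dir_names: List[str]) -> List[str]:
    """Return directories whose id differs from the id in the schema."""
    stale = []
    for dir_name in dir_names:
        name, dir_id = split_table_dir(dir_name)
        if schema_ids.get(name) != dir_id:
            stale.append(dir_name)
    return stale
```

An empty result for every keyspace would suggest that the on-disk directories agree with the schema tables; any directory returned is a candidate for the restart problem described in the thread.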

Re: Schema collision results in multiple data directories per table

2021-10-18 Thread Tom Offermann
h> INSERT INTO ts.timestamps (id, ts) VALUES (1, 8ad72660-f629-11eb-a217-e1a09d8bc60c); cassandra@cqlsh> select dateOf(ts) from ts.timestamps where id = 1; system.dateof(ts) - 2021-08-05 20:13:04.838000+0000 ``` Is there a better/faster way to do this? Once aga
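One way to cross-check the `dateOf` result above without a round trip to the cluster is to decode the timeuuid on the client. This is a minimal sketch in Python using only the standard library; it relies on the fact that a version 1 UUID stores its timestamp as 100-nanosecond intervals since 1582-10-15 UTC. The function name `timeuuid_to_datetime` is hypothetical.

```python
import uuid
from datetime import datetime, timedelta, timezone

# Version 1 UUID timestamps count 100 ns intervals from this instant.
GREGORIAN_EPOCH = datetime(1582, 10, 15, tzinfo=timezone.utc)


def timeuuid_to_datetime(value: str) -> datetime:
    """Decode the embedded timestamp of a version 1 (time-based) UUID."""
    parsed = uuid.UUID(value)
    if parsed.version != 1:
        raise ValueError("not a version 1 (time-based) UUID")
    # parsed.time is in 100 ns units; integer-divide to microseconds.
    return GREGORIAN_EPOCH + timedelta(microseconds=parsed.time // 10)


print(timeuuid_to_datetime("8ad72660-f629-11eb-a217-e1a09d8bc60c"))
```

For the UUID inserted in the snippet, this gives the same instant that `dateOf` reports, 2021-08-05 20:13:04.838 UTC.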

Re: Cassandra in Kubernetes: IP switch decommission issue

2023-03-09 Thread Tom Nora
unsubscribe *Tom Nora * *Startup Growth & Funding* The Book -- <https://itunes.apple.com/us/author/tom-nora/id1208687100?mt=11>HACKING THE CORE <https://itunes.apple.com/us/author/tom-nora/id1208687100?mt=11> *linkedin <http://www.linkedin.com/in/tomnora/en> | twitte

Re: [External] Maximum SSTable size

2018-06-27 Thread Tom van der Woerdt
your cluster has a single large table. STCS is the actual Cassandra default but it often causes more trouble than it solves, because of large SSTables 🙂 Hope that helps! Tom On Wed, 27 Jun 2018 at 08:02, Lucas Benevides wrote: > Hello Community, > > Is there a maximum SSTable S

Re: Five Questions for Cassandra Users

2019-03-28 Thread Tom van der Woerdt
t we certainly don't run our clusters manually Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands [image: Booking.com] <https://www.booking.com/> Empowering people to experience the world since 1996 43 languages, 214+ off

Re: Change primary key from int to bigint

2017-01-11 Thread Tom van der Woerdt
Hi Benjamin, bigint and int have incompatible serialization types, so that won't work. However, changing to 'varint' will work fine. Hope that helps. Tom On Wed, Jan 11, 2017 at 9:21 AM, Benjamin Roth wrote: > Hi there, > > Does anyone know if there is a hack to cha
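The reasoning behind this answer can be illustrated with the byte encodings. This is a minimal sketch in Python, assuming the usual CQL encodings: `int` is a 4-byte big-endian two's-complement value, `bigint` is the same with 8 bytes, and `varint` is a big-endian two's-complement value of any length. The function names are hypothetical and this is not driver code.

```python
import struct


def encode_int(value: int) -> bytes:
    """4-byte big-endian two's complement, as assumed for CQL int."""
    return struct.pack(">i", value)


def encode_bigint(value: int) -> bytes:
    """8-byte big-endian two's complement, as assumed for CQL bigint."""
    return struct.pack(">q", value)


def decode_varint(raw: bytes) -> int:
    """Variable-length big-endian two's complement, as assumed for varint."""
    return int.from_bytes(raw, "big", signed=True)


for n in (1, 128, 65536, 2147483647, -1):
    # Bytes written for an int column differ from bigint bytes ...
    assert encode_int(n) != encode_bigint(n)
    # ... but they are still a valid varint with the same value.
    assert decode_varint(encode_int(n)) == n
```

Under these assumptions, existing 4-byte values stay readable after a change to `varint`, while a reader expecting 8 bytes would not accept them.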

Re: Change primary key from int to bigint

2017-01-11 Thread Tom van der Woerdt
nt. There's a small performance impact with the varint encoding and decoding scheme, but likely insignificant for any reasonable use case. Tom cqlsh> select * from foo where id in (1, 128, 256, 65535, 65536, 16777215, 16777216, 2147483647); id | value +--- 1

Re: Change primary key from int to bigint

2017-01-11 Thread Tom van der Woerdt
My understanding is that it's safe... but considering "alter type" is going to be removed completely ( https://issues.apache.org/jira/browse/CASSANDRA-12443), maybe not. As for faster ways to do this: no idea :-( Tom On Wed, Jan 11, 2017 at 12:12 PM, Benjamin Roth wrote: >

How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
the new DC, but somehow they don't in my situation. How is it possible that queries are routed to the new, empty data center? And more importantly, how can I prevent it? Thanks, Tom

Re: How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
to be fine. With the second approach (join_ring=false), they don't show up at all, which is also what I expected. On Thu, Sep 3, 2015 at 8:44 PM, Bryan Cheng wrote: > Hey Tom, > > What's your replication strategy look like? When your new nodes join the > ring, can you verif

Re: How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
hu, Sep 3, 2015 at 11:53 AM, Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> Hi Bryan, >> >> I'm using the PropertyFileSnitch, and it contains entries for all nodes >> in the old DC, and all nodes in the new DC. The replication factor for both &

Re: How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
Thanks for your help so far! I have some problems trying to understand the jira mentioned by Rob :( I'm currently trying to set up the first node in the new DC with auto_bootstrap = true. The node then becomes visible with status "joining", which (hopefully) prevents other DCs from sending querie

Re: How to prevent queries being routed to new DC?

2015-09-07 Thread Tom van den Berge
NetworkTopologyStrategy On Mon, Sep 7, 2015 at 4:39 PM, Ryan Svihla wrote: > What's your keyspace replication strategy? > > On Thu, Sep 3, 2015 at 3:16 PM Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> Thanks for your help so far! >> >>

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-07 Thread Tom van den Berge
Running nodetool rebuild on a node that was started with join_ring=false does not work, unfortunately. The nodetool command returns immediately, after a message appears in the log that the streaming of data has started. After that, nothing happens. Tom On Fri, Sep 12, 2014 at 5:47 PM, Robert

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-07 Thread Tom van den Berge
s from other DCs. The bad thing is that a) I can't choose where it streams its data from, and b) the two nodes I've been trying to bootstrap crashed when they were almost finished... On Mon, Sep 7, 2015 at 10:22 PM, horschi wrote: > Hi Tom, > > this sounds very much like my thread

Re: How to prevent queries being routed to new DC?

2015-09-08 Thread Tom van den Berge
ry fail to find a replica, right? On Tue, Sep 8, 2015 at 4:40 AM, Anuj Wadehra wrote: > Hi Tom, > > While reading data ( even at CL LOCAL_QUORUM), if data in different nodes > required to meet CL in your local cluster doesnt match, data will be read > from remote dc for read rep

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-08 Thread Tom van den Berge
t to it. Streaming data across the Atlantic takes a lot more time :( > > kind regards, > Christian > > PS: I would love to see the results, if you perform any tests on the > write-survey. Please share it here on the mailing list :-) > > > > On Mon, Sep 7, 2015 at 11:10

Trace evidence for LOCAL_QUORUM ending up in remote DC

2015-09-08 Thread Tom van den Berge
Key cache hit for sstable 389 | 10.55.156.67 | 10586 | SharedPool-Worker-3 My question is: how is it possible that the query is sent to a node in DC2? Since DC1 has 2 nodes and RF 1, the query should always be sent to the other node in DC1 if the coordinator does not have a replica, right? Thanks, Tom

Re: Trace evidence for LOCAL_QUORUM ending up in remote DC

2015-09-08 Thread Tom van den Berge
Just to be sure: can this bug result in a 0-row result while it should be > 0 ? Op 8 sep. 2015 6:29 PM schreef "Tyler Hobbs" : > See https://issues.apache.org/jira/browse/CASSANDRA-9753 > > On Tue, Sep 8, 2015 at 10:22 AM, Tom van den Berge < > tom.vandenbe...@gmai

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-08 Thread Tom van den Berge
> Running nodetool rebuild on a node that was started with join_ring=false >> does not work, unfortunately. The nodetool command returns immediately, >> after a message appears in the log that the streaming of data has started. >> After that, nothing happens. > > > Per driftx, the author of CASSAND

Re: Trace evidence for LOCAL_QUORUM ending up in remote DC

2015-09-08 Thread Tom van den Berge
cause of the problem. I'm very happy to finally know the cause of this problem! Thanks for pointing me in the right direction. Tom On Tue, Sep 8, 2015 at 9:13 PM, Nate McCall wrote: > Just to be sure: can this bug result in a 0-row result while it should be >> > 0 ? >> > P

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-09 Thread Tom van den Berge
> > > I've learned from experience that the node immediately joins the cluster, >> and starts accepting reads (from other DCs) for the range it owns. > > > This seems to be the incorrect assumption at the heart of the confusion. > You "should" be able to prevent this behavior entirely via correct u

Extremely high CPU load in new data center

2015-09-14 Thread Tom van den Berge
, like garbage collection warnings. I also did not see any strange things in the tpstats. The only difference I'm aware of between the old and new DC is the use of vnodes. Any help is appreciated! Thanks, Tom

Secondary index is causing high CPU load

2015-09-15 Thread Tom van den Berge
count" in the cfstats for the index go up with almost 20! When doing the same query on one of my "good" nodes, it only increases with a small number, as I would expect. Could it be that the use of vnodes is causing these problems? Regards, Tom On Mon, Sep 14, 2015 at 8:09

Do vnodes need more memory?

2015-09-23 Thread Tom van den Berge
here, and I would love to know it before upgrading my servers with more memory. Thanks, Tom

Re: Do vnodes need more memory?

2015-09-23 Thread Tom van den Berge
G ram. It would be great if you could respond to my initial question though. Thanks, Tom On Wed, Sep 23, 2015 at 4:14 PM, Sebastian Estevez < sebastian.este...@datastax.com> wrote: > This is interesting, where are you seeing that you're collecting 50% of > the time? Is your env.sh

Re: Do vnodes need more memory?

2015-09-24 Thread Tom van den Berge
On Thu, Sep 24, 2015 at 12:45 AM, Robert Coli wrote: > On Wed, Sep 23, 2015 at 7:09 AM, Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> So it seems that Cassandra simply doesn't have enough memory. I'm trying >> to understand if this can be cau

Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
I have removed a node with nodetool removenode, which completed ok. Nodetool status does not list the node anymore. But since then, I'm seeing messages in my other nodes' log files referring to the removed node: INFO [GossipStage:38] 2015-10-14 11:18:26,322 Gossiper.java (line 968) InetAddress /10

Re: Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
Hi Carlos, I'm using 2.1.6. The mysterious node is not in the peers table. Any other ideas? One of my existing nodes is not present in the system.peers table, though. Should I be worried? Regards, Tom On Wed, Oct 14, 2015 at 2:27 PM, Carlos Rolo wrote: > Check system.peers table to se

Re: Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
has its > own set of system tables. -ml > > On Wed, Oct 14, 2015 at 9:17 AM, Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> Hi Carlos, >> >> I'm using 2.1.6. The mysterious node is not in the peers table. Any other >> ideas? >>

Re: Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
Thanks Sebastian, a restart solved the problem! On Wed, Oct 14, 2015 at 3:46 PM, Sebastian Estevez < sebastian.este...@datastax.com> wrote: > We still keep endpoints in memory. Not sure how you got to this state but > try a rolling restart. > On Oct 14, 2015 9:43 AM,

Unexplainably large reported partition sizes

2016-03-04 Thread Tom van den Berge
time to time, I'm noticing relatively bad latencies when such partitions are (fully) queried. So I'm not fully convinced that the actual partition size is not in the order of 1 or 2 GB. Does anyone have an explanation for these discrepancies? Thanks, Tom

Re: Unexplainably large reported partition sizes

2016-03-05 Thread Tom van den Berge
lated size would have been the compressed size. The compression would be 0.125% of the original, which is not realistic. In the logs, I can see that the typical compression that is achieved for this table is around 80% of the original. Tom On Fri, Mar 4, 2016 at 9:48 PM, Robert Coli wrote: > On

Re: Unexplainably large reported partition sizes

2016-03-06 Thread Tom van den Berge
values ? > > On Sat, Mar 5, 2016 at 7:16 PM, Tom van den Berge > wrote: > >> I don't think compression can be the cause of the difference, because of >> two reasons: >> >> 1) The partition size I calculated myself (3 MB) is the uncompressed >> size,

Re: Unexplainably large reported partition sizes

2016-03-07 Thread Tom van den Berge
a this was? I have not been able to find it. I'm using 2.1.9. Thanks a lot for pointing me in this direction! Tom

Re: Unexplainably large reported partition sizes

2016-03-07 Thread Tom van den Berge
A-10547) > I've been bitten by this one some time ago, too. I stopped using collections because of this. The table in question doesn't use them either. Thanks for the suggestion anyway! Tom

Re: Unexplainably large reported partition sizes

2016-03-10 Thread Tom van den Berge
Thanks guys. I've upgraded to 2.2.5, and the problem is gone. Tom On Wed, Mar 9, 2016 at 10:47 PM, Robert Coli wrote: > On Mon, Mar 7, 2016 at 1:25 PM, Nate McCall > wrote: > >> >>> Rob, can you remember which bug/jira this was? I have not been able to >

Re: Modeling Audit Trail on Cassandra

2016-03-19 Thread Tom van den Berge
er period that suits your situation. Retrieving all changes in a specific time frame is done by retrieving all relevant periods. Tom
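Retrieving "all relevant periods" for a time frame can be sketched as follows. This is a minimal sketch in Python, assuming one partition per calendar day with the ISO date as the partition key; the day granularity, the key format, and the function name `period_keys` are assumptions for illustration, since the snippet does not say which period was used.

```python
from datetime import date, timedelta
from typing import List


def period_keys(start: date, end: date) -> List[str]:
    """List the daily partition keys that cover [start, end], inclusive."""
    if end < start:
        raise ValueError("end must not be before start")
    days = (end - start).days
    return [(start + timedelta(days=i)).isoformat() for i in range(days + 1)]


# Each key would then be queried separately and the results merged.
print(period_keys(date(2016, 3, 18), date(2016, 3, 20)))
```

A coarser period means fewer queries per time frame but larger partitions, so the choice depends on how many changes are written per period.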

Pagination and timeouts

2017-03-27 Thread Tom van den Berge
in_ms is 30 ms. I'm wondering what is causing this timeout? What is also not clear to me is whether the driver and server timeout apply to a single page, or to the entire query? Thanks, Tom

Re: Netty SSL memory leak

2017-05-31 Thread Tom van der Woerdt
Hi John, That's the bug I filed the ticket for, yup. I recommend updating to a newer Cassandra version (3.0.11 or newer), which fixes this issue (and many others). Tom On Wed, May 31, 2017 at 12:39 AM, John Sanda wrote: > I have Cassandra 3.0.9 cluster that is hitting OutOfMemoryErr

Unexpected rows in MV after upgrading to 3.0.15

2017-11-03 Thread Tom van der Woerdt
o the first row could *maybe* be explained by CASSANDRA-11500, but the second row can't. The third row is the one we expect to be there. Is this a new regression in 3.0.15? Is anyone else experiencing this, or should I file a ticket? Thanks, Tom --- Full structure: - CREATE TA

Re: Meltdown/Spectre Linux patch - Performance impact on Cassandra?

2018-01-05 Thread Tom van der Woerdt
, and may not be reproducible for production workloads. If you have the infrastructure to test a variety of kernels, I'd be very interested to see your numbers. Thanks, Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands Direct +3

Re: Decommissioned nodes and FailureDetector

2018-01-19 Thread Tom van der Woerdt
at the number of unreachable racks, so if you only care about nodes you should be able to get rid of most code here. Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands [image: Booking.com] <http://www.booking.com/> The world

Re: [External] Re: Whch version is the best version to run now?

2018-03-05 Thread Tom van der Woerdt
community wrt patches. This is why I'd recommend 3.11.x for new projects. Stay away from any of the 2.x series, they're going EOL soonish and the newer versions are very stable. Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherla

Re: [External] Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Tom van der Woerdt
Hi Javier, When our users ask this question, I tend to answer "keep it above a billion". More partitions is better. I'm not aware of any actual limits on partition count. Practically it's almost always limited by the disk space in a server. Tom van der Woerdt Site

Node being rebuilt receives read requests

2014-09-10 Thread Tom van den Berge
d queries in my client. I can't understand why. Any help is greatly appreciated. Thanks, Tom

Is it possible to bootstrap the 1st node of a new DC?

2014-09-11 Thread Tom van den Berge
data is not complete yet, it can't return anything. This seems to be a dangerous side effect of this procedure, and therefore can't be used. Thanks Tom

Re: Is it possible to bootstrap the 1st node of a new DC?

2014-09-11 Thread Tom van den Berge
Thanks, Rob. I actually tried using LOCAL_ONE instead of ONE, but I still saw this problem. Maybe I missed some queries when updating to LOCAL_ONE. Anyway, it's good to know that this is supposed to work. Tom On Thu, Sep 11, 2014 at 10:28 PM, Robert Coli wrote: > On Thu, Sep 11, 2014

Re: Is it possible to bootstrap the 1st node of a new DC?

2014-09-12 Thread Tom van den Berge
g", similar to a node that is being bootstrapped? Tom On Thu, Sep 11, 2014 at 11:10 PM, Tom van den Berge wrote: > Thanks, Rob. > I actually tried using LOCAL_ONE instead of ONE, but I still saw this > problem. Maybe I missed some queries when updating to LOCAL_ONE. Anyway, > it&#

Fwd: MarshalException after upgrading to 2.1.6

2015-06-11 Thread Tom van den Berge
he old 2.0.10 again. Does anyone have an idea how this can be fixed? Thanks, Tom ERROR 13:51:57 Exception encountered during startup org.apache.cassandra.serializers.MarshalException: unable to make version 1 UUID from 'currencyCode' at org.apache.cassandra.db.marshal.UUIDType.fromString(UUI

Re: MarshalException after upgrading to 2.1.6

2015-06-11 Thread Tom van den Berge
5 at 9:23 AM, Tom van den Berge > wrote: > >> I've upgraded a node from 2.0.10 to 2.1.6. Before taking down the node, >> I've run nodetool upgradesstables and nodetool scrub. >> >> When starting up the node with 2.1.6, I'm getting a MarshalException >>

Re: Check out if Cassandra ready

2013-11-01 Thread Tom van den Berge
start testing. I'm using it in many unit tests (although formally it's not a unit test anymore when relying on a cassandra node). The fantastic performance of Cassandra even allows me to clear all column families and insert the test fixture rows for each individual test case. Good luck, Tom

Re: Managing index tables

2013-11-05 Thread Tom van den Berge
ne of your indexes results in a rollback of the entire transaction. I hope this is helpful to you. Tom On Mon, Nov 4, 2013 at 12:20 PM, Thomas Stets wrote: > What is the best way to manage index tables on update/deletion of the > indexed data? > > I have a table containing all kinds

Re: filter using timeuuid column type

2013-11-05 Thread Tom van den Berge
This is because time2 is not part of the primary key. Only the primary key column(s) can be queried with > and <. Secondary indexes (like your timeuuid_test2_idx) can only be queried with the = operator. Maybe you can make time2 also part of your primary key? Good luck, Tom On Mon,

Re: OOM while reading key cache

2013-11-13 Thread Tom van den Berge
I'm having the same problem, after upgrading from 1.2.3 to 1.2.10. I remember this was a bug that was fixed in version 1.0 or 1.1 some time ago, but apparently it has come back. A workaround is to delete the contents of the saved_caches directory before starting up. Tom On Tue, N

What is listEndpointsPendingHints?

2013-11-26 Thread Tom van den Berge
tokens of my nodes. The reason I'm looking into this is that my cluster is suffering every now and then from never-ending (dead) hinted handoff tasks, resulting in a flood of hints on the node. Thanks, Tom

How to monitor the progress of a HintedHandoff task?

2013-12-03 Thread Tom van den Berge
Hi, Is there a way to monitor the progress of a hinted handoff task? I found the following two mbeans providing some info: org.apache.cassandra.internal:type=HintedHandoff, which tells me that there is 1 active task, and org.apache.cassandra.db:type=HintedHandoffManager#countPendingHints(), whic

How to measure data transfer between data centers?

2013-12-03 Thread Tom van den Berge
Is there a way to know how much data is transferred between two nodes, or more specifically, between two data centers? I'm especially interested in how much data is being replicated from one data center to another, to know how much of the available bandwidth is used. Thanks, Tom

Re: How to monitor the progress of a HintedHandoff task?

2013-12-03 Thread Tom van den Berge
, but has not finished after a reasonable amount of time. So I guess the only way to learn the progress is to look inside the 'hints' column family then. I'll give that a try. Thanks, Tom On Tue, Dec 3, 2013 at 1:43 PM, Rahul Menon wrote: > Tom, > > You should check the

Re: How to monitor the progress of a HintedHandoff task?

2013-12-03 Thread Tom van den Berge
not be able to handle the data quickly enough, resulting in stored hints. The HintedHandoff task that is started is targeting that remote node. Thanks, Tom On Tue, Dec 3, 2013 at 2:22 PM, Rahul Menon wrote: > Tom, > > Do you know why these hints are piling up? What is the size of the

Re: OutOfMemory Java Heap Space error on startup...

2013-12-04 Thread Tom van den Berge
To start up your node again, you could delete the stored key caches ( /var/lib/cassandra/saved_caches/*). Regards, Tom On Wed, Dec 4, 2013 at 7:46 PM, Krishna Chaitanya wrote: > Hey Nate, > Thanks for the reply. The link was really good...!!! Looking > forward to making the

Re: How to measure data transfer between data centers?

2013-12-04 Thread Tom van den Berge
Hi Chris, I think streaming is used for repair tasks, bulk loading and that kind of thing, but not for regular replication traffic. I think you're right that I should look into network tools. I don't think cassandra can supply this information. Thanks, Tom On Wed, Dec 4, 2013

Re: How to monitor the progress of a HintedHandoff task?

2013-12-07 Thread Tom van den Berge
ght have missed? Thanks for your feedback -- it's highly appreciated! Tom On Fri, Dec 6, 2013 at 4:41 PM, Rahul Menon wrote: > Tom, > > you should look at phi_convict_threshold and try and increase the value if > you have too much chatter on your network. > > Also, rebuilding

Help on StatusLogger output?

2014-03-20 Thread Tom van den Berge
ters reset every time they are logged, or e.g. every x minutes? Any help is greatly appreciated! Thanks, Tom

StatusLogger output help

2014-03-28 Thread Tom van den Berge
ters reset every time they are logged, or e.g. every x minutes? Any help is greatly appreciated! Thanks, Tom

Migration 1.2.14 to 2.0.8 causes "Tried to create duplicate hard link" at startup

2014-06-05 Thread Tom van den Berge
ng startup java.lang.RuntimeException: Tried to create duplicate hard link to /Users/tom/cassandra-data/data/drillster/Account/snapshots/pre-sstablemetamigration/drillster-Account-ic-65-Filter.db at org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:75)

Re: Migration 1.2.14 to 2.0.8 causes "Tried to create duplicate hard link" at startup

2014-06-11 Thread Tom van den Berge
No, unfortunately I haven't. On Tue, Jun 10, 2014 at 5:35 PM, Chris Burroughs wrote: > Were you able to solve or work around this problem? > > > On 06/05/2014 11:47 AM, Tom van den Berge wrote: > >> Hi, >> >> I'm trying to migrate a development clu

Are writes to indexes performed asynchronously?

2014-06-19 Thread Tom van den Berge
de cluster, with consistency level ONE. Thanks, Tom

Re: Migration 1.2.14 to 2.0.8 causes "Tried to create duplicate hard link" at startup

2014-06-19 Thread Tom van den Berge
It turns out this is caused by an earlier, failed attempt to upgrade. Removing all pre-sstablemetamigration snapshot directories solved the issue. Credits to Markus Eriksson. On Wed, Jun 11, 2014 at 9:42 AM, Tom van den Berge wrote: > No, unfortunately I haven't. > > > &

Re: is there a "no disk storage" mode ?

2011-12-01 Thread Tom van den Berge
file paths; it works fine out of the box. I've set it up to drop and recreate my keyspace before each test case, and even then it performs quite well. Good luck, Tom On 12/1/11 5:36 PM, DE VITO Dominique wrote: Hi, I want to use Cassandra for (fast) unit testing with a small numb

Cassandra Network Protocols

2012-04-16 Thread Tom Duffield (Mailing Lists)
When you submit a write request with a consistency of EACH_QUORUM, what network protocol does Cassandra use to write the data to the other DCs? Similarly, what network protocol does Cassandra use when writing data as part of replication? -- Tom Duffield (Mailing Lists) Sent with Sparrow
