A difficult data model with C*

2016-11-07 Thread ben ben
know, the business prefers to order by last_time desc. If we use last_time as the clustering key, there will be many records for a single movie, when only the most recent one is actually wanted. So how should we model that? Do you have any suggestions? Thanks! BRs, BEN

Re: A difficult data model with C*

2016-11-08 Thread ben ben
e and continue to watch from the last position she/he reached. BTW, a movie can be watched several times by a user, and the last position is indeed what is needed. BRs, BEN From: Vladimir Yudovin Sent: 2016-11-08 22:35:48 To: user Subject: Re: A difficult data model with C* H
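
One possible shape for the requirement described above, sketched in plain Python rather than taken from the thread: keep exactly one row per (user, movie) so that each new viewing overwrites the previous position, and sort by last_time when reading. The class and field names are hypothetical, and the dictionary only stands in for a table whose primary key is assumed to be ((user_id), movie_id).

```python
from datetime import datetime


class WatchHistory:
    """In-memory stand-in for a table assumed to have PRIMARY KEY ((user_id), movie_id)."""

    def __init__(self):
        self._rows = {}  # user_id -> {movie_id: (position_seconds, last_time)}

    def record(self, user_id, movie_id, position_seconds, last_time):
        # Writing the same (user_id, movie_id) again replaces the old row,
        # which is how an upsert on the primary key would behave.
        self._rows.setdefault(user_id, {})[movie_id] = (position_seconds, last_time)

    def recent(self, user_id):
        # One row per movie, so ordering by last_time can be done on read.
        rows = self._rows.get(user_id, {})
        flat = [(movie, pos, ts) for movie, (pos, ts) in rows.items()]
        return sorted(flat, key=lambda r: r[2], reverse=True)


history = WatchHistory()
history.record("u1", "m1", 100, datetime(2016, 11, 7, 20, 0))
history.record("u1", "m2", 300, datetime(2016, 11, 7, 21, 0))
history.record("u1", "m1", 900, datetime(2016, 11, 8, 9, 0))
latest = history.recent("u1")
```

The trade-off in this sketch is that ordering happens at read time over one partition per user, which is only reasonable while the number of movies per user stays modest.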

Re: Why don't you start off with a "single & small" Cassandra server as you usually do it with MySQL ?

2013-08-27 Thread Ben
tra $100 a month for the alternative and move on to more important things like customer acquisition. Good luck Ben Bromhead Instaclustr | www.instaclustr.com | @instaclustr On 28/08/2013, at 3:36 AM, Robert Coli wrote: > On Tue, Aug 27, 2013 at 3:04 AM, Aklin_81 wrote: > But so

Re: cassandra spark-connector-sqlcontext too many tasks

2018-03-17 Thread Ben Slater
I think that is probably a question for the Spark Connector forum: https://groups.google.com/a/lists.datastax.com/forum/#!forum/spark-connector-user as it’s much more related to the function of the connector than functionality of Cassandra itself. Cheers Ben On Sat, 17 Mar 2018 at 21:18

Re: Cassandra client tuning

2018-03-18 Thread Ben Slater
You will probably find grouping writes into small batches improves overall performance (if you are not doing it already). See the following presentation for some more info: https://www.slideshare.net/Instaclustr/microbatching-highperformance-writes Cheers Ben On Sun, 18 Mar 2018 at 19:23
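
A minimal sketch of what "grouping writes into small batches" could look like on the client side. Grouping by partition key and the batch size of 4 are assumptions made for illustration, not something stated in the message; the helper name micro_batches is hypothetical and no driver calls are shown.

```python
from collections import defaultdict
from itertools import islice


def micro_batches(rows, partition_key, max_batch_size=10):
    """Group rows by partition key, then yield lists of at most max_batch_size rows."""
    by_partition = defaultdict(list)
    for row in rows:
        by_partition[partition_key(row)].append(row)
    for group in by_partition.values():
        it = iter(group)
        while True:
            chunk = list(islice(it, max_batch_size))
            if not chunk:
                break
            yield chunk


# 25 rows spread over three hypothetical partitions: s0, s1, s2.
rows = [{"sensor": f"s{i % 3}", "seq": i} for i in range(25)]
batches = list(micro_batches(rows, lambda r: r["sensor"], max_batch_size=4))
```

Each yielded list would then be sent as one small batch; the right batch size is workload dependent and worth measuring.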

Re: Cassandra client tuning

2018-03-18 Thread Ben Slater
oughput but much less > CPU usage at client side. > > Sent using Zoho Mail <https://www.zoho.com/mail/> > > > On Sun, 18 Mar 2018 12:46:02 +0330 *Ben Slater > >* wrote > > You will probably find grouping writes into small batches improves overall >

Re: Cassandra client tuning

2018-03-18 Thread Ben Slater
Sent using Zoho Mail <https://www.zoho.com/mail/> > > > ---- On Sun, 18 Mar 2018 13:14:54 +0330 *Ben Slater > >* wrote > > When you say batch was worse than async in terms of throughput are you > comparing throughput with the same number of threads or something? I w

Re: Roadmap for 4.0

2018-03-30 Thread Ben Bromhead
erson working on it (or are offering to work on it instead) and can get it ready for review in a timely fashion. That way we can build a more realistic working target. For major breaking changes, there is always 5.0 or 4.1 or whatever we end up doing :) Cheers Ben On Thu, Feb 15, 2018 at 9:39 PM

Re: datastax cassandra minimum hardware recommendation

2018-04-04 Thread Ben Bromhead
> http://www.thelastpickle.com >> >> 2018-04-04 9:44 GMT+01:00 sujeet jog : >> >>> the datastax site has a hardware recommendation of 16CPU / 32G RAM for >>> DSE Enterprise, Any idea what is the minimum hardware recommendation >>> supported, can each node be 8CPU and the support covering it ?.. >>> >> >> > -- Ben Bromhead CTO | Instaclustr <https://www.instaclustr.com/> +1 650 284 9692 Reliability at Scale Cassandra, Spark, Elasticsearch on AWS, Azure, GCP and Softlayer

Re: Sorl/DSE Spark

2018-04-12 Thread Ben Bromhead
. Spark is notorious for causing >>>> latency spikes in Cassandra which is not great if you are sensitive to >>>> that. >>>> >>>> Regards, >>>> Evelyn. >>>> >>>> On 12 Apr 2018, at 6:55 am, kooljava2 >>>> wrote: >>>> >>>> Hello, >>>> >>>> We are exploring configuring Solr/Spark. Wanted to get input on >>>> this. >>>> 1) How do we decide which one to use? >>>> 2) Do we run this on a DC where there is less workload? >>>> >>>> Any other suggestion or comments are appreciated. >>>> >>>> Thank you.

Re: Sorl/DSE Spark

2018-04-13 Thread Ben Bromhead
On Thu, Apr 12, 2018, 21:23 Niclas Hedhman wrote: > Ben, > > 1. I don't see anything in this thread that is DSE specific, so I think it > belongs here. > DSE is literally in the title. > 2. Careful when you say that Datastax produces Cassandra. Cassandra is a >

Re: Sorl/DSE Spark

2018-04-13 Thread Ben Bromhead
Thanks Jeff. On Thu, Apr 12, 2018, 21:37 Jeff Jirsa wrote: > Pretty sure Ben meant that datastax produces DSE, not Cassandra, and since > the questions specifically mentions DSE in the subject (implying that the > user is going to be running either solr or spark within DSE t

Re: Cassandra datastax cerrification

2018-04-14 Thread Ben Bromhead
220 should be enough for cerrification and also i am > reading definitive guide on cassandra ..any other material required ? Any > practise test websites? As certification is costly and wanna clear in one > go ... > -- Ben Bromhead CTO | Instaclustr <https://www.instaclustr.com/> +1

Re: Does Cassandra supports ACID txn

2018-04-19 Thread Ben Slater
level (eg in the catch block delete the records that were inserted). That, however, does not provide you the isolation part of ACID. You also tend to find that if you have properly denormalised your data model for Cassandra there is less requirement for these types of batched updates. Cheers Ben O
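
The "delete in the catch block" idea can be sketched as follows. This is an illustration only: InMemoryTable is a hypothetical stand-in for a table, the table names and the simulated failure are made up, and no real driver API is used.

```python
class InMemoryTable:
    """Hypothetical stand-in for a table, keyed by primary key."""

    def __init__(self, name, fail_on=None):
        self.name = name
        self.rows = {}
        self.fail_on = fail_on  # key whose insert should fail, for the demo

    def insert(self, key, value):
        if key == self.fail_on:
            raise RuntimeError(f"simulated write failure on {self.name}")
        self.rows[key] = value

    def delete(self, key):
        self.rows.pop(key, None)


def insert_all_or_compensate(writes):
    """Apply (table, key, value) writes in order; on failure, delete the ones already applied."""
    done = []
    try:
        for table, key, value in writes:
            table.insert(key, value)
            done.append((table, key))
    except Exception:
        for table, key in reversed(done):
            table.delete(key)
        return False
    return True


hotels = InMemoryTable("hotels")
hotels_by_city = InMemoryTable("hotels_by_city", fail_on=("Paris", "h1"))
ok = insert_all_or_compensate([
    (hotels, "h1", {"name": "Example Hotel"}),
    (hotels_by_city, ("Paris", "h1"), {"name": "Example Hotel"}),
])
```

As the message says, this gives no isolation: between the first insert and the compensating delete, another reader can see the half-applied state. The sketch also assumes the keys are new, since deleting would otherwise discard a value that existed before.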

Re: read repair with consistency one

2018-04-21 Thread Ben Slater
find inconsistencies. - If read_repair_chance or dc_local_read_repair_chance are >0 then extra replicas are checked as part of the query for the % of queries specified by the chance setting. Again, if inconsistencies are found, they are repaired. I expect this mechanism would still apply for CL=ONE. Ch

Re: Does Cassandra supports ACID txn

2018-04-25 Thread Ben Slater
then a single write to hotel_enabled. hotel_enabled could also be something like hotel_details so the other tables really are just indexes. You need to do more reads but whatever you do consistency doesn’t come for free. Cheers Ben On Thu, 26 Apr 2018 at 12:44 Rajesh Kishore wrote: > Correct

Re: Cassandra Summit 2019 / Cassandra Summit 2018

2018-05-03 Thread Ben Bromhead
8 (probably too late)? > > > > Is there a planning committee? > > > > Who wants there to be a Cassandra Summit 2019 and who thinks there is a > better way? > > > > We could try a Cassandra Distributed Summit 2019 where we meet virtually > and perhaps asynchronously, but there would be a lot more energy and > bonding if it’s not virtual. I’m up for any of these. > > > > Kenneth Brotman

Re: Basic Copy vs Snapshot for backup

2018-05-10 Thread Ben Slater
The snapshot gives you a complete set of your sstables at a point in time. If you were copying sstables directly from a live node you would have to deal with files coming and going due to compactions. Cheers Ben On Thu, 10 May 2018 at 16:45 wrote: > Dear Community, > > > > Is th
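
A small illustration of why a snapshot stays complete while live files come and go. Cassandra snapshots are hard links to the sstable files, so a file removed from the live directory remains readable through the snapshot. The file and directory names below are hypothetical, and os.remove merely stands in for a compaction deleting an old sstable.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as data_dir:
    live = os.path.join(data_dir, "example-Data.db")  # hypothetical sstable file name
    snap_dir = os.path.join(data_dir, "snapshots", "demo")
    os.makedirs(snap_dir)
    with open(live, "wb") as f:
        f.write(b"sstable contents")

    snap = os.path.join(snap_dir, "example-Data.db")
    os.link(live, snap)   # hard link: a second name for the same data, no copy made
    os.remove(live)       # stands in for compaction removing the live file

    with open(snap, "rb") as f:
        surviving = f.read()  # still readable through the snapshot link
```

A plain file copy taken while the node is running has no such protection: a file can disappear between listing the directory and copying it.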

Re: Interesting Results - Cassandra Benchmarks over Time Series Data for IoT Use Case I

2018-05-17 Thread Ben Slater
but you might also find relevant: https://www.instaclustr.com/cassandra-connector-for-spark-5-tips-for-success/ Cheers Ben On Thu, 17 May 2018 at 18:06 Arbab Khalil wrote: > We have been exploring IoT specific C* schema design over the past few > months. We wanted to share the benchmarkin

Re: Using K8s to Manage Cassandra in Production

2018-05-23 Thread Ben Bromhead
know use cases where people are > using Mesos or a custom tool built with terraform/chef etc to run their > production clusters but have yet to find a real K8s use case. > > > > *Questions?* > > Is K8s a reasonable choice for managing a production C* cluster? > > Are there documented use cases for this? > > > > Any help would be greatly appreciated. > > > > -- > > Regards, > > > > *Hassaan Pasha* > > -- > > Regards, > > Pavel Sapezhko

Re: Using K8s to Manage Cassandra in Production

2018-05-23 Thread Ben Bromhead
ent gromakowski < vincent.gromakow...@gmail.com> wrote: > Why did you choose java for the operator implementation when everybody > seems to use the go client (probably for greater functionalities) ? > > 2018-05-23 15:39 GMT+02:00 Ben Bromhead : > >> You can get a good way wit

Re: Using K8s to Manage Cassandra in Production

2018-05-23 Thread Ben Bromhead
r... > > 2018-05-23 17:17 GMT+02:00 Ben Bromhead : > >> The official Kubernetes Java driver is actually pretty feature complete, >> if not exactly idiomatic Java... it's only missing full examples to get it >> to GOLD compatibility levels iirc. >> >> A f

Re: Certified Cassandra for Enterprise use

2018-05-29 Thread Ben Slater
Hi Pranay We (Instaclustr) provide enterprise support for Cassandra ( https://www.instaclustr.com/services/cassandra-support/) which may cover what you are looking for. Please get in touch direct if you would like to discuss. Cheers Ben On Tue, 29 May 2018 at 10:11 Pranay akula wrote: >

Re: Jmx_exporter CPU spike

2018-07-09 Thread Ben Bromhead
like GC etc, but feel free to submit a PR! Ben On Mon, Jul 9, 2018 at 12:03 AM Rahul Singh wrote: > How often are you polling the JMX? How much of a spike are you seeing in > CPU? > > -- > Rahul Singh > rahul.si...@anant.us > > Anant Corporation > On Jul 5, 2

Re: Timeout for only one keyspace in cluster

2018-07-21 Thread Ben Slater
Note that this write timeout exception can be C*'s way of telling you that there is contention on an LWT (rather than an actual timeout). See https://issues.apache.org/jira/browse/CASSANDRA-9328 Cheers Ben On Sun, 22 Jul 2018 at 11:20 Goutham reddy wrote: > Hi, > As it is a single partiti

Re: Re: [EXTERNAL] full text search on some text columns

2018-07-31 Thread Ben Slater
We (Instaclustr) will be submitting a PR for 3.11.3 support for cassandra-lucene-index once 3.11.3 is officially released as we offer it as part of our service and have customers using it. Cheers Ben On Wed, 1 Aug 2018 at 14:06 onmstester onmstester wrote: > It seems to be an interest

Re: Secure data

2018-08-01 Thread Ben Slater
-application-level-encryption/ We also use encrypted GP2 EBS pretty widely without issue. Cheers Ben On Thu, 2 Aug 2018 at 05:38 Jonathan Haddad wrote: > You can also get full disk encryption with LUKS, which I've used before. > > On Wed, Aug 1, 2018 at 12:36 PM Jeff Jirsa w

Re: cold vs hot data

2018-09-13 Thread Ben Slater
o PDF e-mail accounts will be archived and may > be scanned by us and/or by external service providers to detect and prevent > threats to our systems, investigate illegal or inappropriate behavior, > and/or eliminate unsolicited promotional e-mails (“spam”). If you have any > concerns

Re: Advantage over Cassandra in Kubernetes

2018-10-11 Thread Ben Bromhead
een solved before. - The Cassandra examples in the official Kubernetes documentation are not something you want to take into production. Cheers Ben On Thu, Oct 11, 2018 at 6:50 PM Goutham reddy wrote: > Hi, > We are in the process of setting up Cassandra cluster with high > availabilit

Re: [ANNOUNCE] StratIO's Lucene plugin fork

2018-10-30 Thread Ben Slater
For anyone who is interested, we’ve published a blog with some more background on this and some more detail of our ongoing plans: https://www.instaclustr.com/instaclustr-support-cassandra-lucene-index/ Cheers Ben On Fri, 19 Oct 2018 at 09:42 kurt greaves wrote: > Hi all, > >

Re: Cassandra DCOS | monitoring connection and user activity

2018-11-01 Thread Ben Bromhead
rn e-mail. Regardless of >> content, this e-mail shall not operate to bind DXC Technology Company or >> any of its affiliates to any order or other contract unless pursuant to >> explicit written agreement or government initiative expressly permitting >> the use of e-mail for such purpose.

Re: Multiple cluster for a single application

2018-11-07 Thread Ben Slater
management environment but if your app functionality allows you to split into multiple clusters (ie your data is not all in one giant table) then I would generally look to split. Splitting also gives you the advantage of making it harder to have an outage that brings everything down. Cheers Ben On

Re: [EXTERNAL] Is Apache Cassandra supports Data at rest

2018-11-14 Thread Ben Slater
I wrote a blog post a while ago on the pros and cons of encrypting in your application for use with Cassandra that you might find useful background on this subject: https://www.instaclustr.com/securing-apache-cassandra-with-application-level-encryption/ Cheers Ben On Wed, 14 Nov 2018 at 13:47

Re: Cassandra single unreachable node causing total cluster outage

2018-11-27 Thread Ben Slater
In what way does the cluster become unstable (ie more specifically what are the symptoms)? My first thought would be the loss of the node causing the other nodes to become overloaded but that doesn’t seem to fit with your point 2. Cheers Ben --- *Ben Slater* *Chief Product Officer* <ht

Re: Authenticate cassandra-stress with cqlshrc

2019-01-08 Thread Ben Slater
(in which case I don’t think there is one but stress has a hell of a lot of options so I could be wrong). Cheers Ben

Re: Authenticate cassandra-stress with cqlshrc

2019-01-08 Thread Ben Slater
Yep, cassandra-stress doesn’t attempt to use the cqlshrc file. Seems to me it could be convenient so might make a nice contribution to the project. Cheers Ben

Re: How to read the Index.db file

2019-02-07 Thread Ben Slater
They don’t do exactly what you want but depending on why you are trying to get this info you might find our sstable-tools useful: https://github.com/instaclustr/cassandra-sstable-tools

unsubscribe

2019-02-11 Thread Ben Standefer
unsubscribe –Ben Standefer Sent via Superhuman ( https://sprh.mn/?vip=benstande...@gmail.com )

Re: ***UNCHECKED*** Query regarding cassandra column write time set by client Timestamp Generator

2019-04-09 Thread Ben Slater
Not in the logs but I think you should be able to use the WRITETIME function to view it via CQL (see https://cassandra.apache.org/doc/latest/cql/dml.html#select) Cheers Ben
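
A short sketch of reading a WRITETIME value. The keyspace, table and column names in the query string are hypothetical. The conversion assumes the usual convention that write timestamps are microseconds since the Unix epoch; a custom client timestamp generator could use a different unit, in which case the helper would need adjusting.

```python
from datetime import datetime, timezone

# Hypothetical keyspace, table and column names, for illustration only.
QUERY = "SELECT id, value, WRITETIME(value) FROM my_keyspace.my_table WHERE id = 42;"


def writetime_to_datetime(writetime_micros):
    """Convert a WRITETIME result (assumed microseconds since epoch) to a UTC datetime."""
    seconds, micros = divmod(writetime_micros, 1_000_000)
    return datetime.fromtimestamp(seconds, tz=timezone.utc).replace(microsecond=micros)


when = writetime_to_datetime(1554768000000123)
```

Comparing the converted value with the time the client believes it wrote the row is one way to check what the timestamp generator actually produced.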

Re: ***UNCHECKED*** Query regarding cassandra column write time set by client Timestamp Generator

2019-04-09 Thread Ben Slater
Maybe sstabledump can help you? https://cassandra.apache.org/doc/4.0/tools/sstable/sstabledump.html

Re: A cluster (RF=3) not recovering after two nodes are stopped

2019-04-23 Thread Ben Slater
Is it possible that stress is overloading node 1 so it’s not recovering state properly when node 2 comes up? Have you tried running with a lower load (say 2 or 3 threads)? Cheers Ben

Re: A cluster (RF=3) not recovering after two nodes are stopped

2019-04-25 Thread Ben Slater
Ben

Re: different query result after a rerun of the same query

2019-04-29 Thread Ben Slater
inserted (relative to your attempt to query it)? Cheers Ben

Re: different query result after a rerun of the same query

2019-04-29 Thread Ben Slater
, changes in topology without following proper procedure, etc. Cheers Ben

Re: different query result after a rerun of the same query

2019-04-30 Thread Ben Slater
monitor the dropped mutations metrics. Cheers Ben

Re: CL=LQ, RF=3: Can a Write be Lost If Two Nodes ACK'ing it Die

2019-05-02 Thread Ben Slater
In scenario 2, if the row has been written to node 3 it will be replaced on the other nodes via rebuild or repair.

Re: CL=LQ, RF=3: Can a Write be Lost If Two Nodes ACK'ing it Die

2019-05-02 Thread Ben Slater
may not have made its way to the third node by the time the first two replicas are lost. If there is a replica on the third node it can be recovered to the other two nodes by either rebuild (actually replace) or repair. Cheers Ben
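
The arithmetic behind this answer, as a small sketch. A quorum is floor(RF / 2) + 1 replicas; the function names are hypothetical, and the second one simply models the worst case in which every failed node is one that had acknowledged the write.

```python
def quorum(replication_factor):
    """Replicas that must acknowledge a QUORUM / LOCAL_QUORUM operation."""
    return replication_factor // 2 + 1


def guaranteed_surviving_copies(acked_replicas, failed_nodes):
    """Worst case: every failed node is one that had acknowledged the write."""
    return max(0, acked_replicas - failed_nodes)


rf = 3
acks_needed = quorum(rf)                                               # 2 of 3 replicas
worst_case = guaranteed_surviving_copies(acks_needed, failed_nodes=2)  # only the ackers had it
best_case = guaranteed_surviving_copies(rf, failed_nodes=2)            # third replica had it too
```

With RF=3 the write is acknowledged after two replicas respond, so losing exactly those two before the third has the data leaves no guaranteed copy; if the third replica did receive it, one copy survives and can be restored by replace or repair.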

Re: CL=LQ, RF=3: Can a Write be Lost If Two Nodes ACK'ing it Die

2019-05-03 Thread Ben Slater
the two nodes dying (although it’s very likely there will be three replicas from the start if the cluster is in a healthy state at the time of the write). Cheers Ben

Re: CassKop : a Cassandra operator for Kubernetes developped by Orange

2019-05-25 Thread Ben Bromhead
This all looks really awesome. While I wish there wasn't as much duplicated effort across the different projects, it's great to see how different teams are solving this very real problem! Also Attila, we do have our cloud offering as you mentioned, but we also have our own k8s operator which is Apac

Re: high write latency on a single table

2019-07-22 Thread Ben Slater
Is the size of the data in your “state” column variable? The higher write latencies at the 95%+ could line up with large volumes of data for particular rows in that column (the one column not in both tables)? Cheers Ben

Re: Cassandra-stress testing

2019-08-20 Thread Ben Slater
://www.instaclustr.com/certified-apache-cassandra/ Cheers Ben

Re: Cassandra-stress testing

2019-08-21 Thread Ben Slater
any useful response from the community. Cheers Ben

Repair Issues

2019-10-24 Thread Ben Mills
Greetings, Inherited a small Cassandra cluster with some repair issues and need some advice on recommended next steps. Apologies in advance for a long email. Issue: Intermittent repair failures on two non-system keyspaces. - platform_users - platform_management Repair Type: Full, parallel rep

Re: Repair Issues

2019-10-24 Thread Ben Mills
ou using Cassandra reaper? > > On Thu, Oct 24, 2019, 12:31 PM Ben Mills wrote: > >> Greetings, >> >> Inherited a small Cassandra cluster with some repair issues and need some >> advice on recommended next steps. Apologies in advance for a long email. >> >>

Re: Repair Issues

2019-10-24 Thread Ben Mills
d when the pods are down. This is a subtlety of running Cassandra in Kubernetes. On Thu, Oct 24, 2019 at 4:24 PM Reid Pinchback wrote: > Ben, you may find this helpful: > > > > https://blog.pythian.com/so-you-have-a-broken-cassandra-sstable-file/ > > > > >

Re: Repair Issues

2019-10-24 Thread Ben Mills
> cause of a problem, but in your case there's so many potential problems I > don't think it's worth digging into, especially since the problem might be > one of the 500 or so bugs that were fixed since this release. > > Once you've done those things it'll

Re: Repair Issues

2019-10-26 Thread Ben Mills
Thanks Ghiyasi. On Sat, Oct 26, 2019 at 9:17 AM Hossein Ghiyasi Mehr wrote: > If the problem exist still, and all nodes are up, reboot them one by one. > Then try to repair one node. After that repair other nodes one by one. > > On Fri, Oct 25, 2019 at 12:56 AM Ben Mills wrote: &g

Memory Recommendations for G1GC

2019-11-01 Thread Ben Mills
Greetings, We are planning a Cassandra upgrade from 3.7 to 3.11.5 and considering a change to the GC config. What is the minimum amount of memory that needs to be allocated to heap space when using G1GC? For GC, we currently use CMS. Along with the version upgrade, we'll be running the stateful

Re: Warning message even for batches targeting single partition

2016-05-08 Thread Ben Slater
Hi Bhuvan, You’re correct that large unlogged batch on one partition isn’t an issue. The logging behaviour has been/is being changed - see this jira for a detailed discussion: https://issues.apache.org/jira/browse/CASSANDRA-10876 Cheers Ben On Mon, 9 May 2016 at 00:58 Bhuvan Rawal wrote

Re: SS Tables Files Streaming

2016-05-09 Thread Ben Bromhead
arios can SS Table files on disk from Node 1 go to Node 2 as > is ? I’m aware this happens in *nodetool rebuild* and I am assuming this > does *not* happen in repairs. Can someone confirm ? > > > > The reason I ask is I am working on a solution for backup / restore and I >

Re: SS Tables Files Streaming

2016-05-09 Thread Ben Bromhead
Note that incremental repair strategies (2.1+) run anti-compaction against sstables in the range being repaired, so this will prevent overstreaming based on the ranges in the repair session. On Mon, 9 May 2016 at 10:31 Ben Bromhead wrote: > Yup, with repair and particularly bootstrap is th

Re: Accessing Cassandra data from Spark Shell

2016-05-09 Thread Ben Slater
-Instaclustr-Spark-Cassandra- Cheers Ben On Tue, 10 May 2016 at 14:08 Cassa L wrote: > Hi, > Has anyone tried accessing Cassandra data using SparkShell? How do you do > it? Can you use HiveContext for Cassandra data? I'm using community version > of Cassandra-3.0 >

Re: Repair schedules for new clusters

2016-05-17 Thread Ben Slater
-go works OK if your data load is incremental. If you do a big load before your first incremental repair then it’s not much different from migrating to incremental repairs, so it is worth following the procedures for migration to avoid a big impact. Cheers Ben On Tue, 17 May 2016 at 16:50 Ashic Mahtab wrote

Re: restore cassandra snapshots on a smaller cluster

2016-05-17 Thread Ben Slater
It should definitely work if you use sstableloader to load all the files. I imagine it is possible doing a straight restore (copy sstables) if you assign the tokens from multiple source nodes to one target node using the initial_token parameter in cassandra.yaml. Cheers Ben On Wed, 18 May 2016
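
For the "straight restore" variant, the token bookkeeping could be sketched like this. The helper is hypothetical; the token lists are assumed to have been collected from each source node beforehand (for example from nodetool ring output), and the sketch relies on initial_token accepting a comma-separated list when vnodes are in use.

```python
def merged_initial_token(*source_node_tokens):
    """Combine the tokens of several source nodes into one cassandra.yaml initial_token line."""
    tokens = sorted({t for node in source_node_tokens for t in node})
    return "initial_token: " + ",".join(str(t) for t in tokens)


# Two hypothetical source nodes with two tokens each, merged onto one target node.
line = merged_initial_token([-9, 3], [5, -2])
```

num_tokens on the target node would presumably also need to match the number of tokens in the merged list; that detail is an assumption here, not something stated in the message.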

Re: Accessing Cassandra data from Spark Shell

2016-05-17 Thread Ben Slater
It definitely should be possible for 1.5.2 (I have used it with spark-shell and cassandra connector with 1.4.x). The main trick is in lining up all the versions and building an appropriate connector jar. Cheers Ben On Wed, 18 May 2016 at 15:40 Cassa L wrote: > Hi, > I followed instructi

Re: Recovery for deleted SSTables files for one column family.

2016-05-19 Thread Ben Slater
will rebuild the sstables from the other replicas in the cluster. Cheers Ben On Thu, 19 May 2016 at 17:57 Mike Yeap wrote: > Hi all, I would like to know, is there any way to rebuild a particular > column family when all the SSTables files for this column family are > missing?? Say

Re: Unable to use native library in C* trigger

2016-06-07 Thread Ben Slater
recompiling C* with a new whitelist I guess). There is a JIRA for non-sandboxed UDFs: https://issues.apache.org/jira/browse/CASSANDRA-9892 Cheers Ben On Wed, 8 Jun 2016 at 01:07 Brian Kelly wrote: > Hi, all, > > I am attempting write a trigger that depends on a native library. The >

Re: Empty snapshot created

2016-06-09 Thread Ben Slater
After adding a DC you need to run nodetool rebuild. See the procedure here: https://docs.datastax.com/en/cassandra/2.1/cassandra/operations/ops_add_dc_to_cluster_t.html Cheers Ben On Fri, 10 Jun 2016 at 14:17 Mradul Maheshwari wrote: > Hi, > I am facing an issue when taking snapshots. &

Re: java.lang.OutOfMemoryError: Java heap space

2016-06-10 Thread Ben Slater
obsolete snapshots) - large partitions ( Writing large partition feed/messages:MANAGER:0 (175811867 bytes)) Cheers Ben On Sat, 11 Jun 2016 at 01:11 Tobin Landricombe wrote: > Hi, > > I've been googling various parts of this all day but none of the > suggestions seem to fit. > >

Re: java.lang.OutOfMemoryError: Java heap space

2016-06-11 Thread Ben Slater
the needs of large partitions). And, there is a fair chance you need to do (b) for a healthy cluster in the long run. Cheers Ben On Sat, 11 Jun 2016 at 20:52 Tobin Landricombe wrote: > Hi Ben, > > I think the degraded mode is caused by one or both of these... > • WARN [main

Re: java.lang.OutOfMemoryError: Java heap space

2016-06-11 Thread Ben Slater
I should add - there is probably an option (c) of fiddling with a bunch of tuning parameters to try to nurse things through with your current config but I’m not sure that’s useful unless you really need to make the current set up work for some reason. On Sun, 12 Jun 2016 at 15:23 Ben Slater

Re: Spark Memory Error - Not enough space to cache broadcast

2016-06-14 Thread Ben Slater
A high level shot in the dark but in our testing we found Spark 1.6 a lot more reliable in low memory situations (presumably due to https://issues.apache.org/jira/browse/SPARK-1). If it’s an option, probably worth a try. Cheers Ben On Wed, 15 Jun 2016 at 08:48 Cassa L wrote: > Hi,

Re: Data lost in Cassandra 3.5 single instance via Erlang driver

2016-06-14 Thread Ben Slater
>> 0x00d0: 4964 000a 7465 7374 746f Idtestto >> 0x00e0: 7069 6332 0002 0004 pic2 >> 0x00f0: 03e8 0064 .d >> 17:16:24.834414 IP localhost.9042 > localhost.49128: Flags [P.], seq >> 14:27, ack 396, win 579, options [nop,nop,TS val 72982872 ecr 72982868], >> length 13 >> 0x: 4500 0041 f907 4000 4006 43ad 7f00 0001 E..A..@.@.C. >> 0x0010: 7f00 0001 2352 bfe8 bfe6 87a4 83eb 4bcb #RK. >> 0x0020: 8018 0243 fe35 0101 080a 0459 a158 ...C.5...Y.X >> 0x0030: 0459 a154 *8400 0800 0400 * .Y.T >> 0x0040: *01* . >> 17:16:24.834452 IP localhost.49128 > localhost.9042: Flags [.], ack 27, >> win 530, options [nop,nop,TS val 72982872 ecr 72982872], length 0 >> 0x: 4500 0034 370d 4000 4006 05b5 7f00 0001 E..47.@.@... >> 0x0010: 7f00 0001 bfe8 2352 83eb 4bcb bfe6 87b1 ..#R..K. >> 0x0020: 8010 0212 fe28 0101 080a 0459 a158 .(...Y.X >> 0x0030: 0459 a158.Y.X >> > > > Check the TCP flow, the data send and ack succeed, but not sure why the > data doesn't insert in Server side successful. Any tool or method can I > debug the Cassandra Server, or am I missing something? > > Thanks, > Linbo > -- Ben Slater Chief Product Officer Instaclustr: Cassandra + Spark - Managed | Consulting | Support +61 437 929 798

Re: how to force cassandra-stress to actually generate enough data

2016-06-15 Thread Ben Slater
running for a long time just results in overwriting a lot of data that gets compacted away. Using n=[number > 1M] will get you n somethings. Cheers Ben On Wed, 15 Jun 2016 at 22:25 Peter Kovgan wrote: > Hi, > > > > The cassandra-stress is not helping really to populate the d

Re: Multi-DC Cluster w/ non-replicated Keyspace

2016-06-16 Thread Ben Slater
plicated > to the other? > > Whenever we add a new keyspace it seems to get replicated even if we don't > specify the other DC in the keyspace's NetworkTopologyStrategy. > > -J

Re: Multi-DC Cluster w/ non-replicated Keyspace

2016-06-16 Thread Ben Slater
That’s the behaviour I would have expected. I’m not aware of any way to prevent this and would be surprised if there is one (but I’ve never tried to find one either so it might be possible). Cheers Ben On Fri, 17 Jun 2016 at 12:02 Jason J. W. Williams wrote: > Hey Ben, > > Looks like

Re: Is my cluster normal?

2016-07-07 Thread Ben Slater
discussion of how we went about benchmarking. Cheers Ben On Fri, 8 Jul 2016 at 07:52 Yuan Fang wrote: > Yes, here is my stress test result: > Results: > op rate : 12200 [WRITE:12200] > partition rate: 12200 [WRITE:12200] > row rate

Re: Re : Purging tombstones from a particular row in SSTable

2016-07-27 Thread Ben Slater
somewhere at some point in time). Cheers Ben On Thu, 28 Jul 2016 at 09:17 Alain RODRIGUEZ wrote: > Hi, > > I just released a detailed post about tombstones today that might be of > some interest for you: > http://thelastpickle.com/blog/2016/07/27/about-deletes-and-tombstones.html >

Re: Memory leak and lockup on our 2.2.7 Cassandra cluster.

2016-08-03 Thread Ben Slater
a way to make your partitions smaller (like 1/10th of the size). Cheers Ben <https://issues.apache.org/jira/browse/CASSANDRA-11206> On Wed, 3 Aug 2016 at 12:35 Kevin Burton wrote: > I have a theory as to what I think is happening here. > > There is a correlation between the mass

Re: [Marketing Mail] Re: Memory leak and lockup on our 2.2.7 Cassandra cluster.

2016-08-03 Thread Ben Slater
Yep, that was what I was referring to. On Thu, 4 Aug 2016 2:24 am Reynald Bourtembourg < reynald.bourtembo...@esrf.fr> wrote: > Hi, > > Maybe Ben was referring to this issue which has been mentioned recently on > this mailing list: > https://issues.apache.org/jira/b

Replicating Cassandra data to HDFS

2016-08-09 Thread Ben Vogan
Hi all, We are investigating using Cassandra in our data platform. We would like data to go into Cassandra first and to eventually be replicated into our data lake in HDFS for long term cold storage. Does anyone know of a good way of doing this? We would rather not have parallel writes to HDFS

Re: Replicating Cassandra data to HDFS

2016-08-09 Thread Ben Vogan
; for duplication checks to dedup then output to another source (form of dual > write but with dedup), this was really silly and slow. I only bring it up > to save you the trouble in case you end up in the same path chasing for > something more 'real time'. > > Regards, > R

Re: Read Repairs and CL

2016-08-28 Thread Ben Slater
read at CL=ALL which could not read sufficient replicas - a bit confusing (although I can also see how getting CL=ALL errors when you thought you were reading at QUORUM or ONE would be confusing). Cheers Ben On Sun, 28 Aug 2016 at 10:52 kurt Greaves wrote: > Looking at the wiki for the read p

Re: Bootstrapping multiple C* nodes in AWS

2016-08-29 Thread Ben Slater
what you are doing, particularly if you’re working with a production cluster. Cheers Ben On Tue, 30 Aug 2016 at 16:09 Aiman Parvaiz wrote: > Hi all > I am running C* 2.1.12 in AWS EC2 Classic with RF=3 and vnodes(256 > tokens/node). My nodes are distributed in three different availabilit

Re: Read Repairs and CL

2016-08-30 Thread Ben Slater
Thanks Sam - a couple of subtleties there that we missed in our review. Cheers Ben On Tue, 30 Aug 2016 at 19:42 Sam Tunnicliffe wrote: > Just to clarify a little further, it's true that read repair queries are > performed at CL ALL, but this is slightly different to a regular, >

Re: ServerError: An unexpected error occurred server side; in cassandra java driver

2016-09-01 Thread Ben Slater
Hi Siddarth, It would probably help people provide an answer if you gave everyone some more details, like: - cassandra version and driver version you are using - query that is being executed when the error occurs - schema of the table that is being queried Cheers Ben On Thu, 1 Sep 2016 at 21:19

Re: Streaming Process: How can we speed it up?

2016-09-15 Thread Ben Slater
that the node doesn’t get cleaned up through the replacement process like a newly streamed node does (plus the extra operational complexity). Cheers Ben On Thu, 15 Sep 2016 at 19:47 Vasileios Vlachos wrote: > Hello and thanks for your responses, > > OK, so i

Re: How Fast Does Information Spread With Gossip?

2016-09-16 Thread Ben Bromhead
? Too high, too low? >> >> >> Thanks, >> >> Jerome >> > -- > > Jens Rantil > Backend Developer @ Tink > > Tink AB, Wallingatan 5, 111 60 Stockholm, Sweden > For urgent matters you can reach me at +46-708-84 18 32. > -- Ben Bromhead CTO | Instaclustr <https://www.instaclustr.com/> +1 650 284 9692 Managed Cassandra / Spark on AWS, Azure and Softlayer

Re: Export/Importing keyspace from a different sized cluster

2016-09-19 Thread Ben Slater
cluster to run > sstableloader, what methods can I use to load a single keyspace from one > cluster to another cluster of different size? > > > > Appreciate the help… > > > > Thanks, > > Justin > > > -- Ben Slater Chief Product Officer Instacl

Re: Question about replica and replication factor

2016-09-19 Thread Ben Slater
distributed across all the nodes in your cluster). Cheers Ben On Tue, 20 Sep 2016 at 14:09 Jun Wu wrote: > Hi there, > > I have a question about the replica and replication factor. > > For example, I have a cluster of 6 nodes in the same data center. > Replication factor R

Re: Question about replica and replication factor

2016-09-19 Thread Ben Slater
) the coordinator would need to contact and execute a read on at least one other node to satisfy the query. Cheers Ben On Tue, 20 Sep 2016 at 14:50 Jun Wu wrote: > Hi Ben, > > Thanks for the quick response. > > It's clear about the example for single row/partition.
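
The point about replicas can be illustrated with a toy token ring. This is only a minimal sketch, not Cassandra's actual code: it assumes six hypothetical nodes (n1 to n6) with evenly spaced tokens, a replication factor of 3 (the value in the original question is cut off above), SimpleStrategy-style placement on consecutive nodes, and an MD5-based stand-in for the real Murmur3 partitioner. The function names are made up for the example.

```python
import bisect
import hashlib

RING_SIZE = 2**32  # toy token space; real Cassandra uses Murmur3 over a 64-bit range


def token_for(key: str) -> int:
    """Map a partition key to a token (MD5 stand-in, an assumption of this sketch)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big")


def build_ring(nodes):
    """Assign one evenly spaced token per node, sorted by token."""
    step = RING_SIZE // len(nodes)
    return [(i * step, node) for i, node in enumerate(nodes)]


def replicas_for(key, ring, rf):
    """Return the rf nodes holding `key`: the token owner plus the next nodes clockwise."""
    tokens = [t for t, _ in ring]
    # First node whose token is >= the key's token; wrap to index 0 past the last token.
    start = bisect.bisect_left(tokens, token_for(key)) % len(ring)
    return [ring[(start + i) % len(ring)][1] for i in range(rf)]


if __name__ == "__main__":
    ring = build_ring(["n1", "n2", "n3", "n4", "n5", "n6"])
    for key in ("user:1", "user:2", "user:3"):
        print(key, "->", replicas_for(key, ring, rf=3))
```

With six nodes and three replicas, each node holds only part of the data, so a query spanning several partitions will usually need reads from more than one node.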

Re: Question about replica and replication factor

2016-09-19 Thread Ben Slater
in most uses of Cassandra). Cheers Ben On Tue, 20 Sep 2016 at 15:18 Jun Wu wrote: > > > Yes, I think for my case, at least two nodes need to be contacted to get > the full set of data. > > But another thing comes up about dynamic snitch. It's the wrapped snitch > and

Re: Exceptions whenever compaction happens

2016-09-26 Thread Ben Slater
schema of the table that might help. Beyond that the only thing I can think of is running scrub if you haven’t already done so. Cheers Ben On Mon, 26 Sep 2016 at 16:36 Nikhil Sharma wrote: > Hi, > > We are not exactly sure what is causing this problem. But after compaction > happens (

Re: crash with OOM

2016-09-27 Thread Ben Slater
-partition batches? Massive partitions? Especially given it hit two nodes simultaneously, I would be looking for a rogue query as my first point of investigation. Cheers Ben On Tue, 27 Sep 2016 at 17:49 xutom wrote: > > Hi, all > I have a C* cluster with 12 nodes. My cassandra version

Re: ask for help about exmples of Data Types the document shows

2016-09-27 Thread Ben Slater
frozen is relatively recent (3.6) so the doco may be out of date there. Cheers Ben On Wed, 28 Sep 2016 at 13:12 zha...@easemob.com wrote: > hi, everyone, I'm learning Cassandra now, and have some problems about > the document of "Data Types". I don't know where to re

Re: Re: ask for help about exmples of Data Types the document shows

2016-09-27 Thread Ben Slater
My best guess is that you need to remove the quotes from around the zip values (i.e. change it to zip: 20500 rather than zip: ‘20500’) as zip is defined as an int. Cheers Ben On Wed, 28 Sep 2016 at 14:38 zha...@easemob.com wrote: > Hi, Ben Slater, thank you very much for your reply! >

Re: Where to change the datacenter name?

2016-10-10 Thread Ben Dalling
Hi, It depends on which snitch you are using, but the DC name is in cassandra-rackdc.properties for the GossipingPropertyFileSnitch. Best wishes, Ben On 10 October 2016 at 20:54, Adam Hutson wrote: > There is a cluster name in the cassandra.yaml for naming the cluster, aka > data
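
For reference, cassandra-rackdc.properties is a plain key=value file in which the `dc` and `rack` keys carry the datacenter and rack names. The following is a minimal sketch, assuming sample values (DC1, RAC1) that do not come from the thread and a hypothetical helper name, showing how the names a node would report could be read from the file's contents.

```python
def parse_rackdc(text: str) -> dict:
    """Parse key=value lines, skipping blank lines and '#' comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:  # ignore lines without an '=' separator
            props[key.strip()] = value.strip()
    return props


# Sample file contents; the values are placeholders, not from the thread.
SAMPLE = """\
# cassandra-rackdc.properties
dc=DC1
rack=RAC1
"""

if __name__ == "__main__":
    props = parse_rackdc(SAMPLE)
    print("datacenter:", props.get("dc"), "rack:", props.get("rack"))
```

The file's location depends on how Cassandra was installed, so the sketch parses a string rather than assuming a path.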

Re: Is there any way to throttle the memtable flushing throughput?

2016-10-11 Thread Ben Bromhead
be an issue. Even if you can't/don't want to upgrade the instance type, the other steps will help things. Ben On Tue, 11 Oct 2016 at 10:23 Satoshi Hikida wrote: > Hi, > > I'm investigating the read/write performance of the C* (Ver. 2.2.8). > However, I have an issue abou

Re: New node overstreaming data ?

2016-10-13 Thread Ben Bromhead
causing, and working around)? To work around, we tried increasing > #compactors and reducing stream throughput so that at least incoming > #SSTables would be controlled. > > > > This has happened to us a few times in the past too, so I am wondering if > this is a known problem (I c

Re: Does increment/decrement by 0 generate any commits ?

2016-10-13 Thread Ben Bromhead
olumn by positive integer and the others by 0. > > Makes sense ? > -- Ben Bromhead CTO | Instaclustr <https://www.instaclustr.com/> +1 650 284 9692 Managed Cassandra / Spark on AWS, Azure and Softlayer
