know, the business prefers to
order by last_time desc. If we use last_time as a clustering key, there will be
many records for a single movie, when only the most recent one is actually wanted. So how
should we model that? Do you have any suggestions?
Thanks!
BRs,
BEN
e and continue to watch from the
last position she/he reached. BTW, a movie can be watched several times by a user,
and it is the last position that is needed.
BRs,
BEN
From: Vladimir Yudovin
Sent: 8 November 2016 22:35:48
To: user
Subject: Re: A difficult data model with C*
H
tra $100 a month
for the alternative and move on to more important things like customer
acquisition.
Good luck
Ben Bromhead
Instaclustr | www.instaclustr.com | @instaclustr
On 28/08/2013, at 3:36 AM, Robert Coli wrote:
> On Tue, Aug 27, 2013 at 3:04 AM, Aklin_81 wrote:
> But so
I think that is probably a question for the Spark Connector forum:
https://groups.google.com/a/lists.datastax.com/forum/#!forum/spark-connector-user
as
it’s much more related to the function of the connector than functionality
of Cassandra itself.
Cheers
Ben
On Sat, 17 Mar 2018 at 21:18
You will probably find grouping writes into small batches improves overall
performance (if you are not doing it already). See the following
presentation for some more info:
https://www.slideshare.net/Instaclustr/microbatching-highperformance-writes
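
As a rough illustration of grouping writes into small batches, here is a minimal Python sketch. It assumes that batches are built per partition key and capped at a small size; that grouping rule, the function name `micro_batches`, and the sample rows are assumptions for illustration and are not taken from the message or the linked presentation. In a real client, each yielded list would be sent as one small batch through the driver.

```python
from collections import defaultdict


def micro_batches(rows, partition_key, max_batch_size=10):
    """Yield small lists of rows; every list holds rows for one partition only."""
    by_partition = defaultdict(list)
    for row in rows:
        by_partition[partition_key(row)].append(row)
    for group in by_partition.values():
        # Slice each partition's rows into chunks of at most max_batch_size.
        for start in range(0, len(group), max_batch_size):
            yield group[start:start + max_batch_size]


# Hypothetical data: two sensors with five readings each.
rows = [{"sensor_id": s, "seq": i} for s in ("a", "b") for i in range(5)]
batches = list(micro_batches(rows, lambda r: r["sensor_id"], max_batch_size=2))

for batch in batches:
    print(batch[0]["sensor_id"], [r["seq"] for r in batch])
```

The batch size is kept deliberately small here; the right value depends on row size and cluster settings and is best found by measurement.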
Cheers
Ben
On Sun, 18 Mar 2018 at 19:23
oughput but much less
> CPU usage at client side.
>
> Sent using Zoho Mail <https://www.zoho.com/mail/>
>
>
> On Sun, 18 Mar 2018 12:46:02 +0330 *Ben Slater
> >* wrote
>
> You will probably find grouping writes into small batches improves overall
>
Sent using Zoho Mail <https://www.zoho.com/mail/>
>
>
> ---- On Sun, 18 Mar 2018 13:14:54 +0330 *Ben Slater
> >* wrote
>
> When you say batch was worse than async in terms of throughput, are you
> comparing throughput with the same number of threads or something? I w
erson working on it
(or are offering to work on it instead) and can get it ready for review in
a timely fashion. That way we can build a more realistic working target.
For major breaking changes, there is always 5.0 or 4.1 or whatever we end
up doing :)
Cheers
Ben
On Thu, Feb 15, 2018 at 9:39 PM
> http://www.thelastpickle.com
>>
>> 2018-04-04 9:44 GMT+01:00 sujeet jog :
>>
>>> the datastax site has a hardware recommendation of 16CPU / 32G RAM for
>>> DSE Enterprise, Any idea what is the minimum hardware recommendation
>>> supported, can each node be 8CPU and the support covering it ?..
>>>
>>
>>
> --
Ben Bromhead
CTO | Instaclustr <https://www.instaclustr.com/>
+1 650 284 9692
Reliability at Scale
Cassandra, Spark, Elasticsearch on AWS, Azure, GCP and Softlayer
. Spark is notorious for causing
>>>> latency spikes in Cassandra which is not great if you are sensitive to
>>>> that.
>>>>
>>>> Regards,
>>>> Evelyn.
>>>>
>>>> On 12 Apr 2018, at 6:55 am, kooljava2
>>>> wrote:
>>>>
>>>> Hello,
>>>>
>>>> We are exploring configuring Solr/Spark. Wanted to get input on
>>>> this.
>>>> 1) How do we decide which one to use?
>>>> 2) Do we run this on a DC where there is less workload?
>>>>
>>>> Any other suggestion or comments are appreciated.
>>>>
>>>> Thank you.
>>>>
>>>>
>>>>
>>> --
Ben Bromhead
CTO | Instaclustr <https://www.instaclustr.com/>
+1 650 284 9692
Reliability at Scale
Cassandra, Spark, Elasticsearch on AWS, Azure, GCP and Softlayer
On Thu, Apr 12, 2018, 21:23 Niclas Hedhman wrote:
> Ben,
>
> 1. I don't see anything in this thread that is DSE specific, so I think it
> belongs here.
>
DSE is literally in the title.
> 2. Careful when you say that Datastax produces Cassandra. Cassandra is a
>
Thanks Jeff.
On Thu, Apr 12, 2018, 21:37 Jeff Jirsa wrote:
> Pretty sure Ben meant that datastax produces DSE, not Cassandra, and since
> the questions specifically mentions DSE in the subject (implying that the
> user is going to be running either solr or spark within DSE t
220 should be enough for certification and also I am
> reading definitive guide on cassandra ..any other material required ? Any
> practise test websites? As certification is costly and wanna clear in one
> go ...
>
--
Ben Bromhead
CTO | Instaclustr <https://www.instaclustr.com/>
+1
level (eg in the catch block delete the records that were
inserted). That, however, does not provide you the isolation part of ACID.
You also tend to find that if you have properly denormalised your data
model for Cassandra there is less need for this type of batched
update.
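
The "delete what was inserted in the catch block" idea can be sketched as follows. This is a minimal illustration, assuming every write targets a new key; `InMemoryTable` is a hypothetical stand-in for a real table accessed through a driver, and `write_all_or_undo` is an invented helper name.

```python
class InMemoryTable:
    """Hypothetical stand-in for a Cassandra table, used only for this sketch."""

    def __init__(self, fail_on=None):
        self.rows = {}
        self.fail_on = fail_on  # key whose insert should fail, to simulate an error

    def insert(self, key, value):
        if key == self.fail_on:
            raise RuntimeError(f"simulated write failure for {key!r}")
        self.rows[key] = value

    def delete(self, key):
        self.rows.pop(key, None)


def write_all_or_undo(writes):
    """Apply (table, key, value) writes in order; on failure, delete those already applied."""
    applied = []
    try:
        for table, key, value in writes:
            table.insert(key, value)
            applied.append((table, key))
    except Exception:
        # Compensating deletes, newest first. Assumes the keys did not exist before.
        for table, key in reversed(applied):
            table.delete(key)
        return False
    return True


hotels = InMemoryTable()
hotels_by_city = InMemoryTable(fail_on="h1")
ok = write_all_or_undo([
    (hotels, "h1", {"name": "Example Hotel"}),
    (hotels_by_city, "h1", "Sydney"),
])
print(ok, hotels.rows, hotels_by_city.rows)
```

Between the first insert and the compensating delete, another reader could still see the row in `hotels`, which is the missing isolation mentioned above.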
Cheers
Ben
O
find inconsistencies.
- If read_repair_chance or dc_local_read_repair_chance are >0 then extra
replicas are checked as part of the query for the % of queries specified by
the chance setting. Again, if inconsistencies are found, they are repaired.
I expect this mechanism would still apply for CL=ONE.
Ch
then a single write to
hotel_enabled. hotel_enabled could also be something like hotel_details so
the other tables really are just indexes. You need to do more reads but
whatever you do consistency doesn’t come for free.
Cheers
Ben
On Thu, 26 Apr 2018 at 12:44 Rajesh Kishore wrote:
> Correct
8 (probably too late)?
>
>
>
> Is there a planning committee?
>
>
>
> Who wants there to be a Cassandra Summit 2019 and who thinks there is a
> better way?
>
>
>
> We could try a Cassandra Distributed Summit 2019 where we meet virtually
> and perhaps asynchronously, but there would be a lot more energy and
> bonding if it’s not virtual. I’m up for any of these.
>
>
>
> Kenneth Brotman
>
>
>
>
>
>
>
>
>
> --
>
>
>
> --
Ben Bromhead
CTO | Instaclustr <https://www.instaclustr.com/>
+1 650 284 9692
Reliability at Scale
Cassandra, Spark, Elasticsearch on AWS, Azure, GCP and Softlayer
The snapshot gives you a complete set of your sstables at a point in time.
If you were copying sstables directly from a live node you would have to
deal with files coming and going due to compactions.
Cheers
Ben
On Thu, 10 May 2018 at 16:45 wrote:
> Dear Community,
>
>
>
> Is th
but you might also find relevant:
https://www.instaclustr.com/cassandra-connector-for-spark-5-tips-for-success/
Cheers
Ben
On Thu, 17 May 2018 at 18:06 Arbab Khalil wrote:
> We have been exploring IoT specific C* schema design over the past few
> months. We wanted to share the benchmarkin
know use cases where people are
> using Mesos or a custom tool built with terraform/chef etc to run their
> production clusters but have yet to find a real K8s use case.
>
>
>
> *Questions?*
>
> Is K8s a reasonable choice for managing a production C* cluster?
>
> Are there documented use cases for this?
>
>
>
> Any help would be greatly appreciated.
>
>
>
> --
>
> Regards,
>
>
>
> *Hassaan Pasha*
>
> --
>
> Regards,
>
> Pavel Sapezhko
>
>
>
--
Ben Bromhead
CTO | Instaclustr <https://www.instaclustr.com/>
+1 650 284 9692
Reliability at Scale
Cassandra, Spark, Elasticsearch on AWS, Azure, GCP and Softlayer
ent gromakowski <
vincent.gromakow...@gmail.com> wrote:
> Why did you choose java for the operator implementation when everybody
> seems to use the go client (probably for greater functionalities) ?
>
> 2018-05-23 15:39 GMT+02:00 Ben Bromhead :
>
>> You can get a good way wit
r...
>
> 2018-05-23 17:17 GMT+02:00 Ben Bromhead :
>
>> The official Kubernetes Java driver is actually pretty feature complete,
>> if not exactly idiomatic Java... it's only missing full examples to get it
>> to GOLD compatibility levels iirc.
>>
>> A f
Hi Pranay
We (Instaclustr) provide enterprise support for Cassandra (
https://www.instaclustr.com/services/cassandra-support/) which may cover
what you are looking for.
Please get in touch direct if you would like to discuss.
Cheers
Ben
On Tue, 29 May 2018 at 10:11 Pranay akula
wrote:
>
like GC etc, but feel
free to submit a PR!
Ben
On Mon, Jul 9, 2018 at 12:03 AM Rahul Singh
wrote:
> How often are you polling the JMX? How much of a spike are you seeing in
> CPU?
>
> --
> Rahul Singh
> rahul.si...@anant.us
>
> Anant Corporation
> On Jul 5, 2
Note that a write timeout exception can be C*'s way of telling you that
there is contention on an LWT (rather than an actual timeout). See
https://issues.apache.org/jira/browse/CASSANDRA-9328
Cheers
Ben
On Sun, 22 Jul 2018 at 11:20 Goutham reddy
wrote:
> Hi,
> As it is a single partiti
We (Instaclustr) will be submitting a PR for 3.11.3 support for
cassandra-lucene-index once 3.11.3 is officially released as we offer it as
part of our service and have customers using it.
Cheers
Ben
On Wed, 1 Aug 2018 at 14:06 onmstester onmstester
wrote:
> It seems to be an interest
-application-level-encryption/
We also use encrypted GP2 EBS pretty widely without issue.
Cheers
Ben
On Thu, 2 Aug 2018 at 05:38 Jonathan Haddad wrote:
> You can also get full disk encryption with LUKS, which I've used before.
>
> On Wed, Aug 1, 2018 at 12:36 PM Jeff Jirsa w
o PDF e-mail accounts will be archived and may
> be scanned by us and/or by external service providers to detect and prevent
> threats to our systems, investigate illegal or inappropriate behavior,
> and/or eliminate unsolicited promotional e-mails (“spam”). If you have any
> concerns
een solved before.
- The Cassandra examples in the official Kubernetes documentation are not
something you want to take into production.
Cheers
Ben
On Thu, Oct 11, 2018 at 6:50 PM Goutham reddy
wrote:
> Hi,
> We are in the process of setting up Cassandra cluster with high
> availabilit
For anyone who is interested, we’ve published a blog with some more
background on this and some more detail of our ongoing plans:
https://www.instaclustr.com/instaclustr-support-cassandra-lucene-index/
Cheers
Ben
On Fri, 19 Oct 2018 at 09:42 kurt greaves wrote:
> Hi all,
>
>
rn e-mail. Regardless of
>> content, this e-mail shall not operate to bind DXC Technology Company or
>> any of its affiliates to any order or other contract unless pursuant to
>> explicit written agreement or government initiative expressly permitting
>> the use of e-mail for such purpose. --.
>>
> --
Ben Bromhead
CTO | Instaclustr <https://www.instaclustr.com/>
+1 650 284 9692
Reliability at Scale
Cassandra, Spark, Elasticsearch on AWS, Azure, GCP and Softlayer
management environment but if your app functionality allows you to split
into multiple clusters (ie your data is not all in one giant table)
then I would generally look to split. Splitting also gives you the
advantage of making it harder to have an outage that brings everything down.
Cheers
Ben
On
I wrote a blog post a while ago on the pros and cons of encrypting in your
application for use with Cassandra that you might find useful background on
this subject:
https://www.instaclustr.com/securing-apache-cassandra-with-application-level-encryption/
Cheers
Ben
On Wed, 14 Nov 2018 at 13:47
In what way does the cluster become unstable (ie more specifically what are
the symptoms)? My first thought would be the loss of the node causing the
other nodes to become overloaded but that doesn’t seem to fit with your
point 2.
Cheers
Ben
---
*Ben Slater*
*Chief Product Officer*
<ht
(in which case I don’t think there is one but stress has a hell of a lot of
options so I could be wrong).
Cheers
Ben
---
*Ben Slater*
*Chief Product Officer*
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclustr>
<https://www.linkedin.com/company/instaclustr>
Yep, cassandra-stress doesn’t attempt to use the cqlshrc file. Seems to me
it could be convenient so might make a nice contribution to the project.
Cheers
Ben
---
*Ben Slater*
*Chief Product Officer*
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclust
They don’t do exactly what you want but depending on why you are trying to
get this info you might find our sstable-tools useful:
https://github.com/instaclustr/cassandra-sstable-tools
---
*Ben Slater*
*Chief Product Officer*
<https://www.facebook.com/instaclustr> <https://tw
unsubscribe
–Ben Standefer
Sent via Superhuman ( https://sprh.mn/?vip=benstande...@gmail.com )
Not in the logs but I think you should be able to use the WRITETIME
function to view via CQL (see
https://cassandra.apache.org/doc/latest/cql/dml.html#select)
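
A small sketch of how a WRITETIME result might be read. The keyspace, table and column names in the query are hypothetical, and the conversion assumes the timestamp is in microseconds since the Unix epoch, which is the usual convention unless the client supplied its own timestamps.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical keyspace, table and column names, shown only to illustrate the syntax.
QUERY = "SELECT id, WRITETIME(status) FROM my_keyspace.my_table WHERE id = ?"

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def writetime_to_datetime(writetime_us):
    """Convert a WRITETIME value (assumed microseconds since epoch) to a UTC datetime."""
    return EPOCH + timedelta(microseconds=writetime_us)


# Example value, invented for this sketch.
print(writetime_to_datetime(1478615748000000).isoformat())
```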
Cheers
Ben
---
*Ben Slater*
*Chief Product Officer*
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclust
Maybe sstabledump can help you?
https://cassandra.apache.org/doc/4.0/tools/sstable/sstabledump.html
---
*Ben Slater*
*Chief Product Officer*
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclustr>
<https://www.linkedin.com/company/instaclustr>
Read our lat
Is it possible that stress is overloading node 1 so it’s not recovering
state properly when node 2 comes up? Have you tried running with a lower
load (say 2 or 3 threads)?
Cheers
Ben
---
*Ben Slater*
*Chief Product Officer*
<https://www.facebook.com/instaclustr> <https://tw
Ben
---
*Ben Slater**Chief Product Officer*
<https://www.instaclustr.com/platform/>
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclustr>
<https://www.linkedin.com/company/instaclustr>
Read our latest technical blog posts here
<https://www.in
inserted (relative to your attempt to
query it)?
Cheers
Ben
---
*Ben Slater**Chief Product Officer*
<https://www.instaclustr.com/platform/>
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclustr>
<https://www.linkedin.com/company/instaclustr>
Read our lat
, changes in topology
without following proper procedure, etc.
Cheers
Ben
---
*Ben Slater**Chief Product Officer*
<https://www.instaclustr.com/platform/>
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclustr>
<https://www.linkedin.com/company/instaclu
monitor the dropped mutations metrics.
Cheers
Ben
---
*Ben Slater**Chief Product Officer*
<https://www.instaclustr.com/platform/>
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclustr>
<https://www.linkedin.com/company/instaclustr>
Read our latest t
In scenario 2, if the row has been written to node 3 it will be replaced on
the other nodes via rebuild or repair.
---
*Ben Slater**Chief Product Officer*
<https://www.instaclustr.com/platform/>
<https://www.facebook.com/instaclustr> <https://twitter.com/inst
may not have made its way to the third node by the time the first
two replicas are lost. If there is a replica on the third node it can be
recovered to the other two nodes by either rebuild (actually replace) or
repair.
Cheers
Ben
---
*Ben Slater**Chief Product Officer*
<ht
the two nodes dying (although it’s very likely there will be three
replicas from the start if the cluster is in a healthy state at the time of
the write).
Cheers
Ben
---
*Ben Slater**Chief Product Officer*
<https://www.instaclustr.com/platform/>
<https://www.facebook.com/instaclustr>
This all looks really awesome. While I wish there wasn't as much duplicated
effort across the different projects, it's great to see how different teams
are solving this very real problem!
Also Attila we do have our cloud offering as you mentioned, but we also
have our own k8s operator which is Apac
Is the size of the data in your “state” column variable? The higher write
latencies at the 95%+ could line up with large volumes of data for
particular rows in that column (the one column not in both tables)?
Cheers
Ben
---
*Ben Slater**Chief Product Officer*
<https://www.instaclustr.
://www.instaclustr.com/certified-apache-cassandra/
Cheers
Ben
---
*Ben Slater**Chief Product Officer*
<https://www.instaclustr.com/platform/>
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclustr>
<https://www.linkedin.com/company/instaclustr>
Read our latest technica
any useful response from the community.
Cheers
Ben
---
*Ben Slater**Chief Product Officer*
<https://www.instaclustr.com/platform/>
<https://www.facebook.com/instaclustr> <https://twitter.com/instaclustr>
<https://www.linkedin.com/company/instaclustr>
Read our latest t
Greetings,
Inherited a small Cassandra cluster with some repair issues and need some
advice on recommended next steps. Apologies in advance for a long email.
Issue:
Intermittent repair failures on two non-system keyspaces.
- platform_users
- platform_management
Repair Type:
Full, parallel rep
ou using Cassandra reaper?
>
> On Thu, Oct 24, 2019, 12:31 PM Ben Mills wrote:
>
>> Greetings,
>>
>> Inherited a small Cassandra cluster with some repair issues and need some
>> advice on recommended next steps. Apologies in advance for a long email.
>>
>>
d when
the pods are down. This is a subtlety of running Cassandra in Kubernetes.
On Thu, Oct 24, 2019 at 4:24 PM Reid Pinchback
wrote:
> Ben, you may find this helpful:
>
>
>
> https://blog.pythian.com/so-you-have-a-broken-cassandra-sstable-file/
>
>
>
>
>
> cause of a problem, but in your case there's so many potential problems I
> don't think it's worth digging into, especially since the problem might be
> one of the 500 or so bugs that were fixed since this release.
>
> Once you've done those things it'll
Thanks Ghiyasi.
On Sat, Oct 26, 2019 at 9:17 AM Hossein Ghiyasi Mehr
wrote:
> If the problem still exists, and all nodes are up, reboot them one by one.
> Then try to repair one node. After that repair other nodes one by one.
>
> On Fri, Oct 25, 2019 at 12:56 AM Ben Mills wrote:
>
Greetings,
We are planning a Cassandra upgrade from 3.7 to 3.11.5 and considering a
change to the GC config.
What is the minimum amount of memory that needs to be allocated to heap
space when using G1GC?
For GC, we currently use CMS. Along with the version upgrade, we'll be
running the stateful
Hi Bhuvan,
You’re correct that a large unlogged batch on one partition isn’t an issue.
The logging behaviour has been/is being changed - see this jira for a
detailed discussion: https://issues.apache.org/jira/browse/CASSANDRA-10876
Cheers
Ben
On Mon, 9 May 2016 at 00:58 Bhuvan Rawal wrote
arios can SS Table files on disk from Node 1 go to Node 2 as
> is ? I’m aware this happens in *nodetool rebuild* and I am assuming this
> does *not* happen in repairs. Can someone confirm ?
>
>
>
> The reason I ask is I am working on a solution for backup / restore and I
>
Note that incremental repair strategies (2.1+) run anti-compaction against
sstables in the range being repaired, so this will prevent overstreaming
based on the ranges in the repair session.
On Mon, 9 May 2016 at 10:31 Ben Bromhead wrote:
> Yup, with repair and particularly bootstrap is th
-Instaclustr-Spark-Cassandra-
Cheers
Ben
On Tue, 10 May 2016 at 14:08 Cassa L wrote:
> Hi,
> Has anyone tried accessing Cassandra data using SparkShell? How do you do
> it? Can you use HiveContext for Cassandra data? I'm using community version
> of Cassandra-3.0
>
-go works OK if your data load
is incremental. If you do a big load before your first incremental repair
then it’s not much different to migrating to incremental repairs, so it is worth
following the procedures for migration to avoid a big impact.
Cheers
Ben
On Tue, 17 May 2016 at 16:50 Ashic Mahtab wrote
It should definitely work if you use sstableloader to load all the files. I
imagine it is possible doing a straight restore (copy sstables) if you
assign the tokens from multiple source nodes to one target node using the
initial_token parameter in cassandra.yaml.
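
A minimal sketch of combining tokens from several source nodes into settings for one target node. It assumes that `initial_token` accepts a comma-separated list and that `num_tokens` should match the number of tokens listed; the token values and the helper name `initial_token_settings` are invented for illustration, and real tokens would come from the source cluster (for example from `nodetool ring` output).

```python
def initial_token_settings(*source_token_lists):
    """Merge token lists from several source nodes into cassandra.yaml-style settings."""
    merged = sorted({int(t) for token_list in source_token_lists for t in token_list})
    return {
        "num_tokens": len(merged),
        "initial_token": ",".join(str(t) for t in merged),
    }


# Invented token values for two hypothetical source nodes.
node_a_tokens = [-9000, 100]
node_b_tokens = [-4000, 5000]

settings = initial_token_settings(node_a_tokens, node_b_tokens)
for name, value in settings.items():
    print(f"{name}: {value}")
```

This only prepares the configuration values; the sstables from both source nodes would still need to be copied to the target node as described above.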
Cheers
Ben
On Wed, 18 May 2016
It definitely should be possible for 1.5.2 (I have used it with spark-shell
and cassandra connector with 1.4.x). The main trick is in lining up all the
versions and building an appropriate connector jar.
Cheers
Ben
On Wed, 18 May 2016 at 15:40 Cassa L wrote:
> Hi,
> I followed instructi
will rebuild the sstables from the other replicas in the cluster.
Cheers
Ben
On Thu, 19 May 2016 at 17:57 Mike Yeap wrote:
> Hi all, I would like to know, is there any way to rebuild a particular
> column family when all the SSTables files for this column family are
> missing?? Say
recompiling C* with a new whitelist I guess). There is a JIRA for
non-sandboxed UDFs: https://issues.apache.org/jira/browse/CASSANDRA-9892
Cheers
Ben
On Wed, 8 Jun 2016 at 01:07 Brian Kelly wrote:
> Hi, all,
>
> I am attempting to write a trigger that depends on a native library. The
>
After adding a DC you need to run nodetool rebuild. See the procedure here:
https://docs.datastax.com/en/cassandra/2.1/cassandra/operations/ops_add_dc_to_cluster_t.html
Cheers
Ben
On Fri, 10 Jun 2016 at 14:17 Mradul Maheshwari wrote:
> Hi,
> I am facing an issue when taking snapshots.
>
obsolete snapshots)
- large partitions ( Writing large partition feed/messages:MANAGER:0
(175811867 bytes))
Cheers
Ben
On Sat, 11 Jun 2016 at 01:11 Tobin Landricombe wrote:
> Hi,
>
> I've been googling various parts of this all day but none of the
> suggestions seem to fit.
>
>
the needs of large partitions). And, there is a fair chance you
need to do (b) for a healthy cluster in the long run.
Cheers
Ben
On Sat, 11 Jun 2016 at 20:52 Tobin Landricombe wrote:
> Hi Ben,
>
> I think the degraded mode is caused by one or both of these...
> • WARN [main
I should add - there is probably an option (c) of fiddling with a bunch of
tuning parameters to try to nurse things through with your current config
but I’m not sure that’s useful unless you really need to make the current
set up work for some reason.
On Sun, 12 Jun 2016 at 15:23 Ben Slater
A high level shot in the dark but in our testing we found Spark 1.6 a lot
more reliable in low memory situations (presumably due to
https://issues.apache.org/jira/browse/SPARK-1). If it’s an option,
probably worth a try.
Cheers
Ben
On Wed, 15 Jun 2016 at 08:48 Cassa L wrote:
> Hi,
>> 0x00d0: 4964 000a 7465 7374 746f Idtestto
>> 0x00e0: 7069 6332 0002 0004 pic2
>> 0x00f0: 03e8 0064 .d
>> 17:16:24.834414 IP localhost.9042 > localhost.49128: Flags [P.], seq
>> 14:27, ack 396, win 579, options [nop,nop,TS val 72982872 ecr 72982868],
>> length 13
>> 0x: 4500 0041 f907 4000 4006 43ad 7f00 0001 E..A..@.@.C.
>> 0x0010: 7f00 0001 2352 bfe8 bfe6 87a4 83eb 4bcb #RK.
>> 0x0020: 8018 0243 fe35 0101 080a 0459 a158 ...C.5...Y.X
>> 0x0030: 0459 a154 *8400 0800 0400 * .Y.T
>> 0x0040: *01* .
>> 17:16:24.834452 IP localhost.49128 > localhost.9042: Flags [.], ack 27,
>> win 530, options [nop,nop,TS val 72982872 ecr 72982872], length 0
>> 0x: 4500 0034 370d 4000 4006 05b5 7f00 0001 E..47.@.@...
>> 0x0010: 7f00 0001 bfe8 2352 83eb 4bcb bfe6 87b1 ..#R..K.
>> 0x0020: 8010 0212 fe28 0101 080a 0459 a158 .(...Y.X
>> 0x0030: 0459 a158.Y.X
>>
>
>
> Checking the TCP flow, the data is sent and acknowledged successfully, but I am
> not sure why the data is not inserted on the server side. Is there any tool or
> method I can use to debug the Cassandra server, or am I missing something?
>
> Thanks,
> Linbo
>
--
Ben Slater
Chief Product Officer
Instaclustr: Cassandra + Spark - Managed | Consulting | Support
+61 437 929 798
running for a long time just results in overwriting a lot of data that gets
compacted away. Using n=[number > 1M] will get you n somethings.
Cheers
Ben
On Wed, 15 Jun 2016 at 22:25 Peter Kovgan
wrote:
> Hi,
>
>
>
> The cassandra-stress is not helping really to populate the d
plicated
> to the other?
>
> Whenever we add a new keyspace it seems to get replicated even if we don't
> specify the other DC in the keyspace's NetworkTopologyStrategy.
>
> -J
>
--
Ben Slater
Chief Product Officer
Instaclustr: Cassandra + Spark - Managed | Consulting | Support
+61 437 929 798
That’s the behaviour I would have expected. I’m not aware of any way to
prevent this and would be surprised if there is one (but I’ve never tried
to find one either so it might be possible).
Cheers
Ben
On Fri, 17 Jun 2016 at 12:02 Jason J. W. Williams
wrote:
> Hey Ben,
>
> Looks like
discussion of how we
went about benchmarking.
Cheers
Ben
On Fri, 8 Jul 2016 at 07:52 Yuan Fang wrote:
> Yes, here is my stress test result:
> Results:
> op rate : 12200 [WRITE:12200]
> partition rate: 12200 [WRITE:12200]
> row rate
somewhere at some point in time).
Cheers
Ben
On Thu, 28 Jul 2016 at 09:17 Alain RODRIGUEZ wrote:
> Hi,
>
> I just released a detailed post about tombstones today that might be of
> some interest for you:
> http://thelastpickle.com/blog/2016/07/27/about-deletes-and-tombstones.html
>
a way to make your partitions smaller (like
1/10th of the size).
Cheers
Ben
<https://issues.apache.org/jira/browse/CASSANDRA-11206>
On Wed, 3 Aug 2016 at 12:35 Kevin Burton wrote:
> I have a theory as to what I think is happening here.
>
> There is a correlation between the mass
Yep, that was what I was referring to.
On Thu, 4 Aug 2016 2:24 am Reynald Bourtembourg <
reynald.bourtembo...@esrf.fr> wrote:
> Hi,
>
> Maybe Ben was referring to this issue which has been mentioned recently on
> this mailing list:
> https://issues.apache.org/jira/b
Hi all,
We are investigating using Cassandra in our data platform. We would like
data to go into Cassandra first and to eventually be replicated into our
data lake in HDFS for long term cold storage. Does anyone know of a good
way of doing this? We would rather not have parallel writes to HDFS
> for duplication checks to dedup then output to another source (form of dual
> write but with dedup), this was really silly and slow. I only bring it up
> to save you the trouble in case you end up in the same path chasing for
> something more 'real time'.
>
> Regards,
> R
read at CL=ALL which could not read
sufficient replicas - a bit confusing (although I can also see how getting
CL=ALL errors when you thought you were reading at QUORUM or ONE would be
confusing).
Cheers
Ben
On Sun, 28 Aug 2016 at 10:52 kurt Greaves wrote:
> Looking at the wiki for the read p
what you are
doing, particularly if you’re working with a production cluster.
Cheers
Ben
On Tue, 30 Aug 2016 at 16:09 Aiman Parvaiz wrote:
> Hi all
> I am running C* 2.1.12 in AWS EC2 Classic with RF=3 and vnodes(256
> tokens/node). My nodes are distributed in three different availabilit
Thanks Sam - a couple of subtleties there that we missed in our review.
Cheers
Ben
On Tue, 30 Aug 2016 at 19:42 Sam Tunnicliffe wrote:
> Just to clarify a little further, it's true that read repair queries are
> performed at CL ALL, but this is slightly different to a regular,
>
Hi Siddarth,
It would probably help people provide an answer if you gave everyone some
more details, like:
- cassandra version and driver version you are using
- query that is being executed when the error occurs
- schema of the table that is being queried
Cheers
Ben
On Thu, 1 Sep 2016 at 21:19
that the node doesn’t get cleaned up through the replacement process like a
newly streamed node does (plus the extra operational complexity).
Cheers
Ben
On Thu, 15 Sep 2016 at 19:47 Vasileios Vlachos
wrote:
> Hello and thanks for your responses,
>
> OK, so i
? Too high, too low?
>>
>>
>> Thanks,
>>
>> Jerome
>>
> --
>
> Jens Rantil
> Backend Developer @ Tink
>
> Tink AB, Wallingatan 5, 111 60 Stockholm, Sweden
> For urgent matters you can reach me at +46-708-84 18 32.
>
--
Ben Bromhead
CTO | Instaclustr <https://www.instaclustr.com/>
+1 650 284 9692
Managed Cassandra / Spark on AWS, Azure and Softlayer
cluster to run
> sstableloader, what methods can I use to load a single keyspace from one
> cluster to another cluster of different size?
>
>
>
> Appreciate the help…
>
>
>
> Thanks,
>
> Justin
>
>
>
--
Ben Slater
Chief Product Officer
Instacl
distributed across all the nodes in your cluster).
Cheers
Ben
On Tue, 20 Sep 2016 at 14:09 Jun Wu wrote:
> Hi there,
>
> I have a question about the replica and replication factor.
>
> For example, I have a cluster of 6 nodes in the same data center.
> Replication factor R
) the coordinator would need to contact and execute a read on
at least one other node to satisfy the query.
Cheers
Ben
On Tue, 20 Sep 2016 at 14:50 Jun Wu wrote:
> Hi Ben,
>
> Thanks for the quick response.
>
> It's clear about the example for single row/partition.
in most uses of
Cassandra).
Cheers
Ben
On Tue, 20 Sep 2016 at 15:18 Jun Wu wrote:
>
>
> Yes, I think for my case, at least two nodes need to be contacted to get
> the full set of data.
>
> But another thing comes up about dynamic snitch. It's the wrapped snitch
> and
schema of
the table that might help.
Beyond that the only thing I can think of is running scrub if you haven’t
already done so.
Cheers
Ben
On Mon, 26 Sep 2016 at 16:36 Nikhil Sharma wrote:
> Hi,
>
> We are not exactly sure what is causing this problem. But after compaction
> happens (
-partition batches? Massive partitions?
Especially given it hit two nodes simultaneously I would be looking for
a rogue query as my first point of investigation.
Cheers
Ben
On Tue, 27 Sep 2016 at 17:49 xutom wrote:
>
> Hi, all
> I have a C* cluster with 12 nodes. My cassandra version
frozen
is relatively recent (3.6) so the doco may be out of date there.
Cheers
Ben
On Wed, 28 Sep 2016 at 13:12 zha...@easemob.com wrote:
> hi, everyone, I'm learning Cassandra now , and have some problems about
> the document of "Data Types" . I don't know where to re
My best guess is that you need to remove the quotes from around the zip
values (ie change it to zip: 20500 rather than zip: ‘20500’) as zip is
defined as an int.
Cheers
Ben
On Wed, 28 Sep 2016 at 14:38 zha...@easemob.com wrote:
> Hi, Ben Slater, thank you very much for your reply!
>
Hi,
It depends on which snitch you are using, but the DC name is
in cassandra-rackdc.properties for the GossipingPropertyFileSnitch.
Best wishes,
Ben
On 10 October 2016 at 20:54, Adam Hutson wrote:
> There is a cluster name in the cassandra.yaml for naming the cluster, aka
> data
be an issue.
Even if you can't/don't want to upgrade the instance type, the other steps
will help things.
Ben
On Tue, 11 Oct 2016 at 10:23 Satoshi Hikida wrote:
> Hi,
>
> I'm investigating the read/write performance of the C* (Ver. 2.2.8).
> However, I have an issue abou
causing, and working around) ? To work around, we tried increasing
> #compactors and reducing stream throughput so that at least incoming
> #SSTables would be controlled.
>
>
>
> This has happened to us few times in the past too, so I am wondering if
> this is a known problem (I c
olumn by positive integer and the others by 0.
>
> Makes sense ?
>
--
Ben Bromhead
CTO | Instaclustr <https://www.instaclustr.com/>
+1 650 284 9692
Managed Cassandra / Spark on AWS, Azure and Softlayer