Re: 2014 nosql benchmark

Wilm Schumacher Thu, 18 Dec 2014 22:48:50 -0800

Hi,

I'm always interessted in such benchmark experiments, because the
databases evolve so fast, that the race is always open and there is a
lot motion in there.


And of course I askes myself the same question. And I think that this
publication is unreliable. For 4 reasons (from reading very fast,
perhaps there is more):

1.) It is unclear what this is all about. The title is "NoSQL
Performance Testing". The subtitle is "In-Memory Performance Comparison
of SequoiaDB, Cassandra,  and MongoDB". However, in the introduction
there is not one word about "in memory performance". The introduction
could be a general introduction for a general "on-disk-nosql" benchmark.
So ... only the subtitle (and a short sentence in the "Result Summary")
says what this is actually about.

2.) There are very important databases missing. For "in memory" e.g.
redis. If e.g. redis is not a valid candidate in this race, why is this
so?MySQL is capable of "in memory" distributed databanking, too.

3.) The methodology is unclear. Perhaps I'm the only one, but what does
"Run workload for 30 minutes (workload file workload[1-5]) " mean for
mixed read/write ops? Why 30 min? Okay, I can image, that the authors
estimated the throughput, preset the number of 100 Mio rows and designed
it to be larger than the estimated throughput in x minutes. However, all
this information is missing. And why 45% and 22% of RAM? My first Idea
would be a VERY low ration, like 2% or so, and a VERY large ratio, like
80-90%. And than everything in between. Is 22% or 45% somehow a magic
number? Furthermore in the Result summary there 1/2 and 1/4 of RAM are
discussed. Okay, 22% is near 1/4 ... but where does the difference
origin from? And btw. ... 22% of what? Stuff to insert? Stuff already
insererted? It's all deductable, but it's strange that the description
is so sloppy.

4.) There is no repetion of the loads (as I understand). Its one run,
one result ... and it's done. I don't know a lot of cassandra in
in-memory use. But either the experiment should be repeated quite some
runs OR it should be explained why this is not neccessary.

Okay, perhaps 1 is a little picky, and 4 is a little fussy. But 3 is
strange and 2 stinks.

Well, just my first impression. And that's Cassandra is very fast ;).

Best regards

Wilm


Am 19.12.2014 um 06:41 schrieb diwayou:
> i just have read this benchmark pdf, does anyone have some opinion
> about this?
> i think it's not fair about cassandra
> url:http://www.bankmark.de/wp-content/uploads/2014/12/bankmark-20141201-WP-NoSQLBenchmark.pdf‍
> http://msrg.utoronto.ca/papers/NoSQLBenchmark‍
>

Re: 2014 nosql benchmark

Reply via email to