Thanks, Milind!
I looked at potentially coming out for the workshop, however, the CfP asked
for up to 12 page papers. That gave me the impression that this meeting of
the workshop was starting to try to narrow its focus and go more in-depth
on topics, and get closer to converging on a final benchm
Erik,
Very useful info. As you may know (saw the reference to BDBC in your page),
we are organizing fourth workshop on Big Data Benchmarking on Oct 9-10 in
San Jose (http://clds.ucsd.edu/bdbc/workshops/fourth_wbdb). In this
workshop, we hope to get closer to defining a definitive Big Data
benchmar
interesting.
FWIW the work on formally specify (in the Computer Science notion of
"formally") is in HADOOP-9361; the HCFS work being driven by redhat is more
about testing.
Some extra ideas on benchmarking
# something to assess performance of cross FS operations
# it'd be nice to have something t
Hello all -
As part of a side project, I've been interested in HDFS benchmarking,
particularly of the Namenode. To get started, I tried to track down a
number of different benchmarks and collect a few observations about each.
I've put together a list here:
http://epaulson.github.io/HadoopInternal