Re: A list of some HDFS benchmarks

2013-09-09 Thread Erik Paulson
Thanks, Milind! I looked at potentially coming out for the workshop; however, the CfP asked for papers of up to 12 pages. That gave me the impression that this meeting of the workshop was starting to narrow its focus, go more in-depth on topics, and get closer to converging on a final benchm

Re: A list of some HDFS benchmarks

2013-09-05 Thread Milind Bhandarkar
Erik, very useful info. As you may know (saw the reference to BDBC in your page), we are organizing the fourth workshop on Big Data Benchmarking on Oct 9-10 in San Jose (http://clds.ucsd.edu/bdbc/workshops/fourth_wbdb). In this workshop, we hope to get closer to defining a definitive Big Data benchmar

Re: A list of some HDFS benchmarks

2013-09-05 Thread Steve Loughran
Interesting. FWIW, the work on formal specification (in the Computer Science notion of "formally") is in HADOOP-9361; the HCFS work being driven by Red Hat is more about testing. Some extra ideas on benchmarking: # something to assess performance of cross FS operations # it'd be nice to have something t

A list of some HDFS benchmarks

2013-09-04 Thread Erik Paulson
Hello all - As part of a side project, I've been interested in HDFS benchmarking, particularly of the NameNode. To get started, I tried to track down a number of different benchmarks and collect a few observations about each. I've put together a list here: http://epaulson.github.io/HadoopInternal