On May 28, 2009, at 2:00 PM, Patrick Angeles wrote:

> On Thu, May 28, 2009 at 10:24 AM, Brian Bockelman <[email protected]> wrote:
>
>> We do both -- push the disk image out to NFS and have mirrored SAS
>> hard drives on the namenode. The SAS drives appear to be overkill.
>
> This sounds like a nice approach, taking into account hardware, labor,
> and downtime costs... $700 for a RAID controller seems reasonable to
> minimize maintenance due to a disk failure. Alex's suggestion to go
> JBOD and write to all volumes would work as well, but is slightly more
> labor intensive.
Remember, though, that disk failure downtime is actually rather rare.
The question is "how tight is your hardware budget?": if $700 is worth
the extra day of uptime a year, then spend it. I come from an academic
background where (a) we don't lose money if things go down and (b) jobs
move to another site in the US if things are down. That perhaps explains
my somewhat relaxed attitude.

I'm not a hardware guy anymore, but I'd personally prefer software
RAID. I've seen mirrored disks go down because the RAID controller
decided to puke.
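For reference, Alex's JBOD suggestion boils down to pointing the
namenode at several independent volumes; the namenode mirrors its image
and edit log to every listed directory itself, no RAID controller
needed. A sketch using the 0.20-era property name; the paths (and the
NFS mount point) are made-up examples:

```xml
<!-- hdfs-site.xml: the namenode writes its metadata to every listed
     directory, so losing any one disk (or the NFS mount) is survivable. -->
<property>
  <name>dfs.name.dir</name>
  <value>/disk1/hdfs/name,/disk2/hdfs/name,/mnt/nfs/hdfs/name</value>
</property>
```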
>>> 2. What is a good processor-to-storage ratio for a task node with
>>> 4TB of raw storage? (The config above has 1 core per 1TB of raw
>>> storage.)
>>
>> We're data hungry locally -- I'd put in bigger hard drives. The 1.5TB
>> Seagate drives seem to have passed their teething issues, and are at
>> a pretty sweet price point. They will only scale up to 60 IOPS,
>> though, so make sure your workflows don't have lots of random I/O.
>
> I haven't seen too many vendors offering the 1.5TB option. What type
> of data are you working with? At what volumes? I sense that at
> 50GB/day, we are higher than average in terms of data volume over
> time.
We have just short of 300TB of raw disk; our daily downloads range
from a few GB to 10TB.
We bought 1.5TB drives separately from the nodes and sent students
with screwdrivers at the cluster.
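To put the 60 IOPS figure in perspective, here is a back-of-the-envelope
sketch; the 64 KB random-read size and ~100 MB/s streaming rate are
illustrative assumptions, not measurements:

```python
# Rough throughput comparison for a single SATA drive.
RANDOM_IOPS = 60          # seeks per second (assumed)
SEQ_MB_PER_S = 100.0      # streaming read rate, MB/s (assumed)
READ_KB = 64              # size of each random read, KB (assumed)

# Effective bandwidth if every read costs a seek:
random_mb_per_s = RANDOM_IOPS * READ_KB / 1024.0

print(f"random:     {random_mb_per_s:.2f} MB/s")   # 3.75 MB/s
print(f"sequential: {SEQ_MB_PER_S:.0f} MB/s")
print(f"slowdown:   {SEQ_MB_PER_S / random_mb_per_s:.0f}x")
```

A seek-heavy workflow gets well under 4 MB/s per spindle -- roughly a
27x penalty versus streaming -- which is why the random-I/O caveat
matters more than the raw capacity.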
>> As Steve mentions below, the rest is really up to your algorithm. Do
>> you need 1 CPU second / byte? If so, buy more CPUs. Do you need .1
>> CPU second / MB? If so, buy more disks.
>
> Unfortunately, we won't know until we have a cluster to test on.
> Classic catch-22. We are going to experiment with a small cluster and
> a small data set, with plans to buy more appropriately sized slave
> nodes based on what we learn.
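The CPU-vs-disk rule of thumb above can be turned into a quick sizing
check once you have profiled a job on the small cluster. All the
numbers below are illustrative assumptions:

```python
def bottleneck(cpu_sec_per_mb, cores, disk_mb_per_s):
    """Return which resource limits a node: 'cpu' or 'disk'.

    cpu_sec_per_mb -- CPU seconds needed per MB of input (from profiling)
    cores          -- CPU cores in the node
    disk_mb_per_s  -- aggregate sequential disk bandwidth of the node
    """
    cpu_mb_per_s = cores / cpu_sec_per_mb   # MB/s the CPUs can chew through
    return "cpu" if cpu_mb_per_s < disk_mb_per_s else "disk"

# A heavy job at 1 CPU second per MB on a 4-core node with 4 drives
# (~400 MB/s streaming): the CPUs cap out at 4 MB/s -- buy more CPUs.
print(bottleneck(1.0, 4, 400))    # -> cpu

# A light job at 0.001 CPU seconds per MB: the CPUs could handle
# 4,000 MB/s, ten times what the disks deliver -- buy more disks.
print(bottleneck(0.001, 4, 400))  # -> disk
```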
In that case, you're probably good! 24TB probably formats out to
20TB. With 2x replication at 50GB a day, you've got enough room for
about half a year of data. Hope your procurement process isn't too
slow!
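The capacity math works out as follows (the ~20TB formatted figure and
50GB/day rate are the estimates from this thread; 1 TB is taken as
1000 GB for simplicity):

```python
formatted_tb = 20.0   # ~24TB raw after filesystem overhead (estimate above)
replication = 2
daily_gb = 50.0

raw_gb_per_day = replication * daily_gb           # disk consumed per day
days = formatted_tb * 1000.0 / raw_gb_per_day

print(f"{days:.0f} days (~{days / 30:.1f} months) of headroom")
```

That gives 200 days -- a bit over half a year, consistent with the
estimate above, and ignoring any growth in the daily download rate.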
Brian