> When you say that you have huge images, how big is "huge?" Yes, we're looking at some images that are 100Megs in size, but nothing like what you're speaking of. This helps me understand Hadoop's usage better and unfortunately it won't be the fit I was hoping for.
> You can use the API or the FUSE module to mount hadoop but that is not > a direct goal of hadoop. Hope that helps. Very interesting, and yes, that indeed does help, not to veer off thread too much, but does Sun's Lustre follow in the steps of Gluster then? I know Lustre requires kernel patches to install, so it's at a different level than the others, but I have seen some articles about large scale clusters built with Lustre and want to look at that as another option. Again, thanks for the info, if anyone has general information on cluster software, or know of a more appropriate list, I'd appreciate the advice. Thanks P On Thu, Mar 26, 2009 at 12:32 PM, Edward Capriolo <[email protected]> wrote: > It is a little more natural to connect to HDFS from apache tomcat. > This will allow you to skip the FUSE mounts and just use the HDFS-API. > > I have modified this code to run inside tomcat. > http://wiki.apache.org/hadoop/HadoopDfsReadWriteExample > > I will not testify to how well this setup will perform under internet > traffic, but it does work. > > GlusterFS is more like a traditional POSIX filesystem. It supports > locking and appends and you can do things like put the mysql data > directory on it. > > GLUSTERFS is geared for storing data to be accessed with low latency. > Nodes (Bricks) are normally connected via GIG-E or infiniban. The > GlusterFS volume is mounted directly on a unix system. > > Hadoop is a user space file system. The latency is higher. Nodes are > connected by GIG-E. It is closely coupled with MAP/REDUCE. > > You can use the API or the FUSE module to mount hadoop but that is not > a direct goal of hadoop. Hope that helps. >
