RE: proper method for writing files to hdfs

2009-05-18 Thread Bill Habermaas
riginal Message- From: Sasha Dolgy [mailto:sdo...@gmail.com] Sent: Monday, May 18, 2009 9:50 AM To: core-user@hadoop.apache.org Subject: Re: proper method for writing files to hdfs Ok, on the same page with that. Going back to the original question. In our scenario we are trying to stream d

RE: proper method for writing files to hdfs

2009-05-18 Thread Habermaas, William
-Original Message- From: Sasha Dolgy [mailto:sdo...@gmail.com] Sent: Monday, May 18, 2009 9:50 AM To: core-user@hadoop.apache.org Subject: Re: proper method for writing files to hdfs Ok, on the same page with that. Going back to the original question. In our scenario we are trying to

Re: proper method for writing files to hdfs

2009-05-18 Thread Sasha Dolgy
Ok, on the same page with that. Going back to the original question. In our scenario we are trying to stream data into HDFS and despite the posts and hints I've been reading, it's still tough to crack this nut and this is why I thought (and thankfully I wasn't right) that we were going about this

RE: proper method for writing files to hdfs

2009-05-18 Thread Bill Habermaas
point the namenode's data so you can recover from a namenode failure that has corrupted data. Bill -Original Message- From: Sasha Dolgy [mailto:sdo...@gmail.com] Sent: Monday, May 18, 2009 9:34 AM To: core-user@hadoop.apache.org Subject: Re: proper method for writing files to hdf

Re: proper method for writing files to hdfs

2009-05-18 Thread Sasha Dolgy
Hi Bill, Thanks for that. If the NameNode is unavailable, how do we find the secondary name node? Is there a way to deal with this in the code or should a load balancer of some type sit above each and only direct traffic to the name node if its listening? -sd On Mon, May 18, 2009 at 2:09 PM, B

RE: proper method for writing files to hdfs

2009-05-18 Thread Bill Habermaas
Sasha, Connecting to the namenode is the proper way to establish the hdfs connection. Afterwards the Hadoop client handler that is called by your code will go directly to the datanodes. There is no reason for you to communicate directly with a datanode nor is there a way for you to even know wher