Re: What if an XML file cross boundary of HDFS chunks?

2010-05-17 Thread Vamc
Hi Steve, I am new to this forum and a buddy on Hadoop.. I have same kind of problem where input file is not able to treated as a text file .. Cant we do like this , Define our own InputFormat ,InputSplit and RecordReader.. Thanks Vamsi Jeff Zhang-4 wrote: > > Hi Steve, > > When you

Basic Hadoop Doubt

2010-05-17 Thread Vamc
Hi All, Vamc here, Buddy in Hadoop I have some basic doubt on hadoop Input Data placement... Like, If i input some 30GB of data to hadoop program , it will place the 30gb into HDFS into some set of files based on some input formats.. I have 2 doubts here .. 1. Each time i run a program