No, you don't need Hdfs. Hadoop common/ Hadoop core should be enough. But make 
sure you add it to the classpath as I mentioned before.

Hari

On Nov 19, 2012, at 4:27 PM, Rahul Ravindran <rahu...@yahoo.com> wrote:

> That is unfortunate. Is it sufficient if I package just hadoop-common.jar or 
> is the recommended way essentially doing an apt-get install flume-ng which 
> will install the below
> 
> # apt-cache depends flume-ng
> 
> flume-ng
>   Depends: adduser
>   Depends: hadoop-hdfs
>   Depends: bigtop-utils
> 
> My concern is that hadoop-hdfs brings in a ton of other stuff which will not 
> be used in any box except the one running the hdfs sink.
> 
> Thanks,
> ~Rahul.
> From: Hari Shreedharan <hshreedha...@cloudera.com>
> To: user@flume.apache.org; Rahul Ravindran <rahu...@yahoo.com> 
> Sent: Monday, November 19, 2012 4:08 PM
> Subject: Re: Hadoop jars
> 
> Unfortunately, the FileChannel too has a hadoop dependency - even though the 
> classes are never used. So you need the hadoop jars (and they should be added 
> to FLUME_CLASSPATH in flume-env.sh or HADOOP_HOME/HADOOP_PREFIX should be 
> set) on machines which will use the FileChannel. The channel directly does 
> not depend on Hadoop anymore, but still needs them in the class path because 
> we support migration from the older format to new format.                
> 
> 
> Thanks,
> Hari
> 
> -- 
> Hari Shreedharan
> 
> On Monday, November 19, 2012 at 4:04 PM, Rahul Ravindran wrote:
>> Thanks for the responses.
>> 
>> Good to know that the only external dependencies are Hadoop and Hbase. We 
>> will deploy those components only on boxes which are going to have those 
>> sinks set up.
>> 
>> From: Hari Shreedharan <hshreedha...@cloudera.com>
>> To: user@flume.apache.org 
>> Sent: Monday, November 19, 2012 3:29 PM
>> Subject: Re: Hadoop jars
>> 
>> Flume installs all required binaries, except for Hadoop (and the 
>> dependencies it would pull in) and HBase. This is because Flume, like most 
>> other Hadoop ecosystem components is meant to work against binary 
>> incompatible versions of Hadoop (Hadoop-1/Hadoop2). So instead of packaging 
>> hadoop jars with Flume, we expect Hadoop to be available on the machines you 
>> are running Flume on. Once you install Hadoop you should not have any 
>> dependency issues. Same is true for HBase.
>> 
>> 
>> Hari
>> 
>> -- 
>> Hari Shreedharan
>> 
>> On Monday, November 19, 2012 at 2:33 PM, Mohit Anchlia wrote:
>>> Easiest way is to install cdh binary and point your flume's classpath to it.
>>> 
>>> On Mon, Nov 19, 2012 at 2:25 PM, Roshan Naik <ros...@hortonworks.com> wrote:
>>>> Currently, unfortunately, i dont think there is any such documentation.
>>>> A  very general answer would be..Normally this list would depend on the 
>>>> source/sink/channel you are using.
>>>> I think it would be nice if the user manual did list these external 
>>>> dependencies for each component.
>>>> I am not the expert on HDFS sink.. but i dont see why it would depend on 
>>>> anything more than HDFS itself. 
>>>> -roshan
>>>> 
>>>> 
>>>> On Mon, Nov 19, 2012 at 2:18 PM, Rahul Ravindran <rahu...@yahoo.com> wrote:
>>>>> Are there other such libraries which will need to be downloaded? Is there 
>>>>> a well-defined location for the hadoop jar and any other jars that flume 
>>>>> may depend on?
> 
> 
> 

Reply via email to