+1 on viewfs, I was going to add that :-)

To add to this, viewfs can be use as a federation layer for supporting
multiple HDFS clusters for checkpoint/savepoint HA purpose as well.

--
Rong

On Wed, May 23, 2018 at 6:29 AM, Stephan Ewen <se...@apache.org> wrote:

> I think that Hadoop recommends to solve such setups with a viewfs:// that
> spans both HDFS clusters and then the two different clusters look like
> different paths within on file system. Similar as mounting different file
> systems into one directory tree in unix.
>
> On Tue, May 22, 2018 at 4:41 PM, Kien Truong <duckientru...@gmail.com>
> wrote:
>
>> You only need to modify the core-site and hdfs-site read by Flink.
>>
>> Regards,
>>
>> Kiên
>> On 5/22/2018 9:07 PM, Deepak Sharma wrote:
>>
>> Wouldnt 2 core-site and hdfs-site xmls need to be provided in this case
>> then ?
>>
>> Thanks
>> Deepak
>>
>> On Tue, May 22, 2018, 19:34 Raul Valdoleiros <
>> raul.valdoleiros.olive...@gmail.com> wrote:
>>
>>> Hi Kien,
>>>
>>> Thanks for you reply.
>>>
>>> Your goal is to store the checkpoints in one hdfs cluster and the data
>>> in other hdfs cluster.
>>>
>>> So the flink should be able to connect to two different hdfs clusters.
>>>
>>> Thanks
>>>
>>> 2018-05-22 15:00 GMT+01:00 Kien Truong <duckientru...@gmail.com>:
>>>
>>>> Hi,
>>>>
>>>> If your cluster are not high-availability clusters then just use the
>>>> full path to the cluster.
>>>>
>>>> For example, to refer to directory /checkpoint on cluster1, use
>>>> hdfs://namenode1_ip:port/checkpoint
>>>>
>>>> Like wise, /data on cluster2 will be hdfs://namenode2_ip:port/data
>>>>
>>>>
>>>> If your cluster is a HA cluster, then you need to modify the
>>>> hdfs-site.xml like section 1 of this guide
>>>>
>>>> https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.4/bk_
>>>> administration/content/distcp_between_ha_clusters.html
>>>>
>>>> Then use the full path to the cluster hdfs://cluster1ha/checkpoint &
>>>> hdfs://cluster2ha/data
>>>>
>>>> Regards,
>>>> Kien
>>>>
>>>>
>>>> On 5/21/2018 9:19 PM, Raul Valdoleiros wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> I want to store my data in one hdfs and the flink checkpoints in
>>>>> another hdfs. I didn't find a way to do it, anyone can point me a 
>>>>> direction?
>>>>>
>>>>> Thanks in advance,
>>>>> Raul
>>>>>
>>>>
>>>
>

Reply via email to