The files under archive directory are referenced by snapshots. 
Please don't delete them manually. 

You can delete unused snapshots. 

Cheers

On Sep 7, 2014, at 4:08 AM, Brian Jeltema <[email protected]> 
wrote:

> 
> On Sep 6, 2014, at 9:32 AM, Ted Yu <[email protected]> wrote:
> 
>> Can you post your hbase-site.xml ?
>> 
>> /apps/hbase/data/archive/data/default is where HFiles are archived (e.g.
>> when a column family is deleted, HFiles for this column family are stored
>> here).
>> /apps/hbase/data/data/default seems to be your hbase.rootdir
> 
> hbase.rootdir is defined to be hdfs://foo:8020/apps/hbase/data. I think 
> that's the default that Ambari creates.
> 
> So the HFiles in the archive subdirectory have been discarded and can be 
> deleted safely? 
> 
>> bq. a problem I'm having running map/reduce jobs against snapshots
>> 
>> Can you describe the problem in a bit more detail ?
> 
> I don't understand what I'm seeing well enough to ask an intelligent question 
> yet.
> I appear to be scanning duplicate rows when using initTableSnapshotMapperJob,
> but I'm trying to get a better understanding of how this works, since It's 
> probably just
> something I'm doing wrong.
> 
> Brian
> 
>> Cheers
>> 
>> 
>> On Sat, Sep 6, 2014 at 6:09 AM, Brian Jeltema <
>> [email protected]> wrote:
>> 
>>> I'm trying to track down a problem I'm having running map/reduce jobs
>>> against snapshots.
>>> Can someone explain the difference between files stored in:
>>> 
>>>   /apps/hbase/data/archive/data/default
>>> 
>>> and files stored in
>>> 
>>>   /apps/hbase/data/data/default
>>> 
>>> (Hadoop 2.4, HBase 0.98)
>>> 
>>> Thanks
> 

Reply via email to