On Sep 6, 2014, at 9:32 AM, Ted Yu <[email protected]> wrote:

> Can you post your hbase-site.xml ?
> 
> /apps/hbase/data/archive/data/default is where HFiles are archived (e.g.
> when a column family is deleted, HFiles for this column family are stored
> here).
> /apps/hbase/data/data/default seems to be your hbase.rootdir
> 
> 

hbase.rootdir is defined to be hdfs://foo:8020/apps/hbase/data. I think that's 
the default that Ambari creates.

So the HFiles in the archive subdirectory have been discarded and can be 
deleted safely? 

> bq. a problem I'm having running map/reduce jobs against snapshots
> 
> Can you describe the problem in a bit more detail ?
> 
> 

I don't understand what I'm seeing well enough to ask an intelligent question 
yet.
I appear to be scanning duplicate rows when using initTableSnapshotMapperJob,
but I'm trying to get a better understanding of how this works, since It's 
probably just
something I'm doing wrong.

Brian

> Cheers
> 
> 
> On Sat, Sep 6, 2014 at 6:09 AM, Brian Jeltema <
> [email protected]> wrote:
> 
>> I'm trying to track down a problem I'm having running map/reduce jobs
>> against snapshots.
>> Can someone explain the difference between files stored in:
>> 
>>    /apps/hbase/data/archive/data/default
>> 
>> and files stored in
>> 
>>    /apps/hbase/data/data/default
>> 
>> (Hadoop 2.4, HBase 0.98)
>> 
>> Thanks

Reply via email to