Eclipse doesn't show that RestoreSnapshotHelper.restoreHdfsRegions() is called by initTableSnapshotMapperJob (in master branch)
Looking at TableMapReduceUtil.java in 0.98, I don't see direct relation between the two. Do you have stack trace or something else showing the relationship ? Cheers On Sun, Sep 7, 2014 at 5:48 AM, Brian Jeltema < [email protected]> wrote: > initTableSnapshotMapperJob writes into this directory (indirectly) via > RestoreSnapshotHelper.restoreHdfsRegions > > Is this expected? I would have expected writes to be limited to the temp > directory passed in the init call > > Brian > > On Sep 7, 2014, at 8:17 AM, Ted Yu <[email protected]> wrote: > > > The files under archive directory are referenced by snapshots. > > Please don't delete them manually. > > > > You can delete unused snapshots. > > > > Cheers > > > > On Sep 7, 2014, at 4:08 AM, Brian Jeltema < > [email protected]> wrote: > > > >> > >> On Sep 6, 2014, at 9:32 AM, Ted Yu <[email protected]> wrote: > >> > >>> Can you post your hbase-site.xml ? > >>> > >>> /apps/hbase/data/archive/data/default is where HFiles are archived > (e.g. > >>> when a column family is deleted, HFiles for this column family are > stored > >>> here). > >>> /apps/hbase/data/data/default seems to be your hbase.rootdir > >> > >> hbase.rootdir is defined to be hdfs://foo:8020/apps/hbase/data. I think > that's the default that Ambari creates. > >> > >> So the HFiles in the archive subdirectory have been discarded and can > be deleted safely? > >> > >>> bq. a problem I'm having running map/reduce jobs against snapshots > >>> > >>> Can you describe the problem in a bit more detail ? > >> > >> I don't understand what I'm seeing well enough to ask an intelligent > question yet. > >> I appear to be scanning duplicate rows when using > initTableSnapshotMapperJob, > >> but I'm trying to get a better understanding of how this works, since > It's probably just > >> something I'm doing wrong. > >> > >> Brian > >> > >>> Cheers > >>> > >>> > >>> On Sat, Sep 6, 2014 at 6:09 AM, Brian Jeltema < > >>> [email protected]> wrote: > >>> > >>>> I'm trying to track down a problem I'm having running map/reduce jobs > >>>> against snapshots. > >>>> Can someone explain the difference between files stored in: > >>>> > >>>> /apps/hbase/data/archive/data/default > >>>> > >>>> and files stored in > >>>> > >>>> /apps/hbase/data/data/default > >>>> > >>>> (Hadoop 2.4, HBase 0.98) > >>>> > >>>> Thanks > >> > > > >
