Re: Un-referenced rfiles in hdfs

2022-07-12 Thread Christopher
This question seems to be a log4j performance/behavior question. You may have more luck at https://logging.apache.org/log4j for anything specific to the behavior of log4j. On Fri, Jul 8, 2022 at 2:44 AM Hart, Andrew via user wrote: > > > > I am still trying to track down the cause of the un-refer

RE: Un-referenced rfiles in hdfs

2022-07-07 Thread Hart, Andrew via user
I am still trying to track down the cause of the un-referenced rfiles and unsplitable tablets causing hold-time tservers exits. In the logs I see (something like) 2022-07-07 12:00:00 ..splitting tablet x 2022-07-07 12:00:03 x was split size1 10 size2 11 time 30ms So in the lo

RE: Un-referenced rfiles in hdfs

2022-06-30 Thread Hart, Andrew via user
t: Re: Un-referenced rfiles in hdfs EXTERNAL SENDER: Do not click any links or open any attachments unless you trust the sender and know the content is safe. EXPÉDITEUR EXTERNE: Ne cliquez sur aucun lien et n'ouvrez aucune pièce jointe à moins qu'ils ne proviennent d'un expéditeur fi

Re: Un-referenced rfiles in hdfs

2022-06-29 Thread Christopher
The Accumulo file garbage collection mechanism is designed to fail safe to only delete files it knows are no longer in use. It also tries to do this with minimal interaction with the hdfs name node (so, no scanning the entire file system to find files). It's possible that in some circumstances, ser

Un-referenced rfiles in hdfs

2022-06-29 Thread Hart, Andrew via user
Hi, I have some rfiles in hdfs that aren't referenced in the accumulo.metadata. So there will be a file like 85 2022-02-02 11:59 /accumulo/tables/3/t-1234567/Cabcdef.rf but grep -t accumulo.metadata Cabcdef.rf doesn't find anything. Is there any way run the gc process so that it cleans