On 25-04-2010 17:34, Yuri Vasilevski wrote: > Hello, > > On Sun, 25 Apr 2010 13:18:25 +0200 > Angelo Arrifano <mik...@gentoo.org> wrote: > >> Hello developers developers and developers, >> >> Ever wondered how much crap is left in your X-years old Gentoo box? >> >> I just developed a python utility to efficiently find orphaned files >> in the system. By orphaned files I mean the files that are present on >> system directories and don't belong to any installed package. >> >> The package builds a virtual filesystem (cache) on the RAM using >> python hash tables. Then it uses the cache to find the ownership of >> files inside user-specified dirs. >> >> Building the cache takes less than 10 seconds here in a system with >> 1366 installed packages. >> >> This is not intended to be a finished program yet, I'm looking forward >> for your constructive commentaries. > > There is a tool that does that, qfile from app-portage/portage-utils. > Check the "-o, --orphans * List orphan files" option. > > It's not as straight forward as it could be, as it checks only for > files specified as arguments or read from file. > > But you can trivially use it like: > # find /dir/you/want/to/check/for/orphans | qfile -o -f - > > Best, > Yuri. >
Based on the comments so far, I'll try to make my PoC a better tool. My primary objective is to make this some kind of disk cleanup utility for Gentoo boxens. I don't expect Gentoo systems to be *that* polluted but sometimes we all have to do ugly things to fix broken systems real fast. - If you know what I mean. There are other things that came to my mind, like using stored hashes to check the system files integrity (as in security). My next steps in regard to this utility will be: * Follow harring suggestion and use available PM API. * Make the application handle symlinks so we start getting a more informative output. * To store the generated cache on disk and to only regenerate it if needed. Regards, - Angelo