On Fri, September 8, 2006 10:42 am, mcd said:
> This is an interesting approach, but let me explain a little more. I will
>  be running md5sums for every file on a system. I will then compare that
> list of md5sums against a list of md5sums that are know to be virus free.
> The files that do not have valid md5sums in the database will then need
> to be scanned. In a lot of situations this list of files will be in the
> thousands. At this point would I be better off calling clamscan to scan
> the entire disk, or call clamscan 10,000+ times with unknown files? I
> hope I am making this clear. Thanks for all of your help.

(Please don't top-post, it makes it hard to usefully trim the discussion.)

Running clamscan over the entire disk is still overkill in this situation;
even if 10% if only the disk does *not* need scanning, I'd restrict to
just the other 90%.

The response you replied to is probably what I'd use for this scenario:
Either write a file with the paths of the files to be scanned and pipe
that to clamscan, or just pipe it directly from you comparison script. 
That's only calling clamscan once, so you aren't hitting the startup costs
over and over.  (Which *would* be a problem.)

If doing one of those is not possible, and you must call a program
seperately for each file, use clamdscan to do the scanning, but be aware
of permissions issues based on who started clamd.

Daniel T. Staal

---------------------------------------------------------------
This email copyright the author.  Unless otherwise noted, you
are expressly allowed to retransmit, quote, or otherwise use
the contents for non-commercial purposes.  This copyright will
expire 5 years after the author's death, or in 30 years,
whichever is longer, unless such a period is in excess of
local copyright law.
---------------------------------------------------------------

_______________________________________________
http://lurker.clamav.net/list/clamav-users.html

Reply via email to