Thanks Dave and Al for spending time answering my questions.. I did go through the two documents before but did not understand how the different database get deployed.. I was under the (wrong) impression that all the databases would contain signatures for all malware.. I saw the different formats as one evolving from another in response to the sophistication malware were undergoing, guess that was wrong.
Apologize for my misunderstanding, guess thats coming from me having no background on computer security. Given that a malware's signature gets captured in one and only of the different formats, now my questions is: 1) Can ldb (more effectively) represent a malware's signature than ndb or mdb or hdb? Also, is it true that databases which contain(allow) regular expression,like ndb or ldb, would lead to a decrease in the size of the database as variants of the malware can be easily captured by fewer signatures? If yes, why do we need mdb,hdb or ndb? Is it for legacy reasons (or) for code performance (or) something else? 2) What are the factors that an expert signature author would consider before deciding on the format of the signature? 3) How do the different formats get handled by the antivirus coder at design and compilation? I mean if we were mapping a string matching algorithm which is targeted for fixed length strings(mdb signatures for instance) that could be designed/optimized differently compared to an algorithm that handles variable length regular expressions(ldb, ndb). thanks a lot! -Kaushik On Wed, Jan 23, 2013 at 7:03 PM, David Raynor <dray...@sourcefire.com>wrote: > On Wed, Jan 23, 2013 at 9:56 PM, Al Varnell <alvarn...@mac.com> wrote: > > > On 1/23/13 5:52 PM, "Kaushik Vaidyanathan" wrote: > > > > > I had a couple of basic questions: > > > a) Of the different signature formats in the cvd file(like mdb, ldb, > ndb) > > > which format does clamav use? Does it pick a format(ldb, mdb, ndb > > > etc.) depending on the nature of the file under inspection? > > > > > It uses all of them, but some are format dependant. > > > > > b) I guess ldb files are tough to create automatically. If thats true, > > then > > > is the ldb file as complete as the mdb file? > > > > > My impression is that there is little, if any automation involved in the > > creation of a signature. I believe they are all done manually and then > > checked thorough an automated process. > > > > > c) Which signature database(ldb or ndb or mdb etc.) is best tradeoff > > > between size of database Vs false positives? > > > > > They serve different purposes, so I don't understand what sort of > tradeoff > > you would be interested in, if there are even statistics available to > > determine the answer. > > > > Have you read through the documentation at > > <http://www.clamav.net/doc/webinars/Webinar-Alain-2009-03-04.pdf> > > and > > <http://www.clamav.net/doc/latest/signatures.pdf>? > > > > > > -Al- > > > > -- > > Al Varnell > > Mountain View, CA > > > > > > > > _______________________________________________ > > Help us build a comprehensive ClamAV guide: visit http://wiki.clamav.net > > http://www.clamav.net/support/ml > > > > ClamAV loads the entire CVD, with the possible exception of signatures > which are only loaded if you turn on certain features (e.g. PUA scanning, > phishing checks, etc.). The 3 filetypes you list are always loaded. What > kind of signature is written to detect a malware is a choice the signature > author makes. > > Knowing those things may change the way you think about your questions. So > I agree with Al. The 2 references he is suggesting would probably be a > helpful read. > Dave R. > > -- > --- > Dave Raynor > Sourcefire Vulnerability Research Team > dray...@sourcefire.com > _______________________________________________ > Help us build a comprehensive ClamAV guide: visit http://wiki.clamav.net > http://www.clamav.net/support/ml > _______________________________________________ Help us build a comprehensive ClamAV guide: visit http://wiki.clamav.net http://www.clamav.net/support/ml