On Thu, 9 Jun 2005, Chris Santerre wrote:

> >My only comment on a system like this is that it could be
> >easily subverted.
> >A spammer could use automated image editting tools to randomly
> >change some
> >aspect of the file that would give it a totally different MD5 sum. Like
> >changing the lower right pixel to a different color would
> >throw the md5 sum
> >way off.
>
> I completely agree. But I'd like to see it tried. Then maybe combine it with
> distancing techniques to see how distant one MD5 is to another.

Nice try, but the crypto characteristic of MD5 makes this totally
impractical. One of the attributes of MD5 (by design) is that even small
changes in the input cause signficant changes in the output.
This is intended to deter attackers from breaking crypto systems
with incremental guessing methods.

Try this; take a 100Kbyte text file, get a MD5 sum, change one letter
(say a 'b' to 'c') and re-calculate the MD5 sum.
Note that almost every digit of that 32 digit hex value has changed,
even tho you've changed only 1 bit out of 800,000 bits of data in
that file.

There are image processing algorithms that are much better at 'looking'
at two images and giving a 'distance' value. (Only problem is that they're
compute intensive).

-- 
Dave Funk                                  University of Iowa
<dbfunk (at) engineering.uiowa.edu>        College of Engineering
319/335-5751   FAX: 319/384-0549           1256 Seamans Center
Sys_admin/Postmaster/cell_admin            Iowa City, IA 52242-1527
#include <std_disclaimer.h>
Better is not better, 'standard' is better. B{

Reply via email to