Hey - cool! ... but my gocr doesn't have that option :(
Which version do you have, and where did you get it from? Thanx Si. -----Original Message----- From: decoder [mailto:[EMAIL PROTECTED] Sent: 14 August 2006 19:47 To: users@spamassassin.apache.org Subject: Re: The arms race continues -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Michel Vaillancourt wrote: > Simon Standley wrote: >> Hi Gang, >> >> I've had the latest FuzzyOcr on test for the past day or so - >> very nice work. Congrats to all involved. >> >> Thought you may be interested in the attached GIF. It was only a >> matter of time before something like this came along ... >> >> Si. >> >> <<forgiving26.gif>> >> >> . > I've seen three of these this morning alone... and FuzzyOCR isn't > trapping them. > > --Michel Wolfstar Systems > gocr features a nice parameter called -d. It is able to remove smaller particles before scanning, compare these results: Original: [EMAIL PROTECTED] ~/Uni/SysOP-Paul/spamassassin $ gocr -i forgiving26.gif ' ''v''ìgt _' 'CÒ'O'' '0' '':CO'.M.'''_.'..'_'__'_i.'''._'' _.'''.''.'.'...'.','_ ;'_ _'. 1don '.. t. 'cn.c'k. _. s._. t'y,_' e. m'.' bro. 'w_'er).''. _ .'_ '.'.ì. .,. _ ._. _. ä'nìd.....'SA'.. V..'E... .j.Oq.'o.. .'.òn,'.m.. ù. ì.'m''. ._ìm. .'.'_i.._'_'' !..'. ' '.''VI'A'' i_' ' À ììàm'' ._.$' '3' _,''3 ''3 ' '_ ' _' i_ .' :ì.'ì ';.'. ì CIAL_I_' fr.om ..$3, 75 _' _. ' ' __ ..' ''''.' ' _. '_. _.. K. ._. .'_.ì'UM' ' _ m..Q.m. '._.$. 1 ;2.. .'.ì ..'.._. _. ._._.' '..'... _..'..'.. ' .ì '.. M.. .i'a.v...e.'.g...''m.iì''e.'. .d..a.._.'...'!,.',.'_ ;_'.'.'.. .'._... ,'_..',i_.'...._.'. ' .','...i..'..'_.'.ì'.'..'...'_.'.''._ ''.'.._ With -d 2: [EMAIL PROTECTED] ~/spamassassin $ gocr -d 2 -i forgiving26.gif t v:gt _CO00.COM ,_ 1don t cnck_s_ty,_e m' brow_'er) , _ ànd.SAVE 50q.o.o. n mur marm_cy!. VIAGRA fram $3' ,33 _ CIALI_ from $3,75 K__ì_ mQm'_$l,2l Mav,e g nIce da_'I . , The second one surely gets detected because it contains at least two words recognized (viagra and cialis). In the next version I will put - -d 2 as the default and make the parameter configurable via the cf file. Until that, simply put -d 2 into the gocr arguments. This works for this one sample, but there are plenty of other methods to avoid OCR. If you get more mails like that with different methods of obfuscation, please tell me. Chris -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFE4MUaJQIKXnJyDxURAuLiAJ40Hqd3/X1xbcsXc6xFrhOTUfkjYgCghcGl l7p7ZgIfjcHbJclBoL2LT04= =y9sq -----END PGP SIGNATURE-----