Hey - cool!

... but my gocr doesn't have that option :(

Which version do you have, and where did you get it from?

Thanx

Si.

-----Original Message-----
From: decoder [mailto:[EMAIL PROTECTED]
Sent: 14 August 2006 19:47
To: users@spamassassin.apache.org
Subject: Re: The arms race continues


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Michel Vaillancourt wrote:
> Simon Standley wrote:
>> Hi Gang,
>>
>> I've had the latest FuzzyOcr on test for the past day or so -
>> very nice work. Congrats to all involved.
>>
>> Thought you may be interested in the attached GIF. It was only a
>> matter of time before something like this came along ...
>>
>> Si.
>>
>> <<forgiving26.gif>>
>>
>> .
> I've seen three of these this morning alone...  and FuzzyOCR isn't
> trapping them.
>
> --Michel Wolfstar Systems
>

gocr features a nice parameter called -d. It is able to remove smaller
particles before scanning, compare these results:


Original:

[EMAIL PROTECTED] ~/Uni/SysOP-Paul/spamassassin $ gocr -i forgiving26.gif
' ''v''ìgt _' 'CÒ'O'' '0' '':CO'.M.'''_.'..'_'__'_i.'''._''
_.'''.''.'.'...'.','_
;'_ _'. 1don '.. t. 'cn.c'k. _. s._. t'y,_' e. m'.' bro. 'w_'er).''. _
.'_ '.'.ì. .,. _ ._.
_. ä'nìd.....'SA'.. V..'E... .j.Oq.'o.. .'.òn,'.m.. ù. ì.'m''. ._ìm.
.'.'_i.._'_'' !..'. '
'.''VI'A'' i_' ' À ììàm'' ._.$' '3' _,''3 ''3 ' '_ ' _' i_ .'  :ì.'ì ';.'.
ì CIAL_I_' fr.om ..$3, 75 _' _. ' ' __ ..' ''''.' ' _. '_.
_.. K. ._. .'_.ì'UM' ' _ m..Q.m. '._.$. 1 ;2.. .'.ì ..'.._. _. ._._.'
'..'... _..'..'.. ' .ì '..
M.. .i'a.v...e.'.g...''m.iì''e.'. .d..a.._.'...'!,.',.'_ ;_'.'.'..
.'._... ,'_..',i_.'...._.'. ' .','...i..'..'_.'.ì'.'..'...'_.'.''._
''.'.._


With -d 2:

[EMAIL PROTECTED] ~/spamassassin $ gocr -d 2 -i forgiving26.gif
t
v:gt _CO00.COM
,_  1don t cnck_s_ty,_e m' brow_'er)   , _
ànd.SAVE 50q.o.o. n mur marm_cy!.
VIAGRA fram $3' ,33      _
CIALI_ from $3,75
K__ì_ mQm'_$l,2l
Mav,e g nIce da_'I    . ,



The second one surely gets detected because it contains at least two
words recognized (viagra and cialis). In the next version I will put
- -d 2 as the default and make the parameter configurable via the cf
file. Until that, simply put -d 2 into the gocr arguments.

This works for this one sample, but there are plenty of other methods
to avoid OCR.

If you get more mails like that with different methods of obfuscation,
please tell me.



Chris
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFE4MUaJQIKXnJyDxURAuLiAJ40Hqd3/X1xbcsXc6xFrhOTUfkjYgCghcGl
l7p7ZgIfjcHbJclBoL2LT04=
=y9sq
-----END PGP SIGNATURE-----

Reply via email to