Jeff Chan wrote:
Does anyone have any recent feedback about the performance of
ImageInfo versus FuzzyOCR about detecting stock image spams (or
any others)? Does FuzzyOCR catch significantly more spams than
ImageInfo?
Cheers,
Jeff C.
I maybe biased, as I help in FuzzyOcr development, but do use both.
ImageInfo is fine and will get you part of the way there, but FuzzyOcr
hits more often. Daily scanning ~8Kmsg/day, FuzzyOcr hits ~1600 times
and ImageInfo hits < 150 times on average. On my system, here are the
top10 rule hits from yesterday:
SPAM Results:
3936 Message(s) 49.83%
19.399 Average Score
3343 Time(s) 7.50% 84.93% Hit Rule: BAYES_99
3068 Time(s) 6.88% 77.95% Hit Rule: HTML_MESSAGE
1655 Time(s) 3.71% 42.05% Hit Rule: FUZZY_OCR
1527 Time(s) 3.42% 38.80% Hit Rule: SARE_GIF_ATTACH
1411 Time(s) 3.16% 35.85% Hit Rule: URIBL_BLACK
1274 Time(s) 2.86% 32.37% Hit Rule: URIBL_BLACK_OVERLAP
1271 Time(s) 2.85% 32.29% Hit Rule: MIME_HTML_ONLY
1215 Time(s) 2.72% 30.87% Hit Rule: URIBL_JP_SURBL
1187 Time(s) 2.66% 30.16% Hit Rule: RCVD_IN_BL_SPAMCOP_NET
1184 Time(s) 2.66% 30.08% Hit Rule: SARE_GIF_STOX
Jorge Valdes