On 24 Apr 2009 at 22:12 CEST, Igor Chudov wrote:
> I get plenty of these also, and cannot get them to score well.
>
> These advertise knockoffs of bestselling Pfizer products. The text is
> meaningless garbage text. The sales message is contained in a PNG
> image, but it could be other image types like jpeg.
>
> http://igor.chudov.com/tmp/spam008.txt
>
> Any ideas what I can do?

You can install FuzzyOcr:
<http://wiki.apache.org/spamassassin/FuzzyOcrPlugin>

,----
| X-Spam-Status: Yes, score=19.8 required=5.0 tests=BADRELAY,BAYES_99,FUZZY_OCR,
|       HK_IMGSPAM,HTML_MESSAGE,SAGREY autolearn=no version=3.2.5
| X-Spam-Relay-Country: US TR
| X-Spam-Report: =?ISO-8859-1?Q?
|       *  3.5 BAYES_99 BODY: Spamwahrscheinlichkeit nach Bayes-Test: 99-100%
|       *      [score: 1.0000]
|       *  0.3 HTML_MESSAGE BODY: Nachricht enth=e4lt HTML
|       *  2.5 BADRELAY bad Relay
|       *  2.0 HK_IMGSPAM Inline image in message, Bayes think it's spam
|       *   10 FUZZY_OCR BODY:
|       *  1.0 SAGREY Adds 1.0 to spam from first-time senders
`----

,----[ fuzzyocr.log ]
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "cialis" with fuzz of 0.0000
| line: "ur prce viagra cialis special offer"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "cialis" with fuzz of 0.0000
| line: "lgg cialis special offer"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "viagra" with fuzz of 0.0000
| line: "ur prce viagra cialis special offer"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "viagra" with fuzz of 0.1667
| line: "l ls lo x vagra loo mg lo x cals omg"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "viagra" with fuzz of 0.0000
| line: " viagra hot offer"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" generates enough hits (5), skipping further scansets...
| 2009-04-24 22:30:08 [9756] Message is spam, score = 10.500
| 2009-04-24 22:30:08 [9756] Adding Hash to "/home/stefan/.fuzzyocr/FuzzyOcr.hashdb"
| 2009-04-24 22:30:08 [9756] Words found:
| "cialis" in 2 lines
| "viagra" in 3 lines
| (7.5 word occurrences found)
`----

Greets
Stefan
-- 
,----------------------------------------------------------------------------.
| Stefan Lütje               | "The future will be better tomorrow."         |
| stefan.lue...@t-online.de  | George W. Bush                                |
`----Key fingerprint = BCB2 48E4 9211 C975 5A3F B192 9B6E CCCF 99CC 44FA-----'
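
To make the numbers in the fuzzyocr.log excerpt easier to read, here is a rough Python sketch of how such values could arise. It is not FuzzyOcr's code (the plugin is written in Perl and does its own approximate matching). The sketch assumes that "fuzz" is the edit distance between a wordlist entry and an OCR token, divided by the length of the wordlist entry, and that the OCR score is a base score plus an increment per word occurrence above a required minimum. The function names and the values `base=5.0`, `add=1.0`, `required=2.0` are hypothetical, chosen only because they reproduce the logged fuzz of 0.1667 and score of 10.500. Check your own FuzzyOcr.cf for the real settings.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,                 # delete ca
                           cur[j - 1] + 1,              # insert cb
                           prev[j - 1] + (ca != cb)))   # substitute
        prev = cur
    return prev[-1]


def fuzz(target: str, candidate: str) -> float:
    """Assumed definition: edit distance relative to the target word length."""
    return levenshtein(target, candidate) / len(target)


def best_fuzz(target: str, line: str) -> float:
    """Lowest fuzz of target against any whitespace-separated token in line."""
    return min((fuzz(target, tok) for tok in line.split()), default=1.0)


def ocr_score(count: float, base: float = 5.0, add: float = 1.0,
              required: float = 2.0) -> float:
    """Assumed scoring: base score plus add per occurrence above required."""
    return base + (count - required) * add


if __name__ == "__main__":
    # "vagra" is one edit away from the six-letter word "viagra": 1/6.
    print(f'{best_fuzz("viagra", "l ls lo x vagra loo mg lo x cals omg"):.4f}')
    # 7.5 weighted word occurrences, as reported at the end of the log.
    print(f"{ocr_score(7.5):.3f}")
```

Under these assumptions, "cals" in the same OCR line is two edits away from "cialis" (fuzz of about 0.33), which would explain why the log reports no "cialis" hit for that line if the match threshold sits below that value.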