On 23.05.09 12:43, alex k wrote:
> It seems that image spam is back. So I wrote a new OCR plugin for
> spamassassin, which uses convert and ocrad to extract text.
> For details and download see:
> 
> http://spielwiese.la-evento.com/facileOCR/
> 
> We use this plugin on our servers. It kicks out every image-spam, that
> made it through the other filters and produces not a single false
> positive.

hmmm, last two images I've checked were much nicer read by gocr, just FYI.

another question I've raised some time ago was the possibility of pushing
read text to spamassassin so it could be detected by other checks, e.g.
spamassassin and optionally uribl's...
The answer was gocr is not reliable enough for doing this stuff, but I hope
it's worth trying...
-- 
Matus UHLAR - fantomas, uh...@fantomas.sk ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu.
"To Boot or not to Boot, that's the question." [WD1270 Caviar]

Reply via email to