dar...@chaosreigns.com wrote:
On 07/20, Sharma, Ashish wrote:
Can someone suggest some better OCR plugin for Spamassassin 3.3.1 for image
spam?
It still seems strange to me that anybody has ever bothered with using OCR
to deal with image spam, when it's so easy, and for me not proble
On Thu, 21 Jul 2011 07:47:00 +0100
"Sharma, Ashish" wrote:
> Can you please outline the other techniques that you use to catch
> image spams?
We find Bayes (we have our own implementation) and RBLs (again, we have
our own) work pretty well.
Regards,
David.
http://wiki.apache.org/spamassassin/UnmaintainedCustomPlugins
"OCR scanner and image validator SA-plugin"
"OCR Plugin"
may be worth a try.. no idea how well they work
The Spamassassin wiki is so cool
On 2011-07-21 8:53, Sharma, Ashish wrote:
All,
The current function
:03 AM
To: users@spamassassin.apache.org
Subject: Re: Suggest OCR plugin on Spamassassin 3.3.1 for image spam
On 7/20/2011 9:18 PM, dar...@chaosreigns.com wrote:
> On 07/20, Sharma, Ashish wrote:
>> Can someone suggest some better OCR plugin for Spamassassin 3.3.1 for image
>> spam?
.com]
Sent: Thursday, July 21, 2011 7:50 AM
To: users@spamassassin.apache.org
Subject: Re: Suggest OCR plugin on Spamassassin 3.3.1 for image spam
On Wed, 20 Jul 2011 21:18:48 -0400
dar...@chaosreigns.com wrote:
> It still seems strange to me that anybody has ever bothered with
> using OCR t
On 7/20/2011 9:18 PM, dar...@chaosreigns.com wrote:
On 07/20, Sharma, Ashish wrote:
Can someone suggest some better OCR plugin for Spamassassin 3.3.1 for image
spam?
It still seems strange to me that anybody has ever bothered with using OCR
to deal with image spam, when it's so easy, an
On Wed, 20 Jul 2011 21:18:48 -0400
dar...@chaosreigns.com wrote:
> It still seems strange to me that anybody has ever bothered with
> using OCR to deal with image spam, when it's so easy, and for me not
> problematic, to just block all emails that might be image spam -
> those with an attached ima
On 07/20, Sharma, Ashish wrote:
> Can someone suggest some better OCR plugin for Spamassassin 3.3.1 for image
> spam?
It still seems strange to me that anybody has ever bothered with using OCR
to deal with image spam, when it's so easy, and for me not problematic, to
just block all
FuzzyOCR for my Spamassassin stack.
Lately I am not convinced with FuzzyOCR performance and the errors that I keep
getting on it.
Moreover the community support and active development on FuzzyOCR too seems to
be missing.
Can someone suggest some better OCR plugin for Spamassassin 3.3.1 for image
On Wed, May 27, 2009 23:43, decoder wrote:
> I am planning a new release, but my time schedule is tough.
super, i posted a new thread with subject "FuzzyOcr wordlist"
new words to be added for latest spams
--
http://localhost/ 100% uptime and 100% mirrored :)
LuKreme wrote:
On 24-May-2009, at 18:40, Henrik K wrote:
I don't know why users are so afraid of words like SVN. You have to
look at the project, not version numbers.
I don't have FuzzyOCR installed, and it's not because of the SVN.
First, I don't think my server can take the processing hit
alex k wrote:
If only FuzzyOCR's developer would read that ;)
Unfortunately he doesn't seem to be interested in his project anymore.
Maybe you could take care of this orphaned code.
Dear Alex,
I am reading exactly everything you write ;)
The code is not orphaned, but also not being ext
On 24-May-2009, at 18:40, Henrik K wrote:
I don't know why users are so afraid of words like SVN. You have to
look at the project, not version numbers.
I don't have FuzzyOCR installed, and it's not because of the SVN.
First, I don't think my server can take the processing hit and second
On Sun, May 24, 2009 at 08:57:28AM +0200, alex k wrote:
>
> > Looks like nothing that fuzzyOCR couldn't do, being more flexible and
> > proven
> > by time.
>
> If only FuzzyOCR's developer would read that ;)
> Unfortunately he doesn't seem to be interested in his project anymore.
> Maybe you coul
On Sun, 24 May 2009, LuKreme wrote:
On 24-May-2009, at 03:10, alex k wrote:
You forgot ocrad. Ocrad is needed by facileOCR (see "Dependencies") and as
far as I know, there is no ready-to-use binary for Windows.
You keep talking about Windows. The world is not bifurcated between windows
and
On 24-May-2009, at 03:10, alex k wrote:
You forgot ocrad. Ocrad is needed by facileOCR (see "Dependencies")
and as
far as I know, there is no ready-to-use binary for Windows.
You keep talking about Windows. The world is not bifurcated between
windows and Linux, there is Solaris, OS X, Free
alex k a écrit :
> Hi,
>
>> On Sun, May 24, 2009 at 08:57:28AM +0200, alex k wrote:
>>> It is Linux centric and I do mention that on the project side.
>>>
>>> The code part you mention is the one that kills a leftover convert
>>> process
>>> after it reached its timeout, an exception.
>>> You got t
Hi Xela,
I think there has been some misunderstanding:
In an older episode (Sunday, 24. May 2009), Henrik K wrote:
> You should mention that it's pretty Linux centric, at least code like
> "ps -o pid,cmd --ppid $$ --no-header".. why don't you use perl
> functions?
In an older episode (Sunday, 24.
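On Henrik's point about avoiding a Linux-specific ps call: one portable way to stop a leftover converter is to let the language's own process API enforce the timeout. The sketch below is in Python rather than the plugin's Perl, and it is not facileOCR's code; run_with_timeout and the stand-in command are hypothetical names used only for illustration.

```python
import subprocess
import sys


def run_with_timeout(command, timeout_seconds):
    """Run command; return its stdout bytes, or None if it ran too long.

    subprocess.run kills the direct child itself when the timeout
    expires, so no platform-specific `ps` parsing is needed.
    """
    try:
        result = subprocess.run(
            command,
            capture_output=True,
            timeout=timeout_seconds,
            check=False,
        )
    except subprocess.TimeoutExpired:
        return None
    return result.stdout


if __name__ == "__main__":
    # Stand-in for a slow image converter (assumption: any long-running command).
    slow = [sys.executable, "-c", "import time; time.sleep(5)"]
    print(run_with_timeout(slow, 0.5))
```

One caveat: this only terminates the process that was started directly. If the converter spawns helpers of its own, those would need separate handling (for example a process group), which is not shown here.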
Hi,
> On Sun, May 24, 2009 at 08:57:28AM +0200, alex k wrote:
>>
>> It is Linux centric and I do mention that on the project side.
>>
>> The code part you mention is the one that kills a leftover convert
>> process
>> after it reached its timeout, an exception.
>> You got the sources, go ahead and
On Sun, May 24, 2009 at 08:57:28AM +0200, alex k wrote:
>
> It is Linux centric and I do mention that on the project side.
>
> The code part you mention is the one that kills a leftover convert process
> after it reached its timeout, an exception.
> You got the sources, go ahead and make a windows
Hi,
> On Sat, May 23, 2009 at 12:43:15PM +0200, alex k wrote:
>> Hi,
>> It seems that image spam is back. So I wrote a new OCR plugin for
>> spamassassin, which uses convert and ocrad to extract text.
>> For details and download see:
>>
>> http://spielwies
On Sat, May 23, 2009 at 12:43:15PM +0200, alex k wrote:
> Hi,
> It seems that image spam is back. So I wrote a new OCR plugin for
> spamassassin, which uses convert and ocrad to extract text.
> For details and download see:
>
> http://spielwiese.la-evento.com/facileOCR/
>
Hi,
> On 23.05.09 12:43, alex k wrote:
>> It seems that image spam is back. So I wrote a new OCR plugin for
>> spamassassin, which uses convert and ocrad to extract text.
>> For details and download see:
>>
>> http://spielwiese.la-evento.com/facileOCR/
>>
On 23.05.09 12:43, alex k wrote:
> It seems that image spam is back. So I wrote a new OCR plugin for
> spamassassin, which uses convert and ocrad to extract text.
> For details and download see:
>
> http://spielwiese.la-evento.com/facileOCR/
>
> We use this plugin on our
In an older episode (Saturday, 23. May 2009), alex k wrote:
> Hi,
> It seems that image spam is back. So I wrote a new OCR plugin for
> spamassassin, which uses convert and ocrad to extract text.
Thank you. It works out of the box (after installing ocrad) here on
Ubuntu 8.04.2 linu
Hi,
It seems that image spam is back. So I wrote a new OCR plugin for
spamassassin, which uses convert and ocrad to extract text.
For details and download see:
http://spielwiese.la-evento.com/facileOCR/
We use this plugin on our servers. It kicks out every image-spam, that
made it through the
Theo Van Dinter wrote:
On Fri, May 02, 2008 at 09:12:12PM +0200, decoder wrote:
Also, the SA plugin architecture is not designed to modify the message
in any way, so you cannot push back the text into the normal processing
line.
Really? Who says? I made very specific modifications i
On Fri, May 02, 2008 at 09:12:12PM +0200, decoder wrote:
> Also, the SA plugin architecture is not designed to modify the message
> in any way, so you cannot push back the text into the normal processing
> line.
Really? Who says? I made very specific modifications in 3.2 to allow for
just that
Matus UHLAR - fantomas wrote:
does it push the extracted text back to SA so it could be used by e.g.
bayes? This is how it imho should be used.
(and imho the same for .pdf and/or .doc - extract text _and_ images from
it, call OCR for images...)
That is a question that was very frequently ask
> >>> Am I right to say that picture spam has dropped dramatically since the
> >>> last months?
On 02.05.08 11:38, Joseph Brennan wrote:
> Right. There's close to none now. Spam techniques come and go.
does it push the extracted text back to SA so it could be used by e.g.
bayes? This is how it
> Am I right to say that picture spam has dropped dramatically since the
> last months?
Right. There's close to none now. Spam techniques come and go.
Joseph Brennan
Columbia University IT
On Fri, May 02, 2008 at 06:06:05PM +0300, Henrik K wrote:
> On Fri, May 02, 2008 at 03:38:41PM +0200, polloxx wrote:
> > Hi,
> >
> > Am I right to say that picture spam has dropped dramatically since the
> > last months?
>
> Has there been any in a year? That's when I dropped using it.
>
It's p
On Fri, May 02, 2008 at 03:38:41PM +0200, polloxx wrote:
> Hi,
>
> Am I right to say that picture spam has dropped dramatically since the
> last months?
Has there been any in a year? That's when I dropped using it.
ugin? I see the latest FuzzyOcr
> version is
> not SA 3.2.x compatible. Are there more recent product compatible to 3.2.x?
> Are you guys still running an ocr plugin on production servers?
>
> Thanks for your answers,
> P.
>
Hi,
Am I right to say that picture spam has dropped dramatically since the
last months?
Is it still reasonable to run an OCR plugin? I see the latest FuzzyOcr
version is
not SA 3.2.x compatible. Are there more recent product compatible to 3.2.x?
Are you guys still running an ocr plugin on
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Mike Pepe wrote:
> decoder wrote:
>
>> Which OCR plugin are you using there? If it is the original
>> OcrPlugin, then you might try FuzzyOcr instead. The original
>> OcrPlugin was more proof-of-concept, and will cause you lots
decoder wrote:
Which OCR plugin are you using there? If it is the original OcrPlugin,
then you might try FuzzyOcr instead. The original OcrPlugin was more
proof-of-concept, and will cause you lots of headaches with the
current image spam...
I did upgrade to FuzzyOCR after I read your message
re_specific.cf local.cf
> WebRedirect.cf 70_sare_spoof.cf Ocr.cf
> WebRedirect.pm 70_sare_stocks.cf Ocr.pm 70_sare_uri0.cf
> RulesDuJour
>
Which OCR plugin are you using there? If it is the original OcrPlugin,
then you might try FuzzyOcr instead. The original OcrPlugin was more
proof-of-con
Hey guys,
Running SA 3.1.1, on Fedora Core 3, with Perl 5.8.5
I installed gocr and imagemagick packages, copied the Ocr.pm and cf
files into /etc/mail/spamassassin
The tests don't seem to run, the pump 'n dump GIFs are still arriving
and I don't see that the test is being run in the headers.
decoder wrote:
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
decoder wrote:
Hello there,
I have improved the original OcrPlugin (found at
http://wiki.apache.org/spamassassin/OcrPlugin), so it contains
fuzzy matching. Like that, mistakes made by the OCR recognition or
intentional obfuscation
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
decoder wrote:
> Hello there,
>
> I have improved the original OcrPlugin (found at
> http://wiki.apache.org/spamassassin/OcrPlugin), so it contains
> fuzzy matching. Like that, mistakes made by the OCR recognition or
> intentional obfuscations in the t
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
decoder wrote:
> Hello there,
>
> I have improved the original OcrPlugin (found at
> http://wiki.apache.org/spamassassin/OcrPlugin), so it contains
> fuzzy matching. Like that, mistakes made by the OCR recognition or
> intentional obfuscations in the t
On Thu, Aug 10, 2006 at 10:55:30AM -0700, Dave . wrote:
> foreach my $p ( $pms->{msg}->find_parts("image") ) {
>Does this mean the message must have the text "image" and/or "image/gif"
>within the body? Many of the "penny stock" spam gifs I get appear as follows:
Generally speaking, RTM (Mail::S
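On Dave's question: as far as the quoted snippet shows, find_parts("image") selects MIME parts by their declared content type; it does not search the body for the word "image". The same idea can be sketched with Python's standard email package instead of SpamAssassin's Perl API (image_parts is a hypothetical helper name, not part of any plugin):

```python
from email.message import EmailMessage


def image_parts(msg):
    """Return the MIME parts whose declared main type is 'image'."""
    return [part for part in msg.walk()
            if part.get_content_maintype() == "image"]


if __name__ == "__main__":
    msg = EmailMessage()
    # The body text mentioning "image" is irrelevant to the selection.
    msg.set_content("This body mentions the word image but is plain text.")
    msg.add_attachment(b"GIF89a", maintype="image", subtype="gif",
                       filename="stock.gif")
    for part in image_parts(msg):
        print(part.get_content_type(), part.get_filename())
```

Only the attachment is returned here, because selection is driven by the part's Content-Type header and not by anything in the text.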
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Dave . wrote:
> Give them code from Ocr.pm:
>
> --- foreach my $p (
> $pms->{msg}->find_parts("image") ) { my ( $ctype, $boundary,
> $charset, $name ) = Mail::SpamAssassin::Util::parse_content_type(
> $p->get_header('content-type') ); i
Give them code from Ocr.pm:
---
foreach my $p ( $pms->{msg}->find_parts("image") ) {
    my ( $ctype, $boundary, $charset, $name ) =
        Mail::SpamAssassin::Util::parse_content_type( $p->get_header('content-type') );
    if ( $ctype eq "image/gif" ) {
        open OCR, "
On Tue, Aug 08, 2006 at 12:43:24AM +0200, decoder wrote:
>
> You can find a full description and an example in the wiki under:
>
> http://wiki.apache.org/spamassassin/FuzzyOcrPlugin
>
>
> Ideas for improvements or critics are always welcome :)
>
>
Hi,
First, thanks for working on such a gr
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Bill Landry wrote:
> - Original Message - From: "Spamassassin List"
> <[EMAIL PROTECTED]>
> To:
> Sent: Wednesday, August 09, 2006 2:26 PM
> Subject: Re: Improved OCR Plugin with approximate matchin
> > yum install libungif* will get both libungif and libungif-progs (which
> > contains giffix)
I'm using Debian (Sarge), and I think libungif-bin is the better package here.
giflib-bin also wants to install the packages libx11-6, xfree86-common and
xlibs-data, which means 10MB more than inst
> -Original Message-
> From: decoder [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, August 09, 2006 5:31 PM
> To: Spamassassin List; users@spamassassin.apache.org
> Subject: Re: Improved OCR Plugin with approximate matching
>
>
[snip]
>
> According to google, lib
- Original Message -
From: "Spamassassin List" <[EMAIL PROTECTED]>
To:
Sent: Wednesday, August 09, 2006 2:26 PM
Subject: Re: Improved OCR Plugin with approximate matching
Spamassassin List wrote:
decoder wrote:
See http://wiki.apache.org/spamassassin/FuzzyOcrPlugi
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Spamassassin List wrote:
>> Spamassassin List wrote:
> decoder wrote:
>
> See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin
>
> Major changes: Replaced imagemagick with netpbm, support
> png, invoked giffix for broken gifs,
Spamassassin List wrote:
decoder wrote:
See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin
Major changes: Replaced imagemagick with netpbm, support png,
invoked giffix for broken gifs, detect image format with magic
bytes and not by content-type, added various configuration
options.
I ins
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Spamassassin List wrote:
>>> decoder wrote:
>>>
>>> See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin
>>>
>>> Major changes: Replaced imagemagick with netpbm, support png,
>>> invoked giffix for broken gifs, detect image format with magic
>>> byte
decoder wrote:
See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin
Major changes: Replaced imagemagick with netpbm, support png, invoked
giffix for broken gifs, detect image format with magic bytes and not
by content-type, added various configuration options.
I install the above plugin, and
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Expertsites, Inc. wrote:
>> decoder wrote:
>>
>> See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin
>>
>> Major changes: Replaced imagemagick with netpbm, support png, invoked
>> giffix for broken gifs, detect image format with magic bytes and not
decoder wrote:
See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin
Major changes: Replaced imagemagick with netpbm, support png, invoked
giffix for broken gifs, detect image format with magic bytes and not
by content-type, added various configuration options.
Feedback is welcome :)
Chris
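To illustrate the "magic bytes and not by content-type" change mentioned above: an image's real format can be read from the first few bytes of the file, whatever the sender claims in the headers. This is a minimal Python sketch of that idea, not FuzzyOcr's actual Perl code; sniff_image_format and MAGIC_SIGNATURES are hypothetical names.

```python
# Leading byte signatures for the formats discussed in the thread.
MAGIC_SIGNATURES = (
    (b"GIF87a", "gif"),
    (b"GIF89a", "gif"),
    (b"\x89PNG\r\n\x1a\n", "png"),
    (b"\xff\xd8\xff", "jpeg"),
)


def sniff_image_format(data: bytes):
    """Return 'gif', 'png', 'jpeg', or None when no signature matches."""
    for signature, name in MAGIC_SIGNATURES:
        if data.startswith(signature):
            return name
    return None


if __name__ == "__main__":
    # A part declared as image/jpeg that actually starts with a GIF header.
    print(sniff_image_format(b"GIF89a\x01\x00\x01\x00"))
```

Checking the bytes rather than the header means a GIF mislabelled as another type is still handed to the right converter.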
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
decoder wrote:
> Hello there,
>
> I have improved the original OcrPlugin (found at
> http://wiki.apache.org/spamassassin/OcrPlugin), so it contains
> fuzzy matching. Like that, mistakes made by the OCR recognition or
> intentional obfuscations in the t
On Tue, 8 Aug 2006, decoder wrote:
> I only wanted to add a small note: I recently saw gifs that cannot be
> converted using imagemagick because they are either sloppily generated
> or intentionally partly corrupted. Please think about using giftopnm
> and jpegtopnm instead. If you have a better id
Perhaps corrupted gifs should be treated as spam?
decoder wrote:
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hello again,
I only wanted to add a small note: I recently saw gifs that cannot be
converted using imagemagick because they are either sloppily generated
or intentionally partly
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Matthias Keller wrote:
> decoder wrote:
>> -BEGIN PGP SIGNED MESSAGE- Hash: SHA1
>>
>> Hello there,
>>
>> I have improved the original OcrPlugin (found at
>> http://wiki.apache.org/spamassassin/OcrPlugin), so it contains
>> fuzzy matching. Like
decoder wrote:
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hello there,
I have improved the original OcrPlugin (found at
http://wiki.apache.org/spamassassin/OcrPlugin), so it contains fuzzy
matching. Like that, mistakes made by the OCR recognition or
intentional obfuscations in the text don't
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hello again,
I only wanted to add a small note: I recently saw gifs that cannot be
converted using imagemagick because they are either sloppily generated
or intentionally partly corrupted. Please think about using giftopnm
and jpegtopnm instead. If yo
From: "uNiXpSyChO" <[EMAIL PROTECTED]>
decoder wrote:
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hello there,
I have improved the original OcrPlugin (found at
http://wiki.apache.org/spamassassin/OcrPlugin), so it contains fuzzy
matching. Like that, mistakes made by the OCR recognition or
i
seems to work... but i never see a score about 1.00.
the docs say the default score is 4. did i miss something?
above 1.00 i meant.
decoder wrote:
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hello there,
I have improved the original OcrPlugin (found at
http://wiki.apache.org/spamassassin/OcrPlugin), so it contains fuzzy
matching. Like that, mistakes made by the OCR recognition or
intentional obfuscations in the text don't
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hello there,
I have improved the original OcrPlugin (found at
http://wiki.apache.org/spamassassin/OcrPlugin), so it contains fuzzy
matching. Like that, mistakes made by the OCR recognition or
intentional obfuscations in the text don't make the recogni
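As a rough illustration of what "fuzzy matching" buys here: if a word from the spam wordlist is compared against each OCR token by edit distance, a misread or deliberately altered character no longer hides the word. The Python sketch below shows the general technique only; it is not the plugin's implementation, and edit_distance, fuzzy_hits and the max_errors threshold are hypothetical names and values.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum single-character edits turning a into b."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def fuzzy_hits(ocr_text, wordlist, max_errors=1):
    """Return wordlist entries within max_errors edits of some OCR token."""
    tokens = ocr_text.lower().split()
    return [word for word in wordlist
            if any(edit_distance(word, token) <= max_errors
                   for token in tokens)]


if __name__ == "__main__":
    # "st0ck" is one substitution away from "stock".
    print(fuzzy_hits("Buy this st0ck now", ["stock", "pharmacy"]))
```

The threshold is a trade-off: a larger max_errors catches heavier obfuscation but also starts matching unrelated short words.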
On Sat, 2006-08-05 at 14:38 -0700, Gary Funck wrote:
> Bill Randle wrote:
> > I don't have the OCR plugin installed, but am using the recently
> > posted ImageInfo plugin. This is what I get on spam-gif-1.txt:
> >
> > X-Spam-Status: Yes, score=20.6 r
Bill Randle wrote:
> I don't have the OCR plugin installed, but am using the recently
> posted ImageInfo plugin. This is what I get on spam-gif-1.txt:
>
> X-Spam-Status: Yes, score=20.6 required=5.0
> tests=BAYES_99,DC_GIF_MULTI_LARGO,EXTRA_MPART_TYPE,FORGED_RCVD_HELO,
eshold, but the second misses
> by a full point.
>
> Am wondering how these fare on others' set up, and what you've
> done so that these messages are detected as spam?
>
>
>
> configuration:
> SA 3.1.4
> gocr 0.40
> perl-Image-ExifTool 6.00
>
others' set up, and what you've
done so that these messages are detected as spam?
configuration:
SA 3.1.4
gocr 0.40
perl-Image-ExifTool 6.00
MIMEdefang 2.56
OCR plugin per Davin's 2006-08-02 message
(note: I didn't apply patches to gocr or perl-Image-ExifTool,
Hello there,
since I read it in the todo of the OCR Plugin that the words are still
hardcoded and since I wanted to contribute something too, I modified the
plugin so the words can be configured in the cf file...
I have attached both the new pm file and the cf file.
If you like it, please put
71 matches