On Sat, 2 Feb 2013, Eliezer Croitoru wrote:

Yes I do understand that it's hard.
I worked a bit with perl so I might be able to write something that will do that if dosn't exists already.

That's probably what it will take.

I will try to explain even more.
The problem is that I get the mail with an example of the SPAM content which didn't came from EMAIL and just to categorize it as SPAM. This is not how and for what SA was built for but it gives very good results in general.
This is a specific case.

Ah, I think I see; by "this is a form" you meant your need is for scanning content submitted via a web form to see if it is spammy?

I have an active system which someone wrote in C# that scans the chars etc but the problem is that it's in C# and it's an active check that crawls the site and parsing it rather then a restful system that triggers the checks when needed.

This is an example of the content:
http://www.fpaste.org/yFOC/

It can be even some CMS post that someone got and he want's to categorize as spam.

So that sample message is largely hacked up just to provide headers so that it looks like an email and SA can scan it? That sure doesn't look like a valid email and there are a lot of obvious spam signs in the headers.

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  Users mistake widespread adoption of Microsoft Office for the
  development of a document format standard.
-----------------------------------------------------------------------
 10 days until Abraham Lincoln's and Charles Darwin's 204th Birthdays

Reply via email to