NGSS wrote:
Hi,
I am losing confident in SA, the training process is pretty slow or it
doesn’t seem to be learning.
I am training SA with around 30-50 manually identified spam (moving
spam mails to and spam folder created in squirrelmail and crond the
sa-train command on that folder every hour to train and delete them).
The script is tested to be working on the shell before I put it on crond
However, I found that the learning process is either not right or it
is rather slow.
I gone through the headers of the spams and found that even almost
identical (in content) spams always got a score 0.1 and these spams
are received on separated occasions across several days. This had made
me losing confident on SA.
I wonder if had it setup correct to detect and learn spams . I am
using a default setup from qmail-toaster cnt50 , do I need more
filters to harden my defense? Any recommendations you will be
appreciated.
Here are sample samples I taken from my mailbox on this server,
(eg, sample spam 1 and 8 are almost identical in content but they are
both scored with only 0.1 … : (
http://www.keac.com/id3303/spam-egs.txt
Turn on URIBLs and Razor
Content analysis details: (11.6 points, 5.0 required)
pts rule name description
---- ----------------------
--------------------------------------------------
0.0 TO_MALFORMED To: has a malformed address
1.5 RAZOR2_CF_RANGE_E8_51_100 Razor2 gives engine 8 confidence level
above 50%
[cf: 100]
0.5 RAZOR2_CHECK Listed in Razor2 (http://razor.sf.net/)
0.5 RAZOR2_CF_RANGE_51_100 Razor2 gives confidence level above 50%
[cf: 100]
2.0 URIBL_BLACK Contains an URL listed in the URIBL blacklist
[URIs: jesecretary.com]
2.1 URIBL_WS_SURBL Contains an URL listed in the WS SURBL blocklist
[URIs: jesecretary.com]
2.9 URIBL_JP_SURBL Contains an URL listed in the JP SURBL blocklist
[URIs: jesecretary.com]
2.1 URIBL_OB_SURBL Contains an URL listed in the OB SURBL blocklist
[URIs: jesecretary.com]