I'm hoping someone can help me with mass-check, or more specifically with
hit-frequencies.

I've installed Cygwin on my W/XP-H box. Within Cygwin I've installed SA,
not to use for mail filtering (that happens on my servers), but
specifically for mass-check.

Directory structure:
C:\cygwin
     /home
        /owner
           /Mail-SpamAssassin-2.60
             /blib
             /build
             /contrib
             /lib
             /old
             /qmail
             /rules
             /spamd
             /sql
             /t
             /tools
             /masses
                /corpora
                /graphs
                /rule-qa
                /tenpass
                /tmp
                /corpus.ham
                /corpus.spam
                /results

In the masses directory, along with all of the SA-provided code, I have
created my own masscheck.sh shell script, which contains the commands:

mv    $testfile            spamassassin/user_prefs
echo                     >>spamassassin/user_prefs
echo "use_bayes       0" >>spamassassin/user_prefs
echo "auto_learn      0" >>spamassassin/user_prefs
echo "required_hits   9" >>spamassassin/user_prefs
echo "skip_rbl_checks 1" >>spamassassin/user_prefs
rm -f ham.log spam.log

perl ./mass-check -c ./spamassassin -j 1 --loghits --mid --mbox ./corpus.ham/*  
>ham.log
perl ./mass-check -c ./spamassassin -j 1 --loghits --mid --mbox ./corpus.spam/* 
>spam.log
perl ./hit-frequencies -x -c /home/Owner/sa-masses/spamassassin

The first set is based on trial and error. The only way I was able to get
mass-check to focus on a specific ruleset was to put that ruleset into a
user_prefs file in a masses/spamassassin directory. I create that
user_prefs file from the ruleset, turn off Bayes and network processing,
and set my required hits level.

I then run mass-check against these rules and against my ham, and then
against my spam, creating masses/ham.log and masses/spam.log -- this
works fine, and I'm able to do my own analysis against these logs.

However, I've so far been unable to get hit-frequencies to reference the
ruleset.

When I run hit-frequencies without the -c parameter, it defaults to
masses/../rules and pulls in the entire distribution ruleset, with every
rule hitting zero, since those rules weren't part of mass-check.

When I run hit-frequencies with any -c parameter I've tried, I get the
header line
> OVERALL     SPAM      HAM     S/O   SCORE  NAME
>   81383    65609    15774    0.806   0.00    0.00  (all messages)
(the counts are right -- my corpus today is 65609s/15774h), and no
rule-specific detail.

What do I need to do to have hit-frequencies report on the stats for my
specific rules?

Many thanks, and a Happy New Year to all!

Bob Menschel





-------------------------------------------------------
This SF.net email is sponsored by: IBM Linux Tutorials.
Become an expert in LINUX or just sharpen your skills.  Sign up for IBM's
Free Linux Tutorials.  Learn everything from the bash shell to sys admin.
Click now! http://ads.osdn.com/?ad_id=1278&alloc_id=3371&op=click
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to