On Wed, Aug 27, 2003 at 09:43:50PM -0700, Bob Dickinson (BSL) wrote: > We've read the readme's and looked at the code, but can't quite piece it > together. We tried a variety of things, including running
Heh... > rewrite-cf-with-new-scores, then another runGA with a command line argument > to trigger the [GA Validation Results] part, but none of the results seem > correct (the recall number in STATISTICS.txt is much lower than the one that > the GA spit out, among other things). It's sort of complicated, which is why I'm hoping we work on score generation for 2.70. First, the GA doesn't output scores for every rule. I forget the ones it skips, but I have script I use to "normalize" the scores to have the full list for the release. You then run rewrite-cf... to generate the new scores, and you put that in ../rules/50_scores.cf. Then you run "runGA" with some argument (doesn't matter), and it'll generate the STATISTICS and such. (note: that's based on the unseen validation logs, so it's likely to get slightly worse performance than the direct GA output). -- Randomly Generated Tagline: A penny saved is 2.5 grams of zinc alloy.
pgp00000.pgp
Description: PGP signature