Hello,

I have txrep data in a MySQL database and am working on a training script that runs sa-learn. With Bayes also in MySQL and a corpus of 5279 nspam and 849 nham, sa-learn takes a full 2 hours to run with txrep enabled (use_txrep 1), but only 13 minutes with txrep disabled (use_txrep 0).

One of my main gripes with the old AWL was that it didn't learn or correct when training on messages, so I love that txrep does that. Does anyone have tips for improving txrep training performance? I'm open to tweaks or improvements on my end, or even some thoughts on a logic redesign in that area.

Thanks,
-- 
Jesse Norell
Kentec Communications, Inc.
970-522-8107 - www.kci.net