Hi,

dbg: bayes: error inserting token for line: t 1 0 1308114254 4fd2b3f2f0
dbg: bayes: _put_token: Updated an unexpected number of rows.
[repeats ...]

Which version of MySQL?

Did you remember to replace TYPE=MyISAM with TYPE=InnoDB in the
schema (according to README.bayes) if you are using the recommended
Mail::SpamAssassin::BayesStore::MySQL as the bayes_store_module?

Please try the following patch (against 3.3.2); at the very least it should
provide more informative diagnostics:

[snip..]

I faced the same problem today. In my case, MySQL was configured to
use utf8 by default:

   # my.cnf

   [client]
   default-character-set=utf8

   [mysqld]
   character-set-server=utf8
   collation-server=utf8_unicode_ci
   init_connect='set collation_connection = utf8_unicode_ci;'

After commenting out the utf8 definitions and reverting to latin1,
"sa-learn --restore" worked fine.
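
To check a config for the same kind of settings, here is a minimal Python
sketch. It assumes the file is plain INI-style like the fragment above;
the helper name utf8_settings is made up for illustration. A real my.cnf
that contains "!include" or "!includedir" directives would need those
lines stripped first, since configparser does not understand them.

```python
import configparser

# The my.cnf fragment quoted above, inlined so the sketch is self-contained.
MY_CNF = """\
[client]
default-character-set=utf8

[mysqld]
character-set-server=utf8
collation-server=utf8_unicode_ci
init_connect='set collation_connection = utf8_unicode_ci;'
"""


def utf8_settings(text):
    """Return (section, option, value) for every option whose value mentions utf8."""
    parser = configparser.ConfigParser(
        interpolation=None,    # keep '%' characters in values literal
        allow_no_value=True,   # my.cnf allows bare flags such as 'skip-networking'
        strict=False,          # tolerate repeated options
    )
    parser.read_string(text)
    hits = []
    for section in parser.sections():
        for option, value in parser.items(section):
            if value and "utf8" in value.lower():
                hits.append((section, option, value))
    return hits


for section, option, value in utf8_settings(MY_CNF):
    print(f"[{section}] {option} = {value}")
```

Each line it prints is a candidate to comment out when testing whether
the character set is what breaks "sa-learn --restore".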

I'm using this MySQL version:

# mysql --version
mysql Ver 14.14 Distrib 5.1.56, for redhat-linux-gnu (x86_64) using readline 5.1

Is there a difference between InnoDB and MyISAM in terms of training, or
can that change be made after the initial training? Why is training so
much slower with InnoDB?

It looks like that may be my problem too. This is the result with your patch:

dbg: bayes: database connection established
dbg: bayes: found bayes db version 3
dbg: bayes: Using userid: 2
dbg: bayes: database connection established
dbg: bayes: found bayes db version 3
dbg: bayes: using userid: 3
dbg: bayes: _put_token: Updated an unexpected number of rows: 3, id: 3, token: 7�OR�
dbg: bayes: error inserting token for line: t 0 1 1308332646 37fc4f52eb
dbg: bayes: _put_token: Updated an unexpected number of rows: 3, id: 3, token: ����Y
dbg: bayes: error inserting token for line: t 0 2 1308070890 d2eec4f659
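
The token shown in each "_put_token" line appears to be the raw 5-byte
form of the hex value at the end of the following "error inserting token"
line. The Python sketch below decodes the two hex values from this log
and checks whether the bytes are valid UTF-8. It only illustrates why a
utf8 connection or column character set could mangle such tokens; it does
not reproduce what MySQL does internally, and the helper name
is_valid_utf8 is made up for illustration.

```python
# Hex token values copied from the "error inserting token" lines above.
HEX_TOKENS = ["37fc4f52eb", "d2eec4f659"]


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


for hex_token in HEX_TOKENS:
    raw = bytes.fromhex(hex_token)
    # errors="replace" substitutes U+FFFD for every undecodable byte,
    # which is what a UTF-8 terminal shows for the raw token.
    shown = raw.decode("utf-8", errors="replace")
    print(hex_token, len(raw), is_valid_utf8(raw), ascii(shown))
```

Neither token is valid UTF-8, and the lossy decoding gives the same
pattern of replacement characters as the log (one printable "7", "O", "R"
in the first token and a trailing "Y" in the second). Once distinct byte
sequences are replaced or truncated like this, they can end up comparing
equal, which would be consistent with an update touching 3 rows instead
of 1.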

I'll try the suggested my.cnf changes and restart the process.

Thanks,
Dave



















