On Wed, 30 Mar 2011 16:51:57 +0200 Marcin Mirosław <mar...@mejor.pl> wrote:
> I'm using postgresql, but machine isn't quick... Any db is slowly
> there.

Using Pg for Bayes data will be really slow. We don't use the
SpamAssassin Bayes implementation and we went through three iterations
of storage back-ends before finding one we liked.

1) PostgreSQL: Convenient but slow.

2) Berkeley DB: Faster than PostgreSQL, but still slow and occasionally
flaky.

3) CDB: Very fast, but cannot be incrementally updated. You need to
rebuild the entire DB and then atomically rename it.

In our implementation, it's not a problem to have a read-only DB, so we
went with CDB. It's dramatically faster than Berkeley DB:

http://www.dmo.ca/blog/benchmarking-hash-databases-on-large-data/

Regards,

David.
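
P.S. For anyone unfamiliar with the rebuild-and-rename approach used for
CDB, here is a minimal sketch in Python. It is only an illustration, not
our actual code: the function names (rebuild_atomically, load) are
hypothetical, and a tab-separated text file stands in for a real CDB
writer so that the example needs nothing outside the standard library.
The point is the pattern: write a complete new file next to the live one,
then swap it into place with a single rename, so readers always see either
the old file or the new one and never a partial write.

```python
import os
import tempfile


def rebuild_atomically(path, records):
    """Write a complete new file beside `path`, then rename it into place.

    A tab-separated text file is used here as a stand-in for a CDB writer.
    """
    directory = os.path.dirname(os.path.abspath(path))
    # The temp file must live on the same filesystem as the target,
    # otherwise the final rename cannot be atomic.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".rebuild-")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as tmp:
            for key, value in sorted(records.items()):
                tmp.write(f"{key}\t{value}\n")
            # Push the data to disk before the new file becomes visible.
            tmp.flush()
            os.fsync(tmp.fileno())
        # Replaces any existing file at `path` in one step.
        os.replace(tmp_path, path)
    except BaseException:
        # Clean up the partial file if anything failed before the swap.
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise


def load(path):
    """Read the stand-in file back into a dict of strings."""
    result = {}
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            key, value = line.rstrip("\n").split("\t", 1)
            result[key] = value
    return result
```

A process that already has the old file open keeps reading the old data
until it reopens the path, which is why this only suits a DB that is
read-only between rebuilds.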