Please keep list mail on list!
if you run parallel sa-learn instances you'll run into locked DB errors.
With a SDBM backend it would be a bit faster but still lock up.
afaik, Redis backend won't have locking issues.
(dunno about SQL - I use Redis)
On 4/15/21 2:45 PM, Christian Völker wrote:
Hi,
well, here it is not I/O bound (running on RAID1-SSDs). I am using the
"default" file based backend ~/.spamassassin/bayes*.
40msg/sec is not really fast enough for me. The number of messages to be
processed is really huge.
So again asking: is it possible with the file-based dbackend to do this
stuff in parallel?
Thanks
/Christian
Am 15.04.2021 um 14:38 schrieb Axb:
Depending on your Bayes backend, your bottleneck will not be the CPUs
but I/O.
Normally there's no need for running multiple sa-learn instances.
My sa-learn is learning +40 msgs/sec from a SSD into a Redis DB.
On 4/15/21 2:33 PM, Christian Völker wrote:
Hi all,
I am going to add some large spam archives for my Bayes database with
sa-learn.
I have a machine with six vCPUs and obviously I would like to speed
up the learning process. I am thinking of running six sa-learn
processes in parallel. Is there any issue with this like locks for
the database?
Or is sa-learn itself multithreaded and I do not need to run it in
parallel (does not look so)?
Next, when running the above in parallel (if possible) should I use
the "--no-sync" and do the syncing afterwards? But again, this is
then only single-threaded, right?
Thanks a lot for your input!
/Christian