On 2020-12-09 4:41 am, @lbutlr wrote: 

> On 08 Dec 2020, at 13:54, micah anderson <mi...@riseup.net> wrote:
> 
>> Kris Deugau <kdeu...@vianet.ca> writes:
> There will only be one database and set of tables, but one of the fields in 
> each table is the user identifier. Fair warning - if you go full per-user on 
> a large system, this will MASSIVELY balloon the size of your Bayes database, 
> and most users will idle below the learning thresholds for quite a long time.

> Can you give an idea of the size calculation? I'm wanting to do this, but I 
> need to figure out how much space I need to allocate per user!

That would be pretty hard to predict as it would vary a lot based on the
users and the mail.

I don't think Bayes is really that big (a few MB max?)

It's not big. Here's my personal spamassassin database (just a few
users, but SA has been running for years and years ... About 48MB 

> mysql> SELECT TABLE_NAME AS `Table`, ROUND((DATA_LENGTH + INDEX_LENGTH) / 
> 1024 ) AS `Size (KB)` FROM information_schema.TABLES WHERE TABLE_SCHEMA = 
> "spamassassin" ORDER BY (DATA_LENGTH + INDEX_LENGTH) DESC;
> +-------------------+-----------+
> | Table | Size (KB) |
> +-------------------+-----------+
> | bayes_token | 48160 |
> | awl | 1040 |
> | bayes_vars | 32 |
> | bayes_seen | 16 |
> | bayes_global_vars | 16 |
> | bayes_expire | 16 |
> +-------------------+-----------+
> 6 rows in set (0.00 sec)
 

Reply via email to