RE: [Dbmail] 2.0 and header-cache

Mark Mackay - Orcon Sat, 6 Dec 2003 00:56:22 +0100 (CET)

> A few questions,
> - What should the table structure be?
> - What about automatic rebuilding the table cache, if you get a 
> query for a new header?


If you change the syntax, then as Jesse pointed out, dbmail-maintenance
would be a logical place. Failing that, based on the lifetime of most email
-- you could simply add another header, and for the first month or so the
header would only show up for newly delivered email. 

> - What about this table structure
> 
> CREATE TABLE cached_header_types (
>       header_idnr int(??) NOT NULL AUTO_INCREMENT,
>       header_name text,
>       PRIMARY KEY (header_idnr),
>       INDEX (header_name)
>       ) TYPE=InnoDB;
> This table holds the headers types we cache,
> 
> CREATE TABLE header_cache (
>       header_cache_idnr bigint(21) NOT NULL AUTO_INCREMENT,
>       header_type int(??),
>       header_val text,
>       message_id bigint(21),
>       UNIQUE(message_id, header_type),
>       FOREIGN KEY (`message_id`)
>               REFERENCES `messages` (`message_idnr`)
>               ON DELETE CASCADE,
>       ) TYPE=InnoDB;
> 
> This way, we have a good index on the user/header combination,
> This might not be the best approach,
> I beleive the typical query is
>       Get headers A,B and C from mailbox x.
> This would reguire something like
> SELECT header_type, header_val FROM (header_cache, messages)
>       WHERE header_cache.message_id=messages.message_idnr AND
>               messages.mailbox_idnr=<XXX> AND
>               (header_cache.header_type=head_id1 OR
>               header_cache.header_type=head_id2 OR ...
>               header_cache.header_type=head_idN)
> 
> I beleive we would get a quite good index and quite fast replies.

I'm not suggesting that a header cache is built up for every header, or for
use in IMAP optimisations, etc (although others may want this). The issue I
see with the above model is the one-to-many relationship. Typically in a
webmail client you want to display a single summary row of info for each
message (usually pulling stuff out of the messages table too, like
message_idnr, etc. Thus a one-to-one join works well. My complex SQL is a
bit rough so maybe there is a way to do this elegantly. 

A single sql transaction seems the best way for my purposes. "select
messages.*, messageheaders.* from messages, messageheaders where
messages.message_idnr=messageheaders.message_idnr order by <sorting header>
"

If someone can point out an easy way to get the same output from the above
structure, then fine by me. Guessing the above model has merit for it's easy
customization of header types.

One other note -- I don't think everyone wants the header cache. A lot of
people are using IMAP webmail clients, and unless dbmail itself can benefit
from such a cache, I'd suggest it's a compile-time option. No processing
overhead if you don't need it, and typically you're gonna choose the list of
headers you want once or twice in the lifetime of a dbmail setup. If people
want more headers, it should only affect DB size and processing time at
delivery/rebuild. Selects from I should still be fast and only the desired
headers can be returned. 

/Mark

RE: [Dbmail] 2.0 and header-cache

Reply via email to