Re: Add UTF8 support in PostgreSQL lookup table interface

Wietse Venema Fri, 24 Aug 2018 13:55:28 -0700

Viktor Dukhovni:
> Yes, but that'd have to be done by the dictionary lookup layer,
> possibly in proxymap, based on a suitable signal from the lookup
> client, but the low-level API (dict_get()) does not presently
> support any per-lookup flags.  So we'd need dict_get_ex() that
> takes a new utf8 flag and supporting changes throughout the
> code.  This is a major change.
> 
> Perhaps there's a clever way to avoid this, but I am not seeing it yet.


I don't think that it is a good idea to signal the query string
encoding through the Postfix dict(3) API. Primarily because there
is no legitimate use for non-UTF8 in envelopes and headers, so why
bother adding complexity now for a dying cause?

Relevant for the $Subject: in UTF8 mode, the Postfix dict API will
never look up a non-UTF-8 string, because it cannot exist, and it
will never return a non-UTF-8 lookup result, for the same reasons.
If non-UTF8 content were allowed in UTF-8 mode, then every string
would be a potential bomb ready to explode.

Even more relevant for the $Subject: several Postfix lookup tables
that already block non-UTF8 queries when smtputf8 mode is turned
off (dict_ldap.c, dict_sqlite.c):

    /*
     * Don't frustrate future attempts to make Postfix UTF-8 transparent.
     */
    if ((dict->flags & DICT_FLAG_UTF8_ACTIVE) == 0
        && !valid_utf8_string(name, strlen(name))) {
        if (msg_verbose)
            msg_info("%s: %s: Skipping lookup of non-UTF-8 key '%s'",
                     myname, dict_ldap->parser->name, name);
        return (0);
    }

This code has been there for four years already. Never heard a peep.

I think that we should by default extend this to all lookup tables
and do away with that LATIN1 hack. That would also address the
problem that underlies the original request. No need for signaling
the request encoding!

        Wietse

Re: Add UTF8 support in PostgreSQL lookup table interface

Reply via email to