Re: requests and sub-requests

André Warnier Sun, 12 Oct 2008 10:12:46 -0700

Torsten,

Many thanks for the excellent information, I will ponder that.


More below, but one more question here :
Where does $r->internal_redirect "live" (in which package) ?
I am having trouble finding it.

Torsten Foertsch wrote:

On Sun 12 Oct 2008, André Warnier wrote:
In an attempt at being clever, I put the following code in the
handler :

     unless ($r->is_initial_req) {
         if (defined $r->prev) {
             # we are in a subrequest.  Just copy user from main
request. $r->user( $r->prev->user );
         }
         # Also disable authorization phase
         $r->set_handlers(PerlAuthzHandler => undef);
         return OK;
     }
You have to distinguish between subrequests and internal redirects. Theformer result from $r->lookup_uri, $r->lookup_file or similar (thereare a few more such functions in the C API) and internal redirects thatresult from $r->internal_redirect (internal_fast_redirect() is not asthe name suggests an internal redirect but simply overrides the currentrequest). Subrequests are used for example by mod_rewrite, mod_include,mod_negotiation to look for some characteristics of a document andperhaps pull it in (run() it). Internal redirects are used in mod_cgiwhen the CGI output indicates a status 200 (HTTP_OK) but also containsa Location header. But the main usage of internal redirects is theErrorDocument.
Now, is_initial_req() checks if the current $r is the result of asubrequest or the result of a internal redirect and returns false ifso. prev() returns the parent request if the current $r is the resultof an internal redirect and main() returns the main request if thecurrent $r is a subrequest. So, your code checks only for internalredirects (ErrorDocument).
Now, have a look at httpd-2.x.y/server/request.c around line 170. You'llsee this piece of code:
    /* Skip authn/authz if the parent or prior request passed the
     * authn/authz,
     * and that configuration didn't change (this requires
     * optimized _walk()
     * functions in map_to_storage that use the same merge results given
     * identical input.)  If the config changes, we must re-auth.
     */
    if (r->main && (r->main->per_dir_config == r->per_dir_config)) {
        r->user = r->main->user;
        r->ap_auth_type = r->main->ap_auth_type;
    }
else if (r->prev && (r->prev->per_dir_config == r->per_dir_config)){
        r->user = r->prev->user;
        r->ap_auth_type = r->prev->ap_auth_type;
    }
    else {
        switch (ap_satisfies(r)) {
        case SATISFY_ALL:
        case SATISFY_NOSPEC:
            if ((access_status = ap_run_access_checker(r)) != 0) {
                return decl_die(access_status, "check access", r);
    ...

Ok, I get it.

I have a little question related to the above, but not very urgent : whythe check on the configuration change ? what can change between arequest and a sub-request (or internal redirect) ?

You see, you are not the first who had had the idea of reusing anestablished identity.

I did not think I would be.

If your subreq or internal redirect hits the same

Location or Directory container the AAA phases are completely skipped.
Maybe this is enough optimization if you shift a few directives aroundin your httpd.conf.

I don't think so, because this is a really specific authenticationmethod, for a special case.

And I don't think that Apache will skip the mod_perl AAA phases, will it ?

If not, the code above shows you how to do it. But you must ask yourselfif it really is valid to reuse the identity. I believe, you can safelyinherit the identity from $r->main or $r->prev but you must not skipthe other 2 A's. If you can't it would mean you have one realm ofidentities for the main request and another for the subreq. That, I'dsay, is a configuration error.

As a first stage of the AAA, for some Locations, there is a filtering onthe remote IP of the caller. Some IP's get an "automatic" user-id,which can vary according to the IP. In some cases, this is authoritative(no access unless you have the right IP), in some cases not (you get asecond chance). Some Locations don't have the IP filter, they alwaysget the second chance below. This IP filter is implemented as aPerlAccessHandler. This is the main reason for trying to optimise,because it is expensive : the IP of the caller must be compared toseveral ranges of IP, not necessarily matching regular subnets.

The second step is a PerlAuthenHandler, which can re-direct to a loginpage.Then there is a PerlAuthenzHandler to check if this user is allowed toaccess that resource.It also combines with SSO, with some URL rewriting, and with trying tocontrol access to a Tomcat application behind the Apache.

The back-end for the authentication is a special DB system, whose accessfor that is rather heavy, but required.On the positive side, this is for a limited range of well-knownapplications, for a limited public and for a reasonable number ofexpected transactions/s.

So I am trying to wring out the optimisations I can, without going too far.

I started this module wanting to keep it "clean and lean and mean", butas I discover more and more twists, it is getting to look like theclassical spaghetti bowl..

I am also, but on a separate thread, looking at tying this AAA stuff tothe $r->connection (with notes()).


I'm also having fun doing this, it's interesting.

The idea being that if we are in a sub-request, there is no point in
authenticating/authorizing it again, since the main request should
already do that, right ?  Optimisation..

Now the above works very nicely, except in the case where, before
this handler gets called, there is an intervention by mod_rewrite. It
seems as if mod_rewrite makes the above fail, even when the rewrite
condition does not apply and the URL is considered as a
"pass-through".

I suspect that it is because mod_rewrite, no matter what, invoques
the original (or modified) URL as a sub-request of the original
request. This would cause the above to fail, because in such a case,
the above conditional code would be invoked, but there is no
$r->prev->user to be copied.

mod_rewrite doesn't make subrequests if not asked to. I know only of 2ways to have mod_rewrite perform a subreq: %{LA-U:variable}and %{LA-F:variable} in a RewriteCond.


This :
http://perl.apache.org/docs/2.0/api/Apache2/RequestUtil.html#C_is_initial_req_
may be missing ".. or an internal redirect" in a couple of places.

Re: requests and sub-requests

Reply via email to