On Jan 30, 2008 6:01 PM, Mark Green <[EMAIL PROTECTED]> wrote: > Ahem, there's a huge difference between being confronted with > a spinner/progress bar or an error page. The former speaks > "Please wait", the latter speaks "Try again".
OK, so let's break this down. There are two potential cases where you run up against your database's concurrent connection limit: 1. Your average traffic level involves more concurrent connections than your database permits. 2. Your average traffic level is within the number of concurrent connections your database permits, but you are experiencing a temporary spike above that level. In case (1), the odds of a timeout/retry scheme yielding success within the time the average user is prepared to wait are low; your database is simply swamped, and the only options are to increase available resources (in the form of database connections) or refuse to serve some number of requests. So case (2) is the only one worth discussing as a target for a timeout/retry scheme. In this case, you are (apparently) asking for Django to provide three things: 1. A mechanism for specifying a set number of database connections which Django will persistently maintain across request/response boundaries. 2. A mechanism for specifying how long Django should wait, when attempting to obtain a connection, before timing out and concluding that no connection will be obtained. 3. A mechanism for specifying how many times, after failing to obtain a connection, Django should re-try the attempt. How, then, would we apply these hypothetical configuration directives to the situation of a traffic spike? There are two possibilities: 1. Set these directives in advance and have them be a permanent part of the running application's configuration. 2. Avoid setting these directives until a spike is imminent (difficult to do) or in progress, and leave them only so long as the spike persists. In case (1) you are flat-out wasting resources and complicating Django's operation, by holding resources which are not used and mandating more complex logic for obtaining those resources. In nearly all cases this is a bad trade-off to make. In case (2) the first option really isn't possible, because you can't reliably predict traffic spikes in advance. This leaves the second option, which requires you to be constantly watching the number of database connections in use and involves shutting down your application temporarily in order to insert the necessary configuration directives. It is also unlikely that you will be able to do so before at least some users have received error pages. So you must either waste resources, or accept increased monitoring overhead and the inevitability that some requests will not receive successful responses. Add to this the following disadvantages: * More complex configuration of Django (and hence more potential for configuration error). * More complex code base for Django (and hence more operating overhead and more potential bugs). * Loss of flexibility in that the application layer must now possess more information about the database layer. * Loss of flexibility in that Django must now maintain persistent resources between requests, and have some mechanism for equitably distributing them across arbitrarily-many server processes on one or more instances of the HTTP daemon on one or more virtual or physical machines. Given all of this, I don't see this level of configuration in the application layer as a worthwhile trade-off to make. You've mentioned JDBC, which points to one potential conflict between your expectations and the reality of Django's design: Java-based web applications typically run threaded within a single Java-based server instance which is able to contain persist resources across requests and manage them equitably across all threads. Django, on the other hand, typically runs multi-*process* behind or embedded in an HTTP daemon and shares no information across the boundaries of those processes, and deliberately seeks to minimize the number of things which must be persisted across the boundaries of request/response cycles. This is a fundamentally different architecture, and so decreases the portability of assumptions from a Java-style background. In particular, there is no master Django process which can create and maintain a resource pool and allocate resources from it to other Django processes (especially since processes may be running behind/embedded in other instances of the HTTP daemon on the same machine, or on remote machines). Which brings up yet another disadvantage: * The configuration of the number of connections Django should maintain, and the timeout/retry directives, must be calculated with the assumption that they will be applied by each and every process independently and without knowledge of any of the others. At that point even the calculations involved in determining the correct settings are probably impossible, since common server arrangements can and will increase or decrease the number of running processes on the fly; calculations for Apache/mod_python, for example, must be able to equitably and reliably manage your resources for any number of independent processes "n" such that StartServers <= n <= MaxClients, and must take into account the turnover of processes which reach MaxRequestsPerChild. Throw in load-balancing among multiple HTTP daemons, and the complexity becomes mind-boggling. Based on this, it's even more clear that attempting to do this at the application layer is a bad idea. Since you apparently place an extremely high value on availability of a timeout/retry feature, I recommend seeking out an external connection manager which offers that feature; you will gain all the benefits of that feature while retaining the simplicity and flexibility of Django's configuration and shared-nothing architecture. -- "Bureaucrat Conrad, you are technically correct -- the best kind of correct." --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Django users" group. To post to this group, send email to django-users@googlegroups.com To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/django-users?hl=en -~----------~----~----~----~------~----~------~--~---