Find attached the modified AJP14 proposal which include the remarks and request from the list. Nota : The launch of Tomcat JVM by the mod_jk/jakarta-connector is not covered since we speak here only about protocol. That feature is something asked by many JServ users and may be added later in the native part, at least in Apache 1.3/2.0 mode. I'm not sure how it can be done when using IIS or NES. The automatic context update was an interesting subject and how we can have these kind of admin informations raised more questions than answers. Opening alternate sockets (control socket) we'll load more the web-server with more opened sockets : on forked environnement (ie Apache 1.3), 2 x forked (1 data + 1 admin) on threaded environnement (ie Apache 2.0), 1 x forked (x threaded data + 1 admin) A solution could be to use the URGENT DATA mode of TCP/IP socket but I'm not sure it's supported now in JDK 1.1/1.2/1.3.... Another solution is to delay the automatic update mode. AJP14 have provision on NEGOCIATION phase to disactivate this feature. But we still need to know when a context is DOWN and later UP. * A lazy solution could be to have the servlet engine drop all AJP connections at each update of context (UP/DOWN). The web servlet will have then to re-open the connection and will received the UPDATED context list. * Another solution will be having the servlet engine sending a 'Context XXX Down' reply when the user send a request against a DOWN context. The servlet engine could then route the request to another engine (in LB mode), or to indicate the failure (in simple servlet engine mode). And when a context is marked DOWN, we need to know when it's UP again. Brute mode : the web-server continue to forward the requests to the servlet engine, and if it receive the 'Context XXX Down', it try another route. Clean mode : the web-server send a 'Context XXX Status' request, check if the reply is 'Context XXX Up', and if still down, try another route. This mode could be tuned. ie for a down context, ask for status each 10 or 50 requests. The context UP/DOWN is really a rare case, and we mustn't spend to much time in handling this type of event. The protocol must keep its speed. Another feature I'd like to have in mod_jk/jakarta-connector and present in mod_jserv is the backup mode : - In standard mode, a request is routed to only one servlet-engine (using a know worker). - In load balancing mode, a request is sent to a pool of servlet-engine, using a load factor. - We miss a backup mode, where all requests goes allways to the same servlet engine until the connection is broken (maybe the remote engine is down or network link closed). We define then one (or more) alternate engines to handle requests. In that mode, the web-servlet will try to detect later if the principal servlet engine is up each N requests (or after M seconds...) - Henri Gomez ___[_]____ EMAIL : [EMAIL PROTECTED] (. .) PGP KEY : 697ECEDD ...oOOo..(_)..oOOo... PGP Fingerprint : 9DF8 1EA8 ED53 2F39 DC9B 904A 364F 80E6
Proposal for Apache JServ 1.4 - Second Pass This document is a proposal of evolution of the current Apache JServ Protocol version 1.3, also known as ajp13. I'll not cover here the full protocol but only the add-on from ajp13. This second pass include comments from the tomca-dev list Missing features in AJP13 ------------------------- ajp13 is a good protocol to link a servlet engine like tomcat to a web server like Apache: * use persistants connections to avoid reconnect time at each request * encode many http commands to reduce stream size * send to servlet engine many info from web server (like SSL certs) But ajp13 lacks support for : * security between web server and servlet engine. Anybody can connect to an ajp13 port (no login mecanism used) You could connect, for example with telnet, and keep the remote thread up by not sending any data (no timeout in connection) * context information passed from servlet engine to web server. Part of the configuration of mod_jk, the web server connector, is to indicate to the web server which URI to handle. The mod_jk JkMount directive, told to web server which URI must be forwarded to servlet engine. A servlet engine allready knows which URI it handle and TC 3.3 is allready capable to generate a config file for mod_jk from the list of available contexts. * state update of contexts from servlet engine to web server. Big site with farm of Tomcat, like ISP and virtuals hosters, may need to stop a context for admin purposes. In that case the front web server must know that the context is currently down, to eventually relay the request to another Tomcat * verify state of connection before sending request. Actually mod_jk send the request to the servlet engine and next wait for the answer. But one of the beauty of the socket API, is you that you could write() to a closed connection without any error reporting, but a read() to a closed connection return you the error code. AJP14 add-ons to AJP13 ---------------------- Let's descrive here the features and add-on that will be added to AJP13, which will became AJP14. Since this document is a proposal, a resonable level of chaos must be expected at start. Be sure that discussion on tomcat list will help clarify points, add features but the current list seems to be a 'minimun vital' * Advanced login features at connect time * Basic authorisation system, where a shared secret key is present in web server and servlet engine. * Basic protocol negociation, just to be sure that if functionnalities are added to AJP14 in the future, current implementations will still works. * Clean handling of 'Unknown packets' * Extended env vars passed from web-server to servlet engine. Advanced login -------------- 1) WEB-SERVER send LOGIN INIT CMD + NEGOCIATION DATA + WEB SERVER INFO 2) TOMCAT respond with LOGIN SEED CMD + RANDOM DATA 3) WEB-SERVER calculted the MD5 of RANDOM DATA+SECRET DATA 4) WEB-SERVER send LOGIN COMP CMD + MD5 (SECRET DATA + RANDOM DATA) 5) TOMCAT respond with LOGIN STATUS CMD + NEGOCIED DATA + SERVLET ENGINE INFO To prevent DOS attack, the servlet engine will wait the LOGIN CMD only 15/30 seconds and reports the timeout exception for admins investigation. The login command will contains basic protocol negociation information like compressing ability, crypto, context info (at start up), context update at run-time (up/down), level of SSL env vars, AJP protocol supported (AJP14/AJP15/AJP16...) The Web server info will contain web server info and connector name (ie Apache 1.3.19 + mod_ssl 2.8.2 + mod_jk 3.3 + mod_perl 1.25). The servlet engine will mask the negociation mask with it's own mask (what it can do) and return it when loggin is accepted. This will help having a basic ajp14 implementation on a web-server working with a more advanced ajp14 on the servlet engine side or vice-versa. AJP13 was designed to be small and fast and so many SSL informations present in the web-server are not forwarded to the servlet engine. We add here four negociations flags to provide more informations on client SSL data (certs), server SSL datas , crypto used, and misc datas (timeout...). - Messages Stream - +-------------------------+---------------------------+---------------------------+ | LOGIN INIT CMD (1 byte) | NEGOCIATION DATA (32bits) | WEB SERVER INFO (CString) | +-------------------------+---------------------------+---------------------------+ +-------------------------+---------------------------+ | LOGIN SEED CMD (1 byte) | MD5 of entropy (32 chars) | +-------------------------+---------------------------+ +-------------------------+---------------------------------------+ | LOGIN COMP CMD (1 byte) | MD5 of RANDOM + SECRET KEY (32 chars) | +-------------------------+---------------------------------------+ +--------------------+------------------------+-------------------------------+ | LOGOK CMD (1 byte) | NEGOCIED DATA (32bits) | SERVLET ENGINE INFO (CString) | +--------------------+------------------------+-------------------------------+ +---------------------+-----------------------+ | LOGNOK CMD (1 byte) | FAILURE CODE (32bits) | +---------------------+-----------------------+ The secret key will be set by a new JkSecretKey ie: JkSecretKey myworker1 myverysecurekey Shutdown feature ---------------- AJP13 miss a functionnality of AJP12, which is shutdown command. A logout will tell servlet engine to shutdown itself. +-----------------------+---------------------------------------+ | SHUTDOWN CMD (1 byte) | MD5 of RANDOM + SECRET KEY (32 chars) | +-----------------------+---------------------------------------+ +---------------------+ | SHUTOK CMD (1 byte) | +---------------------+ +----------------------+-----------------------+ | SHUTNOK CMD (1 byte) | FAILURE CODE (32bits) | +----------------------+-----------------------+ Extended Env Vars feature ------------------------- Many users will want to see some of their web-server env vars passed to their servlet engine. To reduce the network traffic, the web-servlet will send a table to describing the external vars in a shorter fashion. We'll use there a functionnality allready present in AJP13, attributes list : In the AJP13, we've got : AJP13_FORWARD_REQUEST := prefix_code 2 method (byte) protocol (string) req_uri (string) remote_addr (string) remote_host (string) server_name (string) server_port (integer) is_ssl (boolean) num_headers (integer) request_headers *(req_header_name req_header_value) ?context (byte string) ?servlet_path (byte string) ?remote_user (byte string) ?auth_type (byte string) ?query_string (byte string) ?jvm_route (byte string) ?ssl_cert (byte string) ?ssl_cipher (byte string) ?ssl_session (byte string) ?attributes *(attribute_name attribute_value) request_terminator (byte) Using short 'web server attribute name' will reduce the network traffic. +----------------------------+-------------------------------------+-----------------------------------------+ | EXTENDED VARS CMD (1 byte) | WEB SERVER ATTRIBUTE NAME (CString) | SERVLET ENGINE |ATTRIBUTE NAME (CString) |... +----------------------------+-------------------------------------+-----------------------------------------+ ie : JkExtVars S1 SSL_CLIENT_V_START javax.servlet.request.ssl_start_cert_date JkExtVars S2 SSL_CLIENT_V_END javax.servlet.request.ssl_end_cert_date JkExtVars S3 SSL_SESSION_ID javax.servlet.request.ssl_session_id +-------------------+----+-------------------------------------------+ | EXTENDED VARS CMD | S1 | javax.servlet.request.ssl_start_cert_date | +-------------------+----+-------------------------------------------+ +----+-----------------------------------------+ | S2 | javax.servlet.request.ssl_end_cert_date | +----+-----------------------------------------+ +----+-----------------------------------------+ | S3 | javax.servlet.request.ssl_end_cert_date | +----+-----------------------------------------+ During transmission in AJP14 we'll see attributes name containing S1, S2, S3 and attributes values of 2001/01/03, 2002/01/03, 0123AFE56. This example showed the use of extended SSL vars but any 'personnal' web-server vars like custom authentification vars could be reused in the servlet engine. The cost will be only some more bytes in the AJP traffic. Context informations forwarding for Servlet engine to Web Server ---------------------------------------------------------------- Just after the LOGON PHASE, the web server will receive (if AJP14_CONTEXT_INFO_NEG is set) the list of contexts and URLs handled by the servlet engine. It will ease installation in many sites, reduce questions about configuration on tomcat-user list, and be ready for servlet API 2.3. This mode will be activated by a new directive JkAutoMount ie: JkAutoMount myworker1 A servlet engine could have many contexts, /examples, /admin, /test. We may want to use only some contexts for a given worker. It was done previously, in apache HTTP server for example, by setting by hand the JkMount accordingly in each <virtual> area of Apache. The new JkAutoMount is adapted for that purpose. We add a third parameter in JkAutoMount to meet these requirement ie: JkAutoMount myworker1 www.myvirtualserver.com In that case the servlet engine will only return the URL/URI matching these particular virtual server (defined in server.xml). This feature will help ISP and big sites which mutualize large farm of Tomcat in load-balancing configuration. - Messages Stream - +--------------------------+---------------------------------+ | CONTEXT QRY CMD (1 byte) | VIRTUAL HOST NAME (CString (*)) | +--------------------------+---------------------------------+ +---------------------------+---------------------------------+-------------------------------+ | CONTEXT INFO CMD (1 byte) | VIRTUAL HOST NAME (CString (*)) | URL1 [\n] URL2 [\n] |URL3 [\n] | +---------------------------+---------------------------------+-------------------------------+ *) The CString is a C string, ie an array of chars terminated by a null byte (/0). En empty string is just a null byte (/0). *) When VirtualMode is not to be used, the VIRTUAL HOST NAME is an empty string, the servlet engine will then send the whole context present. Context informations updates from Servlet engine to Web Server --------------------------------------------------------------- Context update are messages caming from the servlet engine each time a context is desactivated/reactivated. The update will be in use when the directive JkUpdateMount. This directive will set the AJP14_CONTEXT_UPDATE_NEG flag. ie: JkUpdateMount myworker1 Also in this mode we may have a third parameter to handle the virtual server to be used. ie: JkUpdateMount myworker1 www.myvirtualserver.com +-----------------------------+---------------------------------+-------------------------+ | CONTEXT UPDATE CMD (1 byte) | VIRTUAL HOST NAME (CString (*)) | STATUS UP/DOWN (1 |byte) | +-----------------------------+---------------------------------+-------------------------+ *) When VirtualMode is not in use, the VIRTUAL HOST NAME is an empty string. Context status query to Servlet engine -------------------------------------- This query will be used by the web-server to determine if a given context is UP or DOWN. +----------------------------+----------------------------------+----------------------------+ | CONTEXT STATE CMD (1 byte) | VIRTUAL HOST NAME (CString (*)) | CONTEXT NAME |(CString (*)) | +----------------------------+----------------------------------+----------------------------+ +----------------------------------+---------------------------------+----------------------------+------------------+ | CONTEXT STATE REPLY CMD (1 byte) | VIRTUAL HOST NAME (CString (*)) | CONTEXT NAME |(CString (*)) | UP/DOWN (1 byte) | +----------------------------------+---------------------------------+----------------------------+------------------+ *) When VirtualMode is not in use, the VIRTUAL HOST NAME is an empty string. Handling of unknown packets --------------------------- Sometimes even with a well negocied protocol, we may be in a situation where one end (web server or servlet engine), will receive a message it couldn't understand. In that case the receiver will send an 'UNKNOW PACKET CMD' with attached the unhandled message. +-----------------------------+------------------------------+ | UNKNOWN PACKET CMD (1 byte) | UNHANDLED MESSAGE (bytes...) | +-----------------------------+------------------------------+ Depending on the message, the sender will report an error and if possible will try to forward the message to another endpoint. Verification of connection before sending request ------------------------------------------------- One of the beauty of socket APIs, is that you could write on a half closed socket. When servlet engine close the socket, the web server will discover it only at the next read() to the socket. Basically, in the AJP13 protocol, the web server send the HTTP HEADER and HTTP BODY (POST by chunk of 8K) to the servlet engine and then try to receive the reply. If the connection was broken the web server will learn it only at receive time. We could use a buffering scheme but what happen when you use the servlet engine for upload operations with more than 8ko of datas ? The hack in the AJP13 protocol is to add some bytes to read after the end of the service : EXAMPLE OF DISCUSSION BETWEEN WEB SERVER AND SERVLET ENGINE AJP HTTP-HEADER (+ HTTP-POST) (WEB->SERVLET) AJP HTTP-REPLY (SERVLET->WEB) AJP END OF DISCUSSION (SERVLET->WEB) ---> AJP STATUS (SERVLET->WEB AJP14) The AJP STATUS will not be read by the servlet engine at the end of the request/response #N but at the begining of the next session. More at that time the web server could also use OS dependants functions (or better APR functions) to determine if there is also more data to read. And that datas could be CONTEXT Updates. This will avoid the web server sending a request to a desactivated context. In that case, if the load-balancing is used, it will search for another servlet engine to handle the request. And that feature will help ISP and big sites with farm of tomcat, to updates their servlet engine without any service interruption. +---------------------+----------------------+ | STATUS CMD (1 byte) | STATUS DATA (1 byte) | +---------------------+----------------------+ Conclusion ---------- The goal of the AJP14 protocol is to overcome some of the AJP13 limitation. An easier configuration, a better support for large site and farm of Tomcat, a simple authentification system and provision for protocol updates. Using the stable ajp13 implementation in mod_jk (native) and in servlet engine (java), it's a reasonable evolution of the well known ajp13. Commands and IDs in AJP14 ------------------------- - Commands IDs - AJP14_LOGINIT_CMD 0x10 AJP14_LOGSEED_CMD 0x11 AJP14_LOGCOMP_CMD 0x12 AJP14_LOGOK_CMD 0x13 AJP14_LOGNOK_CMD 0x14 AJP14_CONTEXT_QRY_CMD 0x15 AJP14_CONTEXT_INFO_CMD 0x16 AJP14_CONTEXT_UPDATE_CMD 0x17 AJP14_STATUS_CMD 0x18 AJP14_SHUTDOWN_CMD 0x19 AJP14_SHUTOK_CMD 0x1A AJP14_SHUTNOK_CMD 0x1B AJP14_CONTEXT_STATE_CMD 0x1C AJP14_CONTEXT_STATE_REP_CMD 0x1D AJP14_UNKNOW_PACKET_CMD 0x1E - Negociations Flags - AJP14_CONTEXT_INFO_NEG 0x80000000 /* web-server want context info after login */ AJP14_CONTEXT_UPDATE_NEG 0x40000000 /* web-server want context updates */ AJP14_GZIP_STREAM_NEG 0x20000000 /* web-server want compressed stream */ AJP14_DES56_STREAM_NEG 0x10000000 /* web-server want crypted DES56 stream with secret key */ AJP14_SSL_VSERVER_NEG 0x08000000 /* Extended info on server SSL vars */ AJP14_SSL_VCLIENT_NEG 0x04000000 /* Extended info on client SSL vars */ AJP14_SSL_VCRYPTO_NEG 0x02000000 /* Extended info on crypto SSL vars */ AJP14_SSL_VMISC_NEG 0x01000000 /* Extended info on misc SSL vars */ AJP14_PROTO_SUPPORT_AJPXX_NEG 0x00FF0000 /* mask of protocol supported */ AJP14_PROTO_SUPPORT_AJP14_NEG 0x00010000 /* communication could use AJP14 */ AJP14_PROTO_SUPPORT_AJP15_NEG 0x00020000 /* communication could use AJP15 */ AJP14_PROTO_SUPPORT_AJP16_NEG 0x00040000 /* communication could use AJP16 */ ... All others flags must be set to 0 since they are reserved for future use. - Failure IDs - AJP14_BAD_KEY_ERR 0xFFFFFFFF AJP14_ENGINE_DOWN_ERR 0xFFFFFFFE AJP14_RETRY_LATER_ERR 0xFFFFFFFD AJP14_SHUT_AUTHOR_FAILED_ERR 0xFFFFFFFC - Status - AJP14_CONTEXT_DOWN 0x01 AJP14_CONTEXT_UP 0x02 AJP14_CONTEXT_OK 0x03