Re: JK 1.2.28 - load balancer worker fails on startup with one worker down ?

David Fisher Fri, 17 Apr 2009 07:50:39 -0700

An interesting discussion. Since I am about to configure such a loadbalancer and we prefer to use DNS, understanding this type of detailis critical.

The OP said that the reason that the DNS did not resolve was that themachine had been moved off the network. That may have been an eventout of the control of the sysadmin for the web service. Supposesomeone takes a server away and then a week later apache and mod_jkare restarted via a cron job in the middle of the night? Suddenly theweb service is down.

I think that one could argue that if a configuration has beensuccessful then this error should be a warning and if theconfiguration file has been altered since the past time it is a fatalerror. That may be too much extra logic and it is non-deterministic asa configuration file change is hard to detect accurately.

Perhaps more helpful would be to have a sysadmin email address in theconfig and then when things go fatal send an email with theappropriate log information. It is all about catching appropriatelythrown error classes. What logging facility does mod_jk use? It couldbe that plugging in a special logger in this situation makes sense.


Regards,
Dave

On Apr 17, 2009, at 5:28 AM, André Warnier wrote:

Rainer Jung wrote:
[...]
What remains for me is your suggestion, that the error is not a fatal
one, since there are other balanced workers left. We could includesuch
a check in the startup code, although I'm not really convinced, that
your problem is a good reason for this.
I'm open to more argumntation and suggestions :)
Argumentation #1 against a change in logic:
The OP argues that one single unresolvable balanced worker shouldnot stop the other 4 from working, hence that the balancer shouldstart anyway, since 80% of the capacity is still available. Itsounds reasonable in principle.But what if there are only 2 balanced workers in total, of which oneis unresolvable at start ? would it be normal to start with only onebalanced worker available anyway ?
If not, then where's the limit of "acceptable" ?

Argumentation #2 against a change in logic:
Suppose the balancer would start, with the resolved workers only.
Suppose the resolving problem comes from a typo, not the fact thatthe given host is temporarily out of the DNS system, but a definitenon-existing host. It will not be retried, so there will never beanother error/warning message. The host itself may be ok and respondto pings etc.., it will just never be hit by Apache's mod_jk, sothis would be a very quiet error.How is the sysadmin going to figure out that there is, basically, aproblem ?
Argumentation for a change in logging:
It would be clearer if the error message stated explicitly that "thebalancer worker was not started due to a /configuration/ error, seeabove message(s)".
But then, if even I could figure it out from the existing errormessage, then just about everyone should be able to.And what would be the use of the likes of me, if everything wasclear ?
;-)

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org

Re: JK 1.2.28 - load balancer worker fails on startup with one worker down ?

Reply via email to