Re: [DISCUSS] The way we log

Mick Semb Wever Tue, 23 Jul 2024 04:36:07 -0700

reply below…

On Thu, 30 May 2024 at 14:34, Štefan Miklošovič <stefan.mikloso...@gmail.com>
wrote:

> I see the feedback is overall positive. I will merge that and I will
> improve the documentation on the website along with what Benedict suggested.
>

I think the codestyle needs to be more explicit that this applies _only_ to
trace statements, and also take the opportunity to provide more context to
our unusual use of log levels and its implications.

tl;dr

For the debug log level I would like to propose we:
 1. codestyle should mention that: debug doesn't mean debug to us, it is
for background activities intended to be enabled in production,
 2. doc/comment in logback.xml that the debug logger should not be disabled
 3. codestyle against (and remove all)  isDebugEnabled

long format…

1.
Our debug logging is not "debug" but the "non read/write path logging".
We hijacked the debug loglevel for the separation of all background
activities.  This trips most newcomers up. We can do a better job of
documenting this (see (2).

system.log == read/write path.
debug.log == also background activities.

There's some nuance here – there is logging around background activities
that we do want in system.log, which is correspondingly at the info
loglevel.  But generally, on the read/write path, we want to use the trace
level when one typically thinks about "debug", and on the background paths,
use debug when thinking about "info".

More history on this in CASSANDRA-10241 and in
https://lists.apache.org/thread/mgmbmdw4cp4rq684wcllfjc3sd1sbhkm

This separation of log files between foreground and background activity
*is* very valuable to the operator (and expert).  We have a lot of
experience on this.

And we did have some good docs about logging here:
https://cwiki.apache.org/confluence/display/CASSANDRA2/LoggingGuidelines
This should be revived IMHO.

2.
We often see operators in production disable the debug loglevel (or
appender), thinking they are doing the right thing.  We are then left
unable to investigate issues related to background tasks (compaction,
streaming, repairs, etc), and have to ask them to enable it again before
collecting the logs again.

Aside: we do tell operators to disable either the file appenders or stdout,
as having both running is unnecessary.  When they do this they often then
go and make the mistake of also disabling the ASYNCDEBUGLOG.

There's been calls in the past to better name our log files.  I do favour
the renaming system.log to foreground.log, and debug.log to everything.log,
or anything along those lines.   In the referenced ML thread there was also
discussion about using a marker, so to remove foreground activities from
debug.log, in which case a name like background.log would make more sense.
But renaming log files is a compatibility break, and is a separate
discussion.

3.
The use of isDebugEnabled should be avoided, as it confuses
- heavy methods calls in debug cannot be avoid
- that debug is always enabled (just like assertions)
- debug isn't appropriate on the read/write path (use trace instead)
- info isn't appropriate on the background paths, unless you intentionally
want it in system.log

Debug should never cause a production performance problem.  A past example
of this confusion:
https://github.com/apache/cassandra/commit/ac77e5e7742548f7c7c25da3923841f59d4b2713

It's my understanding this has already been the intentional practice and
precedence for most of us, ~10% of debug calls today are wrapped by
isDebugEnabled, compared to ~37% for trace being wrapped with
isTraceEnabled.

While the use of isDebugEnabled might offer some small value to those that
adominantly want to disable the background logging, and see some minimal
perf improvement from it, I don't think that argument is popular, or wise,
enough against a healthy code style and precedence that is intuitive to its
intent.

Like our assert statements needing not to cause production performance
problems (because like debug we expect it to be enabled in production),
best we document our idiosyncrasies rather than let them be tribal
knowledge.

Re: [DISCUSS] The way we log

Reply via email to