There is a healthy discussion going on over in HADOOP-15566 on tracing
in the Hadoop ecosystem. It would sit better on a mailing list than in
comments up on JIRA so here's an attempt at porting the chat here.

Background/Context: Bits of Hadoop and HBase had Apache HTrace trace
points added. HTrace was formerly "incubating" at Apache but has since
been retired, moved to Apache Attic. HTrace and the efforts at
instrumenting Hadoop wilted for want of attention/resourcing. Our Todd
Lipcon noticed that the HTrace instrumentation can add friction on
some code paths so can actually be harmful even when disabled.  The
natural follow-on is that we should rip out tracings of a "dead"
project. This then beggars the question, should something replace it
and if so what? This is where HADOOP-15566 is at currently.

HTrace took two or three runs, led by various Heros, at building a
trace lib for Hadoop (first). It was trying to build the trace lib, a
store, and a visualizer. Always, it had a mechanism for dumping the
traces out to external systems for storage and viewing (e.g. Zipkin).
HTrace started when there was little else but the, you guessed it,
Google paper that described the Dapper system they had internally.
Since then, the world of tracing has come on in leaps and bounds with
healthy alternatives, communities, and even commercialization.

If interested, take a read over HADOOP-15566. Will try and encourage
participants to move the chat here.

Thanks,
St.Ack

---------------------------------------------------------------------
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org

Reply via email to