I mean in AnalyzeColumnCommand.scala the first one to compute percentiles and the second one to compute columnStats.
Chrysan Wu 吴晓菊 Phone:+86 17717640807 2018-07-30 23:28 GMT+08:00 Reynold Xin <r...@databricks.com>: > Which API are you talking about? > > On Mon, Jul 30, 2018 at 7:03 AM 吴晓菊 <chrysan...@gmail.com> wrote: > >> I noticed that in column analyzing, 2 jobs will run separately to >> calculate percentiles and then distinct. Why not combine into one job since >> HyperLogLog also supports merge? >> >> Chrysan Wu >> Phone:+86 17717640807 >> >>