On Mon, Jan 13, 2025 at 10:47 PM Eugene Rozenfeld
<eugene.rozenf...@microsoft.com> wrote:
>
> We are initializing both the call graph node count and
>
> the entry block count of the function with the head_count value
>
> from the profile.
>
>
>
> Count propagation algorithm may refine the entry block count
>
> and we may end up with a case where the call graph node count
>
> is set to 0 but the entry block count is non-zero. That becomes
>
> a problem because we have this code in execute_fixup_cfg:
>
>
>
> profile_count num = node->count;
>
> profile_count den = ENTRY_BLOCK_PTR_FOR_FN (cfun)->count;
>
> bool scale = num.initialized_p () && !(num == den);
>
>
>
> Here if num is 0 but den is not 0, scale becomes true and we
>
> lose the counts in
>
>
>
> if (scale)
>
>   bb->count = bb->count.apply_scale (num, den);
>
>
>
> This is what happened the issue reported in PR116743
>
> (a 10% regression in MySQL HAMMERDB tests).
>
> 3d9e6767939e9658260e2506e81ec32b37cba041 made an improvement in
>
> AutoFDO count propagation, which caused the mismatch between
>
> the call graph node count (zero) and the entry block count (non-zero)
>
> and subsequent loss of counts as described above.
>
>
>
> The fix is to update the call graph node count once we've done count 
> propagation.
>
>
>
> Tested on x86_64-pc-linux-gnu.

OK.

Thanks,
Richard.

>
>
> gcc/ChangeLog:
>
>                 PR gcov-profile/116743
>
>                 * auto-profile.c (afdo_annotate_cfg): Fix mismatch between 
> the call graph node count
>
>                 and the entry block count.
>
> ---
>
> gcc/auto-profile.cc | 4 ++--
>
> 1 file changed, 2 insertions(+), 2 deletions(-)
>
>
>
> diff --git a/gcc/auto-profile.cc b/gcc/auto-profile.cc
>
> index 5d0e8afb9a1..aa4d1634f01 100644
>
> --- a/gcc/auto-profile.cc
>
> +++ b/gcc/auto-profile.cc
>
> @@ -1538,8 +1538,6 @@ afdo_annotate_cfg (const stmt_set &promoted_stmts)
>
>    if (s == NULL)
>
>      return;
>
> -  cgraph_node::get (current_function_decl)->count
>
> -     = profile_count::from_gcov_type (s->head_count ()).afdo ();
>
>    ENTRY_BLOCK_PTR_FOR_FN (cfun)->count
>
>       = profile_count::from_gcov_type (s->head_count ()).afdo ();
>
>    EXIT_BLOCK_PTR_FOR_FN (cfun)->count = profile_count::zero ().afdo ();
>
> @@ -1578,6 +1576,8 @@ afdo_annotate_cfg (const stmt_set &promoted_stmts)
>
>        /* Calculate, propagate count and probability information on CFG.  */
>
>        afdo_calculate_branch_prob (&annotated_bb);
>
>      }
>
> +  cgraph_node::get(current_function_decl)->count
>
> +      = ENTRY_BLOCK_PTR_FOR_FN(cfun)->count;
>
>    update_max_bb_count ();
>
>    profile_status_for_fn (cfun) = PROFILE_READ;
>
>    if (flag_value_profile_transformations)
>
> --
>
> 2.34.1
>
>

Reply via email to