On Fri, Apr 20, 2018 at 03:18:37PM -0700, Yonghong Song wrote:
> When helpers like bpf_get_stack returns an int value
> and later on used for arithmetic computation, the LSH and ARSH
> operations are often required to get proper sign extension into
> 64-bit. For example, without this patch:
>     54: R0=inv(id=0,umax_value=800)
>     54: (bf) r8 = r0
>     55: R0=inv(id=0,umax_value=800) R8_w=inv(id=0,umax_value=800)
>     55: (67) r8 <<= 32
>     56: R8_w=inv(id=0,umax_value=3435973836800,var_off=(0x0; 0x3ff00000000))
>     56: (c7) r8 s>>= 32
>     57: R8=inv(id=0)
> With this patch:
>     54: R0=inv(id=0,umax_value=800)
>     54: (bf) r8 = r0
>     55: R0=inv(id=0,umax_value=800) R8_w=inv(id=0,umax_value=800)
>     55: (67) r8 <<= 32
>     56: R8_w=inv(id=0,umax_value=3435973836800,var_off=(0x0; 0x3ff00000000))
>     56: (c7) r8 s>>= 32
>     57: R8=inv(id=0, umax_value=800,var_off=(0x0; 0x3ff))
> With better range of "R8", later on when "R8" is added to other register,
> e.g., a map pointer or scalar-value register, the better register
> range can be derived and verifier failure may be avoided.
> 
> In our later example,
>     ......
>     usize = bpf_get_stack(ctx, raw_data, max_len, BPF_F_USER_STACK);
>     if (usize < 0)
>         return 0;
>     ksize = bpf_get_stack(ctx, raw_data + usize, max_len - usize, 0);
>     ......
> Without improving ARSH value range tracking, the register representing
> "max_len - usize" will have smin_value equal to S64_MIN and will be
> rejected by verifier.
> 
> Signed-off-by: Yonghong Song <y...@fb.com>
> ---
>  kernel/bpf/verifier.c | 26 ++++++++++++++++++++++++++
>  1 file changed, 26 insertions(+)
> 
> diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
> index 3c8bb92..01c215d 100644
> --- a/kernel/bpf/verifier.c
> +++ b/kernel/bpf/verifier.c
> @@ -2975,6 +2975,32 @@ static int adjust_scalar_min_max_vals(struct 
> bpf_verifier_env *env,
>               /* We may learn something more from the var_off */
>               __update_reg_bounds(dst_reg);
>               break;
> +     case BPF_ARSH:
> +             if (umax_val >= insn_bitness) {
> +                     /* Shifts greater than 31 or 63 are undefined.
> +                      * This includes shifts by a negative number.
> +                      */
> +                     mark_reg_unknown(env, regs, insn->dst_reg);
> +                     break;
> +             }
> +             if (dst_reg->smin_value < 0)
> +                     dst_reg->smin_value >>= umin_val;
> +             else
> +                     dst_reg->smin_value >>= umax_val;
> +             if (dst_reg->smax_value < 0)
> +                     dst_reg->smax_value >>= umax_val;
> +             else
> +                     dst_reg->smax_value >>= umin_val;
> +             if (src_known)
> +                     dst_reg->var_off = tnum_rshift(dst_reg->var_off,
> +                                                    umin_val);
> +             else
> +                     dst_reg->var_off = tnum_rshift(tnum_unknown, umin_val);
> +             dst_reg->umin_value >>= umax_val;
> +             dst_reg->umax_value >>= umin_val;
> +             /* We may learn something more from the var_off */
> +             __update_reg_bounds(dst_reg);

I'm struggling to understand how these bounds are computed.
Could you add examples in the comments?
In particular if dst_reg is unknown (tnum.mask == -1)
the above tnum_rshift() will clear upper bits and will make it
64-bit positive, but that doesn't seem correct.
What am I missing?

Reply via email to