On Tue 2015-02-03 12:38:28, Petr Mladek wrote:
> On Tue 2015-02-03 16:41:39, Masami Hiramatsu wrote:
> > (2015/02/03 2:48), Petr Mladek wrote:
> > > can_probe() checks if the given address points to the beginning of
> > > an instruction. It analyzes all the instructions from the beginning
> > > of the function until the given address. The code might be modified
> > > by another Kprobe. In this case, the current code is read into a buffer,
> > > int3 breakpoint is replaced by the saved opcode in the buffer, and
> > > can_probe() analyzes the buffer instead.
> > > 
> > > There is a bug that __recover_probed_insn() tries to restore
> > > the original code even for Kprobes using the ftrace framework.
> > > But in this case, the opcode is not stored. See the difference
> > > between arch_prepare_kprobe() and arch_prepare_kprobe_ftrace().
> > > The opcode is stored by arch_copy_kprobe() only from
> > > arch_prepare_kprobe().
> > > 
> > > This patch makes Kprobe to use the ideal 5-byte NOP when the code
> > > can be modified by ftrace. It is the original instruction, see
> > > ftrace_make_nop() and ftrace_nop_replace().
> > > 
> > > Note that we always need to use the NOP for ftrace locations. Kprobes
> > > do not block ftrace and the instruction might get modified at anytime.
> > > It might even be in an inconsistent state because it is modified step
> > > by step using the int3 breakpoint.
> > > 
> > > The patch also fixes indentation of the touched comment.
> > > 
> > > Note that I found this problem when playing with Kprobes. I did it
> > > on x86_64 with gcc-4.8.3 that supported -mfentry. I modified
> > > samples/kprobes/kprobe_example.c and added offset 5 to put
> > > the probe right after the fentry area:
> > > 
> > > --- cut ---
> > >  static struct kprobe kp = {
> > >   .symbol_name    = "do_fork",
> > > + .offset = 5,
> > >  };
> > > --- cut ---
> > > 
> > > Then I was able to load kprobe_example before jprobe_example
> > > but not the other way around:
> > > 
> > > $> modprobe jprobe_example
> > > $> modprobe kprobe_example
> > > modprobe: ERROR: could not insert 'kprobe_example': Invalid or incomplete 
> > > multibyte or wide character
> > > 
> > > It did not make much sense and debugging pointed to the bug
> > > described above.
> > > 
> > 
> > This looks good to me :)
> > 
> > Acked-by: Masami Hiramatsu <masami.hiramatsu...@hitachi.com>
> > 
> > Ingo, could you merge this as an urgent fix?
> 
> Please, wait a bit, see below.
>  
> > Thank you!
> > 
> > > Signed-off-by: Petr Mladek <pmla...@suse.cz>
> > > ---
> > >  arch/x86/kernel/kprobes/core.c | 42 
> > > ++++++++++++++++++++++++++++--------------
> > >  1 file changed, 28 insertions(+), 14 deletions(-)
> > > 
> > > Changes against v1:
> > > 
> > >   + always use 5-byte NOP for ftrace location
> > >   + fix indentation of the touched comment
> > > 
> > > diff --git a/arch/x86/kernel/kprobes/core.c 
> > > b/arch/x86/kernel/kprobes/core.c
> > > index 98f654d466e5..2f464b56766a 100644
> > > --- a/arch/x86/kernel/kprobes/core.c
> > > +++ b/arch/x86/kernel/kprobes/core.c
> > > @@ -223,27 +223,41 @@ static unsigned long
> > >  __recover_probed_insn(kprobe_opcode_t *buf, unsigned long addr)
> > >  {
> > >   struct kprobe *kp;
> > > + unsigned long faddr;
> > >  
> > >   kp = get_kprobe((void *)addr);
> > > - /* There is no probe, return original address */
> > > - if (!kp)
> > > + faddr = ftrace_location(addr);
> 
> I have just realized that ftrace_location() might return another
> address if the given one points inside the ftrace_location.
> This situation is not checked by this patch. I am going to work
> on v3.

Well, it should not happen after all because __recover_probed_insn() is called
only for already approved Kprobe locations and therefore only for the first
byte of the ftrace location. Any address inside the ftrace location is
refused earlier by check_kprobe_address_safe() that is called from
register_kprobe.

It means that it will never return another address here and the patch
can be used as is unless you want to be paranoid.

I am sorry for the rumor.

Best Regards,
Petr

> > > + /*
> > > +  * Use the current code if it is not modified by Kprobe
> > > +  * and it cannot be modified by ftrace.
> > > +  */
> > > + if (!kp && !faddr)
> > >           return addr;
> > >  
> > >   /*
> > > -  *  Basically, kp->ainsn.insn has an original instruction.
> > > -  *  However, RIP-relative instruction can not do single-stepping
> > > -  *  at different place, __copy_instruction() tweaks the displacement of
> > > -  *  that instruction. In that case, we can't recover the instruction
> > > -  *  from the kp->ainsn.insn.
> > > +  * Basically, kp->ainsn.insn has an original instruction.
> > > +  * However, RIP-relative instruction can not do single-stepping
> > > +  * at different place, __copy_instruction() tweaks the displacement of
> > > +  * that instruction. In that case, we can't recover the instruction
> > > +  * from the kp->ainsn.insn.
> > >    *
> > > -  *  On the other hand, kp->opcode has a copy of the first byte of
> > > -  *  the probed instruction, which is overwritten by int3. And
> > > -  *  the instruction at kp->addr is not modified by kprobes except
> > > -  *  for the first byte, we can recover the original instruction
> > > -  *  from it and kp->opcode.
> > > +  * On the other hand, in case on normal Kprobe, kp->opcode has a copy
> > > +  * of the first byte of the probed instruction, which is overwritten
> > > +  * by int3. And the instruction at kp->addr is not modified by kprobes
> > > +  * except for the first byte, we can recover the original instruction
> > > +  * from it and kp->opcode.
> > > +  *
> > > +  * In case of Kprobes using ftrace, we do not have a copy of
> > > +  * the original instruction. In fact, the ftrace location might
> > > +  * be modified at anytime and even could be in an inconsistent state.
> > > +  * Fortunately, we know that the original code is the ideal 5-byte
> > > +  * long NOP.
> > >    */
> > > - memcpy(buf, kp->addr, MAX_INSN_SIZE * sizeof(kprobe_opcode_t));
> > > - buf[0] = kp->opcode;
> > > + memcpy(buf, (void *)addr, MAX_INSN_SIZE * sizeof(kprobe_opcode_t));
> > > + if (faddr)
> > > +         memcpy(buf, ideal_nops[NOP_ATOMIC5], MCOUNT_INSN_SIZE);
> > > + else
> > > +         buf[0] = kp->opcode;
> > >   return (unsigned long)buf;
> > >  }
> > >  
> > > 
> > 
> > 
> > -- 
> > Masami HIRAMATSU
> > Software Platform Research Dept. Linux Technology Research Center
> > Hitachi, Ltd., Yokohama Research Laboratory
> > E-mail: masami.hiramatsu...@hitachi.com
> > 
> > 
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to