On Sat, Apr 27, 2019 at 09:28:09PM +0200, Greg Kroah-Hartman wrote:
> On Sat, Apr 27, 2019 at 06:13:30PM +1000, Tobin C. Harding wrote:
> > (Note at bottom on reasons for 'To' list 'Cc' list)
> > 
> > Hi,
> > 
> > kobject_init_and_add() seems to be routinely misused.  A failed call to this
> > function requires a call to kobject_put() otherwise we leak memory.
> > 
> > Examples memleaks can be seen in:
> > 
> >     mm/slub.c
> >     fs/btrfs/sysfs.c
> >     fs/xfs/xfs_sysfs.h: xfs_sysfs_init()
> > 
> >  Question: Do we fix the misuse or fix the API?
> 
> Fix the misuse.
> 
> > $ git grep kobject_init_and_add | wc -l
> > 117
> > 
> > Either way, we will have to go through all 117 call sites and check them.
> 
> Yes.  Same for other functions like device_add(), that is the "pattern"
> those users must follow.
> 
> > I
> > don't mind fixing them all but I don't want to do it twice because I chose 
> > the
> > wrong option.  Reaching out to those more experienced for a suggestion 
> > please.
> > 
> > Fix the API
> > -----------
> > 
> > Typically init functions do not require cleanup if they fail, this argument
> > leads to this patch
> > 
> > diff --git a/lib/kobject.c b/lib/kobject.c
> > index aa89edcd2b63..62328054bbd0 100644
> > --- a/lib/kobject.c
> > +++ b/lib/kobject.c
> > @@ -453,6 +453,9 @@ int kobject_init_and_add(struct kobject *kobj, struct 
> > kobj_type *ktype,
> >     retval = kobject_add_varg(kobj, parent, fmt, args);
> >     va_end(args);
> >  
> > +   if (retval)
> > +           kobject_put(kobj);
> > +
> >     return retval;
> >  }
> >  EXPORT_SYMBOL_GPL(kobject_init_and_add);
> 
> I would _love_ to do this, but realize what a kobject really is.
> 
> It's just a "base object" that is embedded inside of some other object.
> The kobject core has no idea what is going on outside of itself.  If the
> kobject_init_and_add() function fails, it can NOT drop the last
> reference on itself, as that would cause the memory owned by the _WHOLE_
> structure the kobject is embedded in, to be freed.
> 
> And the kobject core can not "know" that something else needed to be
> done _before_ that memory could be freed.  What if the larger structure
> needs to have some other destructor called on it first?  What if
> some other api initialization needs to be torn down.
> 
> As an example, consider this code:
> 
> struct foo {
>       struct kobject kobj;
>       struct baz *baz;
> };
> 
> void foo_release(struct kobject *kobj)
> {
>       struct foo *foo = container_of(kobj, struct foo, kobj);
>       kfree(foo);
> }
> 
> struct kobj_type foo_ktype = {
>       .release = foo_release,
> };
> 
> struct foo *foo_create(struct foo *parent, char *name)
> {
>       struct *foo;
> 
>       foo = kzalloc(sizeof(*foo), GFP_KERNEL);
>       if (!foo)
>               return NULL;
> 
>       foo->baz = baz_create(name);
>       if (!foo->baz)
>               return NULL;
> 
>       ret = kobject_init_and_add(&foo->kobj, foo_ktype, &parent->kobj, 
> "foo-%s", name);
>       if (ret) {
>               baz_destroy(foo->baz);
>               kobject_put(&foo->kobj);
>               return NULL;
>       }
> 
>       return foo;
> }
> 
> void foo_destroy(struct foo *foo)
> {
>       baz_destroy(foo->baz);
>       kobject_del(&foo->kobj);
        kojbect_put(&foo->kobj);
> }

Does this need this extra call to kobject_put()?  Then foo_create()
leaves foo with a refcount of 1 and foo_destroy drops that refcount.

Thanks for taking the time to explain this stuff.

thanks
Tobin.


Leaving below for reference.

> Now if kobject_init_and_add() had failed, and called kobject_put() right
> away, that would have freed the larger "struct foo", but not cleaned up
> the reference to the baz pointer.
> 
> Yes, you can move all of the other destruction logic into the release
> function, to then get rid of baz, but that really doesn't work in the
> real world as there are times you want to drop that when you "know" you
> can drop it, not when the last reference goes away as those are
> different lifecycles.
> 
> Same thing goes for 'struct device'.  It too is a kobject, so think
> about if the driver core's call to initialize the kobject failed, would
> it be ok at that exact moment in time to free everything?
> 
> Look at the "joy" that is device_add().  If kobject_add() fails, we have
> to clean up the glue directory that we had created, _before_ we can then
> call put_device().  Then stack another layer on top of that, look at
> usb_new_device().  If the call to device_add() fails, it needs to do
> some housekeeping before it can drop the last reference to the device to
> free the memory up.

Reply via email to