Hello, On Fri, Sep 19, 2025 at 09:51:15AM -0600, Simon Glass wrote: > Hi Michal, > > On Wed, 13 Aug 2025 at 05:41, Michal Suchánek <msucha...@suse.de> wrote: > > > > Hello, > > > > On Wed, Aug 13, 2025 at 11:58:19AM +0100, Andrew Goodbody wrote: > > > On 12/08/2025 16:03, Tom Rini wrote: > > > > On Tue, Aug 12, 2025 at 03:46:56PM +0100, Andrew Goodbody wrote: > > > > > On 12/08/2025 15:33, Tom Rini wrote: > > > > > > On Tue, Aug 12, 2025 at 10:17:47AM +0100, Andrew Goodbody wrote: > > > > > > > On 11/08/2025 17:36, Quentin Schulz wrote: > > > > > > > > Hi Andrew, > > > > > > > > > > > > > > > > On 8/11/25 5:24 PM, Andrew Goodbody wrote: > > > > > > > > > Hi, > > > > > > > > > > > > > > > > > > I was wondering what people's thoughts were on API return > > > > > > > > > types. In > > > > > > > > > particular there is this and other examples in > > > > > > > > > include/clk-uclass.h > > > > > > > > > > > > > > > > > > /** > > > > > > > > > * get_rate() - Get current clock rate. > > > > > > > > > * @clk: The clock to query. > > > > > > > > > * > > > > > > > > > * This returns the current rate of a clock. If the clock > > > > > > > > > is > > > > > > > > > disabled, it > > > > > > > > > * returns the rate at which the clock would run if it was > > > > > > > > > enabled. The > > > > > > > > > * following pseudo-code should hold:: > > > > > > > > > * > > > > > > > > > * disable(clk) > > > > > > > > > * rate = get_rate(clk) > > > > > > > > > * enable(clk) > > > > > > > > > * assert(get_rate(clk) == rate) > > > > > > > > > * > > > > > > > > > * Return: > > > > > > > > > * * The rate of @clk > > > > > > > > > * * -%ENOSYS if this function is not implemented for @clk > > > > > > > > > * * -%ENOENT if @clk->id is invalid. Prefer using an > > > > > > > > > assert > > > > > > > > > instead, and doing > > > > > > > > > * this check in request(). > > > > > > > > > * * Another negative error value (such as %EIO or %ECOMM) > > > > > > > > > if the > > > > > > > > > rate could > > > > > > > > > * not be determined due to a bus error. > > > > > > > > > */ > > > > > > > > > ulong get_rate(struct clk *clk); > > > > > > > > > > > > > > > > > > > > > > > > > > > get_rate is declared as returning a ulong but the description > > > > > > > > > says > > > > > > > > > that it can return negative errors. A simple test of the > > > > > > > > > return > > > > > > > > > value for being less than 0 will always fail so errors can go > > > > > > > > > undetected. Casting to a signed type seems less than ideal. > > > > > > > > > > > > > > > > > > What is the best way to deal with this? Cast to a signed or > > > > > > > > > update > > > > > > > > > the API to be signed or...? > > > > > > > > > > > > > > > > > > > > > > > > > Note that clk_get_rate() in the kernel has the same function > > > > > > > > signature > > > > > > > > so I would refrain from changing the type otherwise we'll have > > > > > > > > some > > > > > > > > "funny" bugs to handle considering it isn't that uncommon to > > > > > > > > import > > > > > > > > drivers almost as-is from the Linux kernel. > > > > > > > > > > > > > > Ah yes. The difference being that the kernel does not seem to > > > > > > > attempt to > > > > > > > push an error code through this API, you get a rate or you get 0. > > > > > > > > > > > > How is the error code pushed? Or is it up to the caller to decide > > > > > > that 0 > > > > > > means on a case by case basis? > > > > Presumably it returns -EBUSY which is then implicitly converted to a large > > positive value. For pointers there is IS_ERR macro to decode that, and > > for integers there is another corresponding macro (I don't recall the > > name). > > > > For memory this assumes that the last page that holds the 'error' > > pointers cannot be mapped, and there was already a bug in the kernel > > that this was not true. > > > > For clock rates this assumes that rates close to the maximum value are > > invalid which is not guaranteed in any way whatsoever. > > > > Consequently, this should not be used. > > > > 0 is the only clock rate guaranteed to be invalid. > > > > As far as I have seen in the u-boot code almost no caller checks the > > result, and those that do cannot do anything meaningful with it anyway, > > at most a message is printed. That can be done by the driver without > > passing around the error. > > > > > > > In the Linux kernel almost no code checks the return of clk_get_rate > > > > > at all. > > > > > Some code will check the value is sensible and 0 is obviously not > > > > > sensible. > > > > > But pretty much the call to clk_get_rate is not expected to fail. > > > > > > > > Perhaps getting lost in the specifics then, but perhaps for this case we > > > > should just do the same? But your question was more general, so another > > > > example might help. > > > > > > I suspect that the answer is always going to that it depends on the > > > specifics of each case. The U-Boot implementation of clk_get_rate seems to > > > have become more complicated leading to the addition of returning error > > > codes. This leads to the question about what level of compatibility should > > > there be maintained with the kernel? That addition of returning error > > > codes > > > in U-Boot already means that the API is no longer compatible with that of > > > the kernel. > > > > I would suggest to abolish returning 'errors' as clock rates. > > I don't think that is a good idea. Debugging drivers then becomes > quite complicated since there is no feedback and you must head into > the clock subsystem to see what is going on. Bring this into U-Boot > more generally and it will be very painful. At the moment at least you > can enable CONFIG_LOG_ERROR_RETURN and see what happened.
Does that really work with unsigned return values containing negative error values as large positive integers? > Better I think to create some new functions with the API you want > (e.g. panic on failure, ignore failure). Or rename clk_get_rate_int() > - for internal - and create a new clk_get_rate() that does what you > want / what Linux does. The problem is that once an error is returned as 'clock rate' there is really no way to determine if what you got is really a clock rate or an error. You can assume that the values IS_ERR decodes as errors are not clock rates but there is no guarantee that is the case. While IS_ERR only uses small part of the avaiale range the valid values are not spread evenly over the range, and hardware-specific. Thanks Michal