On Thu, Aug 07, 2025 at 02:51:29PM +0530, Naresh Kamboju wrote:
> Hi Dan,
> 
> On Wed, 6 Aug 2025 at 20:24, Dan Carpenter <dan.carpen...@linaro.org> wrote:
> >
> > On Tue, Aug 05, 2025 at 12:50:28AM +0530, Naresh Kamboju wrote:
> > > While booting and testing selftest cgroups and filesystem testing on arm64
> > > dragonboard-410c the following kernel warnings / errors noticed and system
> > > halted and did not recover with selftests Kconfig enabled running the kernel
> > > Linux next tag next-20250804.
> > >
> > > Regression Analysis:
> > > - New regression? Yes
> > > - Reproducibility? Re-validation is in progress
> > >
> > > First seen on the next-20250804
> > > Good: next-20250801
> > > Bad: next-20250804
> > >
> > > Test regression: next-20250804 Unable to handle kernel execute from
> > > non-executable memory at virtual address idem_hash
> > > Test regression: next-20250804 refcount_t: addition on 0;
> > > use-after-free refcount_warn_saturate
> > >
> > > Reported-by: Linux Kernel Functional Testing <l...@linaro.org>
> > >
> > > ## Test crash log
> > > [    9.811341] Unable to handle kernel NULL pointer dereference at
> > > virtual address 000000000000002e
> > > [    9.811444] Mem abort info:
> > > [    9.821150]   ESR = 0x0000000096000004
> > > [    9.833499]   SET = 0, FnV = 0
> > > [    9.833566]   EA = 0, S1PTW = 0
> > > [    9.835511]   FSC = 0x04: level 0 translation fault
> > > [    9.838901] Data abort info:
> > > [    9.843788]   ISV = 0, ISS = 0x00000004, ISS2 = 0x00000000
> > > [    9.846565]   CM = 0, WnR = 0, TnD = 0, TagAccess = 0
> > > [    9.851938]   GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
> > > [    9.853510] rtc-pm8xxx 200f000.spmi:pmic@0:rtc@6000: registered as rtc0
> > > [    9.856992] user pgtable: 4k pages, 48-bit VAs, pgdp=00000000856f8000
> > > [    9.862446] rtc-pm8xxx 200f000.spmi:pmic@0:rtc@6000: setting system
> > > clock to 1970-01-01T00:00:31 UTC (31)
> > > [    9.868789] [000000000000002e] pgd=0000000000000000, p4d=0000000000000000
> > > [    9.875459] Internal error: Oops: 0000000096000004 [#1]  SMP
> > > [    9.889547] input: pm8941_pwrkey as
> > > /devices/platform/soc@0/200f000.spmi/spmi-0/0-00/200f000.spmi:pmic@0:pon@800/200f000.spmi:pmic@0:pon@800:pwrkey/input/input1
> > > [    9.891545] Modules linked in: qcom_spmi_temp_alarm rtc_pm8xxx
> > > qcom_pon(+) qcom_pil_info videobuf2_dma_sg ubwc_config qcom_q6v5
> > > venus_core(+) qcom_sysmon qcom_spmi_vadc v4l2_fwnode llcc_qcom
> > > v4l2_async qcom_vadc_common qcom_common ocmem v4l2_mem2mem drm_gpuvm
> > > videobuf2_memops qcom_glink_smem videobuf2_v4l2 drm_exec mdt_loader
> > > qmi_helpers gpu_sched drm_dp_aux_bus qnoc_msm8916 videodev
> > > drm_display_helper qcom_stats videobuf2_common cec qcom_rng
> > > drm_client_lib mc phy_qcom_usb_hs socinfo rpmsg_ctrl display_connector
> > > rpmsg_char ramoops rmtfs_mem reed_solomon drm_kms_helper fuse drm
> > > backlight
> > > [    9.912286] input: pm8941_resin as
> > > /devices/platform/soc@0/200f000.spmi/spmi-0/0-00/200f000.spmi:pmic@0:pon@800/200f000.spmi:pmic@0:pon@800:resin/input/input2
> > > [    9.941186] CPU: 2 UID: 0 PID: 221 Comm: (udev-worker) Not tainted
> > > 6.16.0-next-20250804 #1 PREEMPT
> > > [    9.941200] Hardware name: Qualcomm Technologies, Inc. APQ 8016 SBC (DT)
> > > [    9.941206] pstate: 60000005 (nZCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
> > > [    9.941215] pc : dev_pm_opp_put (/builds/linux/drivers/opp/core.c:1685)
> > > [    9.941233] lr : core_clks_enable+0x54/0x148 venus_core
> > > [   10.004266] sp : ffff8000842b35f0
> > > [   10.004273] x29: ffff8000842b35f0 x28: ffff8000842b3ba0 x27: ffff0000047be938
> > > [   10.004289] x26: 0000000000000000 x25: 0000000000000000 x24: ffff80007b350ba0
> > > [   10.004303] x23: ffff00000ba380c8 x22: ffff00000ba38080 x21: 0000000000000000
> > > [   10.004316] x20: 0000000000000000 x19: ffffffffffffffee x18: 00000000ffffffff
> > > [   10.004330] x17: 0000000000000000 x16: 1fffe000017541a1 x15: ffff8000842b3560
> > > [   10.004344] x14: 0000000000000000 x13: 007473696c5f7974 x12: 696e696666615f65
> > > [   10.004358] x11: 00000000000000c0 x10: 0000000000000020 x9 : ffff80007b33f2bc
> > > [   10.004371] x8 : ffffffffffffffde x7 : ffff0000044a4800 x6 : 0000000000000000
> > > [   10.004384] x5 : 0000000000000002 x4 : 00000000c0000000 x3 : 0000000000000001
> > > [   10.004397] x2 : 0000000000000002 x1 : ffffffffffffffde x0 : ffffffffffffffee
> > > [   10.004412] Call trace:
> > > [   10.004417] dev_pm_opp_put (/builds/linux/drivers/opp/core.c:1685) (P)
> > > [   10.004435] core_clks_enable+0x54/0x148 venus_core
> > > [   10.004504] core_power_v1+0x78/0x90 venus_core
> > > [   10.004560] venus_runtime_resume+0x6c/0x98 venus_core
> > > [   10.004616] pm_generic_runtime_resume
> >
> > Could you try adding some error checking to core_clks_enable()?
> > Does the patch below help?
> 
> Your patch works.
> The attached patch from Sasha fixes this reported problem on today's
> Linux next tag.
> 
> $ git log --oneline next-20250805..next-20250807 --
> drivers/media/platform/qcom/venus/pm_helpers.c
> 7881cd6886a89 media: venus: Fix OPP table error handling
> 

I feel a bit bad about this, because I saw this bug as a static
checker warning:

drivers/media/platform/qcom/venus/pm_helpers.c:51 core_clks_enable() error: 'opp' dereferencing possible ERR_PTR()

But I figured that leaving out the error checking was probably
deliberate so I didn't report it...  I'll go through my list of old
warnings and review them again.

$ grep "dereferencing possible ERR_PTR" smatch_warns.txt | wc -l
115
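
For anyone following along, the bug class behind that warning looks
roughly like the sketch below.  This is a userspace illustration only,
not the venus driver code and not the patch that was applied.  ERR_PTR(),
PTR_ERR() and IS_ERR() are re-implemented locally to mirror the kernel
helpers, and fake_opp, fake_find_freq_ceil(), fake_opp_put() and
enable_clk() are hypothetical stand-ins.  The choice of -ERANGE is an
assumption: x1 and x8 in the oops hold 0xffffffffffffffde, which is -34
as a signed 64-bit value, so it looks plausible, but I have not confirmed
which errno the real lookup returned.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Local copies of the kernel's error-pointer helpers (userspace sketch). */
#define MAX_ERRNO 4095

static inline void *ERR_PTR(long error) { return (void *)(intptr_t)error; }
static inline long PTR_ERR(const void *ptr) { return (long)(intptr_t)ptr; }
static inline int IS_ERR(const void *ptr)
{
	/* Error pointers live in the top MAX_ERRNO bytes of the address space. */
	return (uintptr_t)ptr >= (uintptr_t)-MAX_ERRNO;
}

/* Hypothetical stand-in for an OPP entry; not the kernel's struct. */
struct fake_opp {
	unsigned long rate;
	int refcount;
};

static struct fake_opp table[] = {
	{ 100000000UL, 1 },
	{ 200000000UL, 1 },
};

/*
 * Hypothetical lookup: return the lowest entry with rate >= *freq and take
 * a reference, or ERR_PTR(-ERANGE) (assumed errno) if nothing matches.
 */
static struct fake_opp *fake_find_freq_ceil(unsigned long *freq)
{
	for (size_t i = 0; i < sizeof(table) / sizeof(table[0]); i++) {
		if (table[i].rate >= *freq) {
			*freq = table[i].rate;
			table[i].refcount++;
			return &table[i];
		}
	}
	return ERR_PTR(-ERANGE);
}

/* Hypothetical put: dereferences opp, so it must never see an ERR_PTR. */
static void fake_opp_put(struct fake_opp *opp)
{
	assert(!IS_ERR(opp));
	opp->refcount--;
}

/* Check the lookup result before handing it to the put. */
int enable_clk(unsigned long freq)
{
	struct fake_opp *opp = fake_find_freq_ceil(&freq);

	if (IS_ERR(opp))
		return (int)PTR_ERR(opp);

	fake_opp_put(opp);
	return 0;
}
```

Without the IS_ERR() check, the put would dereference an address a few
bytes below the top of the address space, which is the general shape of
the fault in the log above.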

regards,
dan carpenter

