Re: [PATCH v4] PCI: hotplug: Add a generic RAS tracepoint for hotplug event

2025-01-08 Thread Bjorn Helgaas
On Wed, Jan 08, 2025 at 05:04:25PM +0800, Shuai Xue wrote: > 在 2025/1/8 07:19, Bjorn Helgaas 写道: > > On Sat, Nov 23, 2024 at 07:31:08PM +0800, Shuai Xue wrote: > > > Hotplug events are critical indicators for analyzing hardware health, > > > particularly in AI superco

Re: [PATCH v4] PCI: hotplug: Add a generic RAS tracepoint for hotplug event

2025-01-07 Thread Bjorn Helgaas
On Sat, Nov 23, 2024 at 07:31:08PM +0800, Shuai Xue wrote: > Hotplug events are critical indicators for analyzing hardware health, > particularly in AI supercomputers where surprise link downs can > significantly impact system performance and reliability. The failure > characterization analysis ill