Hi Boris, Hi James, >-----Original Message----- >From: Borislav Petkov [mailto:b...@alien8.de] >Sent: 01 October 2020 18:31 >To: James Morse <james.mo...@arm.com> >Cc: Shiju Jose <shiju.j...@huawei.com>; linux-e...@vger.kernel.org; linux- >a...@vger.kernel.org; linux-kernel@vger.kernel.org; tony.l...@intel.com; >r...@rjwysocki.net; l...@kernel.org; Linuxarm <linux...@huawei.com> >Subject: Re: [PATCH 1/1] RAS: Add CPU Correctable Error Collector to isolate >an erroneous CPU core > >On Thu, Oct 01, 2020 at 06:16:03PM +0100, James Morse wrote: >> If the corrected-count is available somewhere, can't this policy be >> made in user-space? > >You mean rasdaemon goes and offlines CPUs when certain thresholds are >reached? Sure. It would be much more flexible too.
I will send the kernel changes for existing CEC to support the CPU CE errors. Can you please have a look? Thanks, Shiju > >-- >Regards/Gruss, > Boris. > >https://people.kernel.org/tglx/notes-about-netiquette