On Tue, Jan 24, 2017 at 09:21:22PM +1100, Michael Ellerman wrote:
>Gavin Shan <gws...@linux.vnet.ibm.com> writes:
>
>> Currently, it's assumed that memcons and its output buffer are included
>> in the linear mapping. It's not true when "mem=384M" is included in
>> bootargs. The system runs into kernel crash eventually.
>>
>> # od -x /proc/device-tree/ibm,opal/ibm,opal-memcons
>> 0000000 0000 0000 0b30 0010
>> 0000010
>>
>> This validates memcons descriptor and its output buffer to ensure they
>> are valid in linear mapping. Otherwise, the interface won't be populated
>> to avoid kernel crash during system boot.
>>
>> Cc: sta...@vger.kernel.org #3.15+
>> Fixes: bfc36894a48 ("powerpc/powernv: Add OPAL message log interface")
>> Signed-off-by: Gavin Shan <gws...@linux.vnet.ibm.com>
>
>Hmm.
>
>Arguably the memcons shouldn't be in the linear map at all.
>

Thanks for the review. I agree it shouldn't be in the linear map, as it
holds logs produced by skiboot, not the kernel. Currently, all regions in
memblock (including the reserved ones) are covered by the linear mapping.
As far as I can see, there is no flag to tell whether a region was
reserved by skiboot or by the kernel itself. With such a flag, we could
skip mapping the skiboot-reserved regions into the linear mapping area in
htab_initialize() or radix_init_pgtable(). I will see if I can introduce
one without too much effort. As far as I can see, the only benefit of
making the region invisible from the linear mapping is that it prevents
stray writes to the area, and thus data corruption.

>AFAICS the kernel only ever reads from it, so really it should be
>explicitly mapped, and mapped read only.
>
>That would also fix this problem :D
>

Yes, I think your proposed solution is better. This patch just adds more
checks to avoid the problem, meaning the log is inaccessible if the buffer
is out of range. With your suggestion, it's still accessible.

I believe the log buffer should be mapped into the VMALLOC area. In order
to set up that mapping, we need page frames (struct page). However, the
page frames may not exist (e.g. with mem= in bootargs). It seems the best
option would be to map it into the IOREMAP area with ioremap_prot(). No
page frame is needed, and it allows the region to be mapped read-only.

Michael, what are your thoughts about this? :-)

Thanks,
Gavin
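
For reference, the kind of check this patch relies on is roughly a bounds test of the buffer against the end of the linear mapping. Below is a minimal userspace sketch of that idea, not the patch itself: range_in_linear_map(), check_example() and the addresses used are hypothetical and chosen only for illustration, with "limit" standing in for the end of the memory covered by the linear mapping (384MB for mem=384M).

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical helper (not a kernel API): returns true only if the
 * whole range [addr, addr + size) lies below 'limit', the assumed end
 * of the linearly mapped memory.
 */
static bool range_in_linear_map(uint64_t addr, uint64_t size, uint64_t limit)
{
	if (size == 0 || addr >= limit)
		return false;

	/* Compare against the remaining room so addr + size cannot overflow. */
	return size <= limit - addr;
}

/* Illustrative values only; the addresses are made up for the example. */
static void check_example(void)
{
	const uint64_t limit = 384ULL << 20;	/* mem=384M */

	/* A buffer well below the limit is reachable. */
	assert(range_in_linear_map(0x10000000ULL, 0x1000, limit));

	/* A buffer above the limit must be rejected. */
	assert(!range_in_linear_map(0x30000000ULL, 0x1000, limit));
}
```

With a check like this, a buffer that falls outside the limit is simply not exposed, which is why the log becomes inaccessible rather than crashing the kernel. Mapping the buffer explicitly would make the check unnecessary.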