> -----Original Message-----
> From: Guilherme G. Piccoli [mailto:gpicc...@linux.vnet.ibm.com]
> Sent: Friday, December 22, 2017 8:32 PM
> To: Elior, Ariel <ariel.el...@cavium.com>; Dept-Eng Everest Linux L2 <Dept-
> engeverestlinu...@cavium.com>
> Cc: netdev@vger.kernel.org; gpicc...@linux.vnet.ibm.com;
> gpicc...@protonmail.ch
> Subject: [PATCH net] bnx2x: Improve reliability in case of nested PCI errors
> 
> While in recovery process of PCI error (called EEH on PowerPC arch), another
> PCI transaction could be corrupted causing a situation of nested PCI errors. 
> Also,
> this scenario could be reproduced with error injection mechanisms (for debug
> purposes).
> 
> We observe that in case of nested PCI errors, bnx2x might attempt to 
> initialize
> its shmem and cause a kernel crash due to bad addresses read from MCP.
> Multiple different stack traces were observed depending on the point the 
> second
> PCI error happens.
> 
> This patch avoids the crashes by:
> 
>  * failing PCI recovery in case of nested errors (since multiple  PCI errors 
> in a row
> are not expected to lead to a functional  adapter anyway), and by,
> 
>  * preventing access to adapter FW when MCP is failed (we mark it as  failed
> when shmem cannot get initialized properly).
> 
> Reported-by: Abdul Haleem <abdha...@linux.vnet.ibm.com>
> Signed-off-by: Guilherme G. Piccoli <gpicc...@linux.vnet.ibm.com>

Acked-by: Shahed Shaikh <shahed.sha...@cavium.com>

Thanks,
Shahed

Reply via email to