On Mon, 13 Jun 2022 14:48:45 +0500
Sarosh Arif <sarosh.a...@emumba.com> wrote:

> Thank you for help, I'll do it this way.
> 
> On Sat, Jun 11, 2022 at 9:25 PM Mattias Rönnblom <hof...@lysator.liu.se> 
> wrote:
> >
> > On 2022-06-10 08:04, Sarosh Arif wrote:  
> > > On Thu, Jun 9, 2022 at 8:26 PM Stephen Hemminger
> > > <step...@networkplumber.org> wrote:  
> > >>
> > >> On Thu, 09 Jun 2022 12:47:43 +0000
> > >> bugzi...@dpdk.org wrote:
> > >>  
> > >>> https://bugs.dpdk.org/show_bug.cgi?id=1030
> > >>>
> > >>>              Bug ID: 1030
> > >>>             Summary: rte_malloc() and rte_free() get stuck when used 
> > >>> with
> > >>>                      signal handler
> > >>>             Product: DPDK
> > >>>             Version: 22.03
> > >>>            Hardware: All
> > >>>                  OS: Linux
> > >>>              Status: UNCONFIRMED
> > >>>            Severity: normal
> > >>>            Priority: Normal
> > >>>           Component: core
> > >>>            Assignee: dev@dpdk.org
> > >>>            Reporter: sarosh.a...@emumba.com
> > >>>    Target Milestone: ---
> > >>>
> > >>> Created attachment 205  
> > >>>    --> https://bugs.dpdk.org/attachment.cgi?id=205&action=edit  
> > >>> calls rte_malloc and rte_free in the handler and main code
> > >>>
> > >>> I have a dpdk based application which uses rte_malloc() and rte_free()
> > >>> frequently in it's main code. The general method to close the 
> > >>> application is
> > >>> though sending SIGINT. The application has a signal handler written for 
> > >>> cleanup
> > >>> purposes before closing the application. The handler also uses 
> > >>> rte_free() to
> > >>> release some of the memory during cleanup. The application gets stuck 
> > >>> in a
> > >>> deadlock.
> > >>>
> > >>>
> > >>> Upon investigation I found out that both rte_free() and rte_malloc() use
> > >>> rte_spinlock_lock() function to place a lock on heap. While this lock 
> > >>> is placed
> > >>> and the application receives SIGINT, it goes into the handler without 
> > >>> releasing
> > >>> the lock. Since the handler itself calls rte_free() which tries to 
> > >>> acquire the
> > >>> lock it gets stuck.
> > >>>
> > >>>
> > >>> I have attached a sample application to reproduce this problem.
> > >>>
> > >>>
> > >>> Steps to reproduce this problem:
> > >>>
> > >>> 1. compile the code provided in attachment with any version of dpdk
> > >>> 2. run the compiled binary
> > >>> 3. press ctrl+c till the prints stop
> > >>>
> > >>> Actual Results:
> > >>> The application gets stuck in either rte_free() or rte_malloc()
> > >>>
> > >>> Expected Results:
> > >>> Application should allocate and free the memory without getting stuck
> > >>>  
> > >>
> > >> rte_malloc and rte_free are not async sigsafe()
> > >>  
> > > Oh, I did not know that. This should be mentioned in the documentation.  
> >
> > Is there anything except <rte_atomic.h> that is/should be async-signal-safe?
> >  
> > >> but then again regular glibc is not either.  
> > > Memory allocated with glibc malloc() is freed by itself upon closing
> > > the application. My application runs as a secondary process, and it
> > > needs to use rte_malloc() specifically because the memory should be
> > > shared between the two processes. If I don't free it upon closure it
> > > would just be leaked. Is there any other solution for it?  
> >
> > The standard solution is that the signal handler using some appropriate,
> > async-signal-safe way talks to the main thread, which then goes on to
> > cleanly terminate the application.
> >
> > A write() to an fd, or an atomic store to a flag are two options.  

Patch is pending (why is it not merged?) to describe what is signal safe.
https://patchwork.dpdk.org/project/dpdk/patch/20220711230448.557715-1-step...@networkplumber.org/

Reply via email to