[PHP-DEV] Question on thread safety

Andy Wharmby Wed, 29 Nov 2006 05:40:54 -0800

Hi All,

My first post on here but I have a come across a potential issuewith the PHP code and rather than just raise a defect thought it betterto solicit other

peoples views on the issue first.

I have been reviewing the PHP code recently in order to familiarizemyself with how it all fits together. Lately I have been focusing onthread safety and Ihave already raised a couple of defects on issues found in the code:

   http://bugs.php.net/bug.php?id=39623
   http://bugs.php.net/bug.php?id=39648

Other potential issues have also been identified and further defects mayfollow.

However, this email relates to a question on the design of the TSRM.ccode itself. The code in/ ts_allocate_id() /which used to allocate a newthread saferesource id is single threaded by virtue of the mutex acquired on entry.When a new resource is allocated, the code allocates an instance of thatresource

for each active thread as follows:

   /* enlarge the arrays for the already active threads */
   for (i=0; i<tsrm_tls_table_size; i++) {
       tsrm_tls_entry *p = tsrm_tls_table[i];

       while (p) {
           if (p->count < id_count) {
               int j;

p->storage = (void *) realloc(p->storage, sizeof(void*)*id_count);

               for (j=p->count; j<id_count; j++) {

p->storage[j] = (void *)malloc(resource_types_table[j].size);

                   if (resource_types_table[j].ctor) {

resource_types_table[j].ctor(p->storage[j],&p->storage);

                   }
               }
               p->count = id_count;
           }
           p = p->next;
       }
   }

The realloc() in the above code will potentially acquire a new memoryblock, copy the contents from original block and the free the original block(making it eligible for re-allocation) before returning to caller whichsaves away the new memory blocks address in the threads/ /tsrm_tls-entry/. /

Next, looking at ts_resource_ex() which is called by a thread to get itsthread local storage for a particular resource we see:


   if (!th_id) {
       /* Fast path for looking up the resources for the current
        * thread. Its used by just about every call to
        * ts_resource_ex(). This avoids the need for a mutex lock
        * and our hashtable lookup.
        */
       thread_resources = tsrm_tls_get();

       if (thread_resources) {

TSRM_ERROR((TSRM_ERROR_LEVEL_INFO, "Fetching resource id %dfor current thread %d",

           id, (long) thread_resources->thread_id));
           /* Read a specific resource from the thread's resources.

* This is called outside of a mutex, so have to be awareabout external

            * changes to the structure as we read it.
            */

TSRM_SAFE_RETURN_RSRC(thread_resources->storage, id,thread_resources->count);

       }
       thread_id = tsrm_thread_id();
   } else {
       thread_id = *th_id;
   }

This is executed WITHOUT the mutex (I assume for performance reasons)and directly accesses the same "storage" field which is modifiedby ts_allocate_id(). The comment suggests someone has thought aboutpotential problems here but I see no code here or inTSRM_SAFE_RETURN_RSRC that takes account of possible modifications tothe address in "storage".

My reading of the code as it currently stands is that there is a windowbetween the freeing of the original storage block by realloc() and thesaving away of the new memory block address in the "storage" field byts_allocate_id() during which time the address in "storage" is stale.The old memory could potentially be reallocated and modified during thiswindow. So it is possible for a thread to access its tsrm_tls_entryand read an old address for "storage"; potentially picking up theaddress of storage which may have been reallocated to another thread andmodified. If is does so then the results are unpredictable but asegmentation violation is one of most likely outcomes.

Further, on an architecture which has a weakly ordered memory model, e.gPPC, there is further potential that another thread will see a staleaddress even after the store into "storage" has been executed due toabsence of any memory barrier instructions in the code. If all access to"storage" were within a mutex then this would not be an issue as themutex enter/release provide the necessary memory synchronization butas ts_resource_ex() accesses the memory outside of a mutex their is noguarantee another thread calling ts_resource_ex() will see the result

of the store.

Now having said all that I do not believe given the current usage ofts_allocate_id() that this will cause an issue. The reason being that aquickscan of the code reveals that ts_allocate_id() is only called during PHPinitialization and extension initialization (MINIT) when the code iseffectively single threaded anyway so no thread will see any staleaddress in "storage". However, I see nothing in the code that would stop anextension writer from calling ts_allocate_id() outside of MINIT, e.g inrequest initialization (RINIT). If this did happen then problems areinevitable in ZTS enabled builds and the code should be fixed before itcauses issues which could be difficult to diagnose.

So finally to my question. Is it the intention of TSRMc. to allowts_allocate_id() to be called at any time or is there an unwritten rulethat itshould only ever called during php startup ? If its the former then Ibelieve the code is broken as it is and should be fixed before it causes anyproblems. I am happy to investigate possible solutions. If its thelatter then I believe the rule needs to be policed by code and anyattempt to call

ts_allocate_id() after startup failed gracefully.

I myself see no reason why extension writers should be restricted fromcalling ts_allocate_id() outside PHP startup so believe the code needs to befixed but I would appreciate the views of others with more experienceinto the workings of the code on how this potential problem should be

addressed before I spend time working on any possible fix.

Regards,
   Andy


Andy Wharmby
IBM United Kingdom Limited

--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php

[PHP-DEV] Question on thread safety

Reply via email to