[ANN] TransactionKit, Lockless Multi-Reader, Multi-Writer Transaction Capable Hash Tables

John Engelhart Tue, 22 Apr 2008 16:49:15 -0700

All,

I've recently released the first version of TransactionKit, which ismade of two main components: The core library, and a Foundationcompatibility API layer.


Homepage: http://transactionkit.sourceforge.net/
Documentation: http://transactionkit.sourceforge.net/Documentation/index.html
SVN trunk is available at: 
http://transactionkit.svn.sourceforge.net/svnroot/transactionkit/trunk

TransactionKit is made available under a 3-clause BSD License.

As an aside, and begging the karma gods for forgiveness, I'm currentlylooking for a job. I obviously would like to find work doing Cocoaprogramming, but I also have extensive experience in networking (alasr. backbone engineer at a top tier 1 provider). Physically in thegreater Toronto (CA) metro area, US citizen, legal to work in both CAand US. Working remotely has a certain appeal. :)

The core library is mostly C based, with a bit of Objective-C in theappropriate places for Objective-C compatibility. The Objective-Cparts in the core library are optional, and can be easily strippedaway leaving just a C based core.

The Foundation API compatibility layer replicates the C based (not thenewer 10.5 Objective-C based) NSHashTable and NSMapTable, along with asubclass / re-implementation of NSDictionary and NSMutableDictionary.

The development environment has been Mac OS X 10.5 on a G4 PPC system,though I expect it should work effortlessly on any 10.5 system. Theonly thing I can think of off the top of my head that would prevent10.4 use is the fact that the Dictionary clones implementNSFastEnumeration, other than that I can't think of any 10.5 specificfeatures it makes use of. 10.5 Garbage Collection is not supportednor are their plans to- I have simply not been able to get 10.5's GCsystem to work reliably under the grueling punishment of the syntheticstress tests that TransactionKit places on the proper accounting ofresources by the memory allocation system. This has not been from alack of trying, I've just found the 10.5 GC system to be buggy andunreliable. Examples include: compiler bugs that don't always insertwrite barriers (ala a typdefed struct assignment, such asMyExampleType = *initExampleType), or that the functionsobjc_atomicCompareAndSwapGlobalBarrier (and friends) were essentiallyno-ops until 10.5.2 (specifically, objc4-371.1). Personally, I'vefound the 10.5 GC system extremely non-intuitive and ultimatelyimpossible to correctly use in practice. I think the section on "TheCosts of Precise Pointer Identification" in http://www.hpl.hp.com/personal/Hans_Boehm/gc/conservative.htmlsummarize many of the objections and problems I've encountered.

Full disclosure: If you're not familiar with the implications of"lockless" and "multithreading" combined in the same sentence, youshould know that this library attempts to deal with some notoriouslydifficult to get right problems. The common methodology inmultithreading programming is to employ a mutual exclusion lock arounda data structure to prevent simultaneous modifications by multiplethreads at the same time. TransactionKit uses a novel, experimentalmeans to allow lockless modifications by multiple threadsconcurrently. While some concurrent multiple writer data structuresdo exist, most are generally academic research grade problems /solutions. TransactionKit is definitely a prototype and experimentalin nature at this time.

- There are almost certainly bugs. - While certainly unintentional, Ithink its proper to set your expectations now. I think for most'simple' things, things should be mostly bug free. Corner cases andthe complicated nuances that crop up during multithreading concurrentuse is where most of the bugs probably are, especially in dealing withthe concept of "when" things happens relative to transaction start,commit, and rollback times.

While the ultimate goal is to have a highly portable C "core" and anObjective-C layer built on top of that, for right now it realisticallyonly works on Mac OS X. This is probably OK considering theaudience :). At this point, the C library is largely undocumented,but it really isn't that complicated: create a table, free a table,insert, get, and remove a key, along with begin, commit, and rollbacka transaction. Thats about it, really, and won't be further coveredhere. The rest of this is regarding the Objective-C portion.

The Objective-C portion, specifically the Dictionary clones, areessentially straight, method for method, re-implementations of theirfoundation Dictionary counterparts. There are two classes,TKDictionary and TKMutableDictionary. TKDictionary is a subclass ofNSDictionary, and TKMutableDictionary is a subclass of TKDictionary.They "should" be drop in replacements for their Foundationequivalents, and interoperate invisibly with each other. Wherenecessary, the TransactionKit Dictionaries determine the base classtype and take the appropriate actions, such as in +dictionaryWithDictionary: or - isEqual:.

For now, no additional functionality is added to TransactionKitsDictionary replications, such as exposing the underling transactioncapabilities of the core library. The methods that perform thestoring or retrieval of keys/objects in the dictionary have obviouslybeen replaced with methods that use a TransactionKit backed hashtable, but some of the other Foundation Dictionary methods areperformed by creating a Foundation Dictionary clone of the currentTransactionKit Dictionary and using the clone as a stand in. This isdone for such functions as + dictionaryWithContentsOfFile: and -description.

The area where the largest differences between Foundations andTransactionKits Dictionaries occurs is in the case of mutating theMutable variety of Dictionaries and the specifics of enumerating theMutable varieties contents. For example, it is illegal to mutate thecontents of a NSMutableDictionary under enumeration by any means,either NSEnumerator or NSFastEnumeration. With TransactionKit, it'sperfectly safe to mutate the contents of a dictionary thats beingenumerated with no ill effects. For example:

NSMutableDictionary *mutableDictionary = [NSMutableDictionarydictionary];

[mutableDictionary setObject:@"object 1" forKey:@"key 1"];
[mutableDictionary setObject:@"object 2" forKey:@"key 2"];

for(id obj in mutableDictionary) {
 NSLog(@"Object: %@", obj);
 [mutableDictionary removeObjectForKey:@"key 1"];
 [mutableDictionary removeObjectForKey:@"key 2"];
}

Using a NSMutableDictionary, this will obviously throw a mutationexception on the second iteration of the loop. Switching the firstline to:

TKMutableDictionary *mutableDictionary = [TKMutableDictionarydictionary];

and the situation changes: All keys are properly enumerated, eventhough all the keys are removed on the first iteration. For brevity,only two keys are used in the example, but it extends to any amount ofkeys contained in the dictionary. At the moment that enumerationbegins, or in this NSFastEnumeration case the first call to -countByEnumeratingWithState:objects:count:, a snapshot of the contentsof the dictionary are taken (really, wrapped in a transaction), andthe contents of that frozen snapshot is what is enumerated. Since thekey removal happens after the point in time of the initial snapshot,it does not effect the keys that are to be enumerated.

While the above example demonstrates that it is possible to mutate thecontents of a dictionary under enumeration by the thread that isperforming the enumeration, the same principle holds true if themutator is instead a different thread. In other words, thread A canbegin an enumeration, and then thread B can mutate the dictionary inthe middle of thread As enumeration, causing no ill effects. Safe,multithreaded concurrent reader and writer access toMutableDictionaries without any complicated locking protocol to observe.

Note: There is an obvious deficiency in the above example. Whileenumeration will successfully extract all the keys from the point intime of the enumerations start, there is no practical way to retrievethe objects from the dictionary relative to the enumerationssnapshot. Again, this goes back to "adding APIs for functionality theoriginal never anticipated": NSFastEnumeration protocol provides noeasy way to pass 'additional' information back to the invokingfor...in loop. Using NSFastEnumeration, I could think of no easy wayto gain access to the underlying transaction related to enumeration inprogress. For the time being, this point is "deferred for furtherconsideration." On the other hand, it's probably a trivial matter toadd this functionality to the NSEnumeration based enumerators. Itseasy to imagine creating a key based NSEnumerator, then enumeratingthe keys of the enumerator as one normally would. When an object fora key was needed, one could call an additional method of the keyenumerator to retrieve the object using the NSEnumeratorstransaction. Again, due to the prototype and experimental state ofTransactionKit, these types of additions are postponed until later.

For those wondering what the magic is behind the lockless multireader,multiwriter hash tables is, it's actually pretty simple. While Ihaven't done any extensive research to find out if the techniques usedare truly novel, what digging I did do didn't turn up anythingrelated. All of the individual concepts have been put forth before,it just seems that no one combined things the way I did up till now,which is kind of surprising consider how "obvious" it is. I'm nottrying to make any fantastic claims that I've 'invented' somethingnovel, only that I couldn't find any references that describes thetechnique I used.

The short version is this: TransactionKit uses a standard hash tablethat has Multi-version Concurrency Control (MVCC). While there aremany databases that use MVCC, I could not find any references to anyuses of MVCC in a lighter weight data structure such as a hash table.For a quick overview of MVCC, see http://en.wikipedia.org/wiki/Multiversion_concurrency_control.

The basic concept is such: MVCC uses a 'timestamp' to recordmutations. This is how TransactionKit gains its transactioncapability, with full begin, commit, and rollback capability. Atimestamp in this case is really just an atomically incrementinginteger, a very common primitive available on virtually every modernarchitecture. With a given timestamp, it is possible to "view" thecontents of the hash table as it was when that timestamp was active.Later timestamps (mutations) are simply ignored as they happen "after"the timestamp being considered.

The hash table is your generic "collisions are chained" hash table.The hash chain is really a LIFO, or stack, in practice. Virtually allmodern architectures can perform a single "natural word" compare andswap operation atomically, and for practical purposes it's assumedthat a "natural word" is equal in size to a pointer. Using thisprimitive, it's possible to create a LIFO stack that atomically pushesand pops elements from the stack. In fact, Mac OS X provides a set offunctions for performing these very operations (OSAtomicEnqueue,OSAtomicDequeue, see `man atomic`). This allows items to be added tothe hash table atomically, but using this technique only the item onthe top of the stack can be removed atomically. Items in the middlecan't be removed atomically as it requires being able to alter morethan a single "natural word" atomically, something generally notpossible.

The "trick" to being able to dequeue items that are in the middle of ahash chain atomically is to break the steps necessary to dequeue anitem in to steps that can be performed atomically. Reaching back toMVCC, we have a convenient, agreed to be all idea of "time" and whenthings happened. The individual steps that need to be performedadvance the MVCC timestamp counter by one, and the thread attemptingto perform an individual step performs an atomic compare and swap ofthe location used to record when the event took place with the newtimestamp. If it "wins" the CAS, it can perform the actual operation.

The second half of the "trick" is to keep track of the lowest (oldest)possible transaction timestamp that is outstanding. Once the lowestpossible transaction outstanding is greater than a time stamp inquestion it is guaranteed that no thread can possible be referencingthe value that was in place before the timestamp update, and the "nextatomic step" can take place.

That's the basics, in a nutshell. It uses simple, commonly availableatomic primitives to create a lockless, multi-reader, multi-writerhash table that also happens to support begin, commit, and rollbackstyle transactions. While there is certainly some impact to keepingtrack of all the transaction housekeeping, performance is still prettygood. Some benchmarks that I did comparing NSMapTables againstTKMapTables, with NSMapTables guarded with an OSSpinLock, showTransactionKit to be fairly competitive. Using NSInteger callbacks(keys and values nothing more than NSIntegers) and 16 threads,NSMapTable was bout 2.8 times faster than TKMapTable. Using Objectcallbacks and NSNumber objects and 16 threads, TKMapTable was 23% to35% faster than NSMapTable. This is probably due to the fact thatobjects extend the time that a table must remain exclusively locked toadd and remove keys and values (a retain / release is required), alongwith a more complicated comparison (isEqual:, vs. a simple ==). Thiswas just a single cpu (G4 powerbook, 1.5GHz PPC), as I don't have amulti-CPU machine, but the performance benefits should continue toincrease with the number of CPU's available.

Since the core primitives are lockless, this means the great bane tomultithreaded programming, The Deadlock, is not possible. This factalone greatly simplifies multithreaded programming as making sure thatall locks are properly balanced with unlocks, or that all locks areacquired in the correct order, no longer needs to be considered. Andsince no thread blocks the use of a hash table from any other thread,a considerable amount of parallelism can be realized as well. It'spretty useful stuff, but it's still a work in progress at this point.In know that for me, fully lockless, transaction capable basiccollection objects would greatly simplify my multithreading programming.

As an implementation note, I decided to wrap transactions andenumerations in NSObject wrappers. These are used in an autoreleasedfashion so that even if "something" goes wrong in the middle of theiruse, they are in the autorelease pool. The hope is that by using thistechnique transactions and hash table enumerations will "clear"themselves under most (all?) circumstances, such as an exception beingthrown. As soon as the underlying NSAutoreleasePool is popped, thetransaction or enumeration will get dealloced, and will default torolling back if it was not properly closed out. It adds a bit ofoverhead, but it also greatly simplifies the tracking of such things.

If you're going to try TransactionKit out, you should be warned thatthere's obviously some rough edges. One of those is documentation,but since I've been concentrating on pre-existing API's, I don'tconsider that to be a huge defect right now. There's also almostcertainly bugs here and there, so the idea of unwinding stack framesby hand and being able to almost reflexively spot pointers to stackvariables, and which threads stack it refers to should be secondnature to you. When things go wrong, they will go wrong in astunningly complex way. Things seems fairly stable, though, andheavy, synthetic multithreaded stress tests don't result in eithercrashes or any memory loss, which is a pretty spectacular featconsidering its all lockless.

_______________________________________________

Cocoa-dev mailing list (Cocoa-dev@lists.apple.com)

Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com

Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/cocoa-dev/archive%40mail-archive.com

This email sent to [EMAIL PROTECTED]

[ANN] TransactionKit, Lockless Multi-Reader, Multi-Writer Transaction Capable Hash Tables

Reply via email to