Re: Display csv in a tableView with bindings

I. Savant Sun, 26 Jul 2009 07:55:16 -0700

On Jul 26, 2009, at 6:32 AM, Aaron Burghardt wrote:

Neither, you want an array of dictionaries where each row of CSV isa dictionary in which the values keyed to column names and each rowof CSV is one dictionary object in the array.


  This is a bit more complicated than that, actually.

There's a bit of a catch-22 here. On the one hand, you have aperformance consideration. On the other, you have an ease-of-programming consideration. Using NSDictionary is easier, but formoderately-sized files it is noticeably slow, for large files, it'sunusably so.

If you go the dictionary route, using the keys to identify the"fields" in each row, you're storing *way* more than just theindividual field contents. You're storing a copy of your fieldidentifier keys for every field, for every row. Best-case scenario,you're storing a pointer to some object that represents the "column"to which the fields belong, but this defeats the ease-of-use withbindings as you need string keys. As I mentioned above, withincreasingly large files, this dramatically increases your reading/writing time and uses a lot of memory. But at least you get theability to easily use bindings and to sort, all for free, performancebe damned.

If you go another route (an array of arrays of strings), it's farmore efficient, but adds a few programming complexities:

1 - How do you sort by a column? There's no key for sort descriptorsand sorting via selector provides no way to pass additionalinformation (such as column index or identifier).

2 - To what do you bind? The same limitation that causes concern inproblem #1 makes #2 difficult ... and there is little by way of over-the-counter laxative to make #2 less difficult.

3 - If you intend to allow reordering of columns (built-in NSTableViewfeature) or even adding/removing columns, how do you handle keepingthe columns mapped to the correct fields in the row array in theabsence of an associative array (dictionary)?

The easiest solution to all three of these problems (in my opinion)is to make a "row" a custom class and a helper class (we'll call it"ColumnMapper" - one mapper shared among all rows). The row's internalstorage can still be an array of strings for low overhead, but the Rowclass has a trick up its sleeve. It overrides -valueForUndefinedKey:so that it can still look up associative values (like a dictionary)but without storing them. The storage occurs once in the ColumnMapper.

When asked for a field value for a column, a Row asks theColumnMapper for the index (the index in its storage array) for thefield the column represents. Likewise for storing a field value. Thisworks because, since Row doesn't respond to these column ids as keys,it KVC falls back to -valueForUndefinedKey: and our Row classoverrides this and relies on the central ColumnMapper to determinewhere in its internal storage the value for that column ID is located.

This solves the sorting issue quite nicely too, if you sort usingdescriptors. Since NSSortDescriptor uses KVC, it "just works". Don'tforget to google around for "Finder-like sorting" ... the built-inmethods make a mess of alphanumeric strings. I leave implementing thatto your imagination ... it's actually really easy if you spend a fewminutes with Google.

Note also this approach requires that all rows have the same numberof columns/fields. Your parsing logic will have to account for this byeither automatically adjusting (fraught with complexities andassumptions) or rejecting the file and informing the user of the firstrow where trouble begins - ie, the first row where the number offields/columns differ from the rest. You really should take this routeanyway, since the missing field in a row might be somewhere other thanthe end ... so what do you do with the remaining fields in the row?They are probably in the wrong column and there's no way to knowbecause of CSV's inherent lack of solid structure.

The only remaining problem is bindings. If you want to be able tohandle any CSV file (ie, the "fields" are unknown), I'm afraid there'sno way to use bindings in IB. You'll have to create the table columns(and bind them) in code once you've parsed your file and determinedthe number of columns. In this regard, you might find it just as easy(if not easier) to eschew Cocoa Bindings altogether and just use theNSTableDatasource protocol. It gives you more precise control overwhat to refresh and when. Trust me, this will come up.

Of course for very large files, both methods will be slow (andmemory-intensive), and the problem becomes far more complex becausethen you need to start considering low-level solutions that don'tignore encodings. The anthesis to this concern is that, as thecomplexity and size increase, the likelihood that a human will want tosee it as a table they will manually manipulate decreases (or atleast, the reasonableness of the request does). At that magic tippingpoint, it's easy to argue that a GUI editor is no longer feasible andmost of this problem goes away.


  Good luck and happy coding! :-)

--
I.S.




_______________________________________________

Cocoa-dev mailing list (Cocoa-dev@lists.apple.com)

Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com

Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/cocoa-dev/archive%40mail-archive.com

This email sent to arch...@mail-archive.com

Re: Display csv in a tableView with bindings

Reply via email to