TL;DR: it would be feasible to port the entire Whimsy code base over to
Node.js and get it up and running on whimsy-vm6 hosted on Ubuntu 20.04
(Focal Fossa). There would be a number of advantages to doing so.
The difference wouldn't merely be one of language syntax. The result
would likely be closer to an actor model than an object oriented model.
https://en.wikipedia.org/wiki/Actor_model
https://en.wikipedia.org/wiki/Object-oriented_modeling
A number of quasi-independent observations led me to this conclusion.
Warning: the third part is rather lengthy.
- - - - -
I wrote a test for posting an item to the agenda:
https://github.com/rubys/whimsy-board-agenda-nodejs/blob/d2b46afa81ccd980c416109f6c9c4fea198b508f/src/server/__tests__/post.js#L10
It looks remarkably similar to the analogous test for the Ruby
implementation:
https://github.com/apache/whimsy/blob/1316a898d5e8c91e8a33d89565d12efc4842dd56/www/board/agenda/spec/actions_spec.rb#L20
With this test in place, I now feel that I have one of everything needed
to make a complete application work. The question is whether this
version of the codebase will attract a sustainable community.
My current thoughts are that if I were to start from scratch, I would
definitely do so in Node.js, but at the moment, that's not what I am
facing. The Ruby base is more mature, and the Node.js base is just
starting.
- - - - -
It appears that svn added a new command line parameter,
--password-from-stdin. I added support for this parameter to the
Node.js board agenda tool yesterday:
https://github.com/rubys/whimsy-board-agenda-nodejs/blob/d2b46afa81ccd980c416109f6c9c4fea198b508f/src/server/svn.js#L54
If you are running on a Mac, you may or may not have a version of svn
that supports it. brew upgrade svn will get it for you.
The versions of svn in the repositories for Ubuntu 16.04 and 18.04 don't
support this parameter. The version in the Ubuntu 20.04 repository
does.
If we are writing new code, it is relatively straightforward to handle
this correctly. Updating all of the existing code, on the other hand,
represents a technical debt at this point.
- - - - -
While Ruby and JavaScript have very different surface syntaxes, their
runtime models have a lot of superficial similarities. There are some
subtle differences, which I will oversimplify as follows:
Ruby tends to encourage a more object oriented approach to solving
problems. JavaScript tends to encourage a more event driven approach.
The current Ruby model can be seen here:
https://whimsy.apache.org/docs/api/
We have some obvious domain model object classes: Person, Committee,
CCLAFiles. There are some cross-cutting concerns shared by each, and
those tend to be broken out into classes: LDAP, Git, SVN. Along the
way, the reads and writes for any given data type tend to be clustered
together.
I continued with this approach on the client, where I had an Agenda
class which contained a list of items, each of which was an object that
responded to method calls indicating whether that item was read for
review, flagged, or whatever.
There are a number of drawbacks to this approach. As an example, if you
don't take care, performance suffers: a number of operations take a
while because they require a large number of LDAP requests.
Caching can help, but then there are cache invalidation issues to worry
about. I built a clever solution using weak references, but clever is
generally a sign that there is a flaw in this approach.
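For what it's worth, a rough JavaScript analogue of that kind of
weak-reference cache looks like the sketch below. The Ruby original
differs; WeakCache and its fetch parameter are invented names, and
WeakRef requires Node.js 14.6 or later:

```javascript
// Sketch: memoize an expensive lookup (say, an LDAP query) behind
// weak references, so the garbage collector can reclaim entries.
// Cached values must be objects, since WeakRef cannot hold primitives.
class WeakCache {
  constructor(fetch) {
    this.fetch = fetch;     // the expensive lookup function
    this.refs = new Map();  // key -> WeakRef to the cached object
  }

  get(key) {
    const cached = this.refs.get(key)?.deref();
    if (cached !== undefined) return cached;  // cache hit
    const value = this.fetch(key);            // cache miss: go fetch
    this.refs.set(key, new WeakRef(value));
    return value;
  }
}
```

Clever, as noted, but you still have to reason about when entries
vanish, which is exactly the kind of complexity the data-first approach
below avoids.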
Going out to svn every time there is a request would be a problem, but
that can be mitigated by keeping a local working copy up to date with
cron jobs. This, too, is a form of caching.
We can improve on that with pubsub, and the infrastructure team is
working on that.
But if you ignore the caches, the flow for displaying an agenda item on
the client is very linear: you start with getting a file from svn, you
issue a bunch of LDAP requests, you get another file from svn, issue
more LDAP requests, package up a JSON response that is sent to the
client which pulls it apart, and renders the result in the DOM.
None of that would be feasible without caches.
The JavaScript approach is different. Instead of starting with the
objects and propping up the architecture with caches, you start with the
data (what you previously would call a cache) and build a number of
quasi independent units of work that operate on the data.
When I undertook the port of the board agenda tool to Node.js, I took
whatever code I needed, figuring that I would find a way to factor it
out into libraries later. The code to parse committee-info.txt is a
prime example of something that would be useful to many tools.
On the client, I decided to replace my custom models, routing, and
event handling with the ones favored by the React community.
Stepping back, I see that the code tends to be considerably less linear.
Previously, pushing a button on the client would do something directly,
and often needed explicit code in place to cause data in another
component to rerender.
Now pushing a button will generally do one of three things: surface a
modal dialog, change state within the dialog, or dispatch an action to
the Redux store (possibly based on data retrieved from the server).
That's it. In the first two cases, everything is local. In the latter
case, it is somebody else's problem to do something with the data.
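To illustrate that dispatch-and-forget pattern, here is a minimal store
in the spirit of Redux, written without the library so it stands alone.
The action type and state shape are invented for this sketch, not the
board agenda tool's actual store:

```javascript
// Minimal Redux-style store: dispatch computes new state through a
// reducer, then notifies subscribers. The dispatcher doesn't know or
// care who reacts.
function createStore(reducer) {
  let state = reducer(undefined, { type: '@@init' });
  const listeners = [];
  return {
    getState: () => state,
    subscribe: fn => listeners.push(fn),
    dispatch: action => {
      state = reducer(state, action);  // compute the new state
      listeners.forEach(fn => fn());   // everyone else reacts
    }
  };
}

// Hypothetical reducer: a button handler's only job is to dispatch.
function agendaReducer(state = { items: [] }, action) {
  switch (action.type) {
    case 'POST_ITEM':
      return { ...state, items: [...state.items, action.item] };
    default:
      return state;
  }
}
```

The button handler dispatches and moves on; any component that cares
about agenda items subscribes and rerenders itself.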
A similar thing happened on the server. I have cache files that
represent the parsed version of the agenda, committee-info.txt, member
data from LDAP, and the like. If cached files change, the client is
notified, and it has the option to load that information in the store.
The infrastructure team has already enabled pubsub for LDAP data, and is
working on pubsub for private svn repositories. The role of these
functions will be to update the source of truth on the servers
(i.e., the caches).
Instead of having shared libraries for parsing committee-info.txt, there
can be a canonical JSON file for this data, and multiple tools can have
file watchers that trigger when this file changes.
In fact, we already have these types of files; you can see them in
https://whimsy.apache.org/public/. We can create some more and put them
in a private directory (and perhaps even make them available to
authenticated requests). But we mostly make this data available for
other tools; we don't use it much ourselves.
I'm finding that I like the result more and more. Instead of looking
at an object and a description of what a method should return, and
trying to figure out what's broken when things don't work as expected,
you can directly inspect the data to see whether the problem is in the
production of the data or in its consumption.
In other words, when something goes wrong, the first thing you do is
examine the Redux store on the client (I enabled this via the "=" key
in the board agenda tool) or go to https://whimsy.apache.org/public/
or the equivalent.
And when we are done, we can not only be a pubsub consumer, but perhaps
we can look to be a pubsub source (likely by hosting an instance of the
pypubsub package). Different tools running on different machines,
perhaps written in different languages, can all collaborate in this manner.
- Sam Ruby