Re: [racket] DSLs and complexity

John Gateley Fri, 21 Jun 2013 20:56:28 -0700

Matthias, all:

Thanks very much for the considered replies. As a Racket newbie,
this gives me insight into how Racket is used to solve complex
problems.


Complexity exists, and requires complex solutions. But often I
am faced with (or have even designed) complex solutions that
either did not solve the problem, or solved it at a cost that
was too high.

I don't think the article is vacuously true - it is not against
complexity, it is against complexity that costs too much.
My personal experience with DSLs has been in situations
where they were overused, or used with poor tools (and
again I hasten to add, not with Racket). I've worked with
too many people (including myself) who fall in love with
their own complex designs.

Thanks

John

On 6/21/13 1:35 PM, Matthias Felleisen wrote:

On Jun 21, 2013, at 8:26 AM, John Gateley wrote:
Subject for discussion:

http://firstround.com/article/The-one-cost-engineers-and-product-managers-dont-consider#

Interesting sentence in the middle:
Consider DSLs, abstractions and the attraction to being the one to build a framework that gets leveraged for years.

I think Racket is a different target: education vs. engineering (is this true?). As a software engineer, I really agree with the article. Complexity is almost always a terrible thing, whether it is a DSL, a complex implementation of a simple interface, or just the one additional thing requested by product management that didn't fit.

For Racket: are DSLs a source of complexity? Or would you argue that they reduce the complexity normally introduced with DSLs?

John,

this article's claim concerning abstractions and DSLs is vacuously true so it's also devoid of any information. I grant the maintenance-construction cost ratio; I teach to this slogan -- starting with How to Design Programs through HtD Components and HtD Systems. It is the guiding principle.

If we wanted to turn this person's essay into a well-founded statement that helps engineers, we would first have to clarify what complexity is or what simplicity is. No, "I know it when I see it" won't work here. To clarify, I am sure that many people will say that C is a simpler language than Racket. If we follow the article's recommendation then, we should use C. But as you know, C lacks safety and memory safety and these gaps seriously impede software development and maintenance. The lack of safety means that you never know whether the output of a C program is serious or whether it's some random bits from some place in memory interpreted as, say, an int. The lack of memory safety in particular destroys modularity. Every piece of dynamically allocated memory handed over from one component to another must be tracked and accounted for.

Not every language that is more complex than C will reduce the cost of software construction and maintenance. To wit, C++ started out as a more complex variant of C and to this day it is 'sold' that way even though it moved away from its roots over the past 10 years. Its complexities introduce seriously deeper safety problems, which measurably impact software construction and maintenance. IBM believes that this cost is a factor of 3x to 5x when compared to Java, another language that is definitely more complex than C. The San Francisco project under Kathy Bohrer, a Rice grad from around your time, ran the project in C++, switched to Java, and convinced a lot of people at IBM to measure this cost. A few years later the company switched all software to Java for these reasons. This is not to say that Java is good; but it does say that they actually measured cost, compared, and went from simple and complex languages to other complex languages. If I were a senior software dev manager at a company, I would pay attention to someone who measures and compares instead of someone who writes content-free polemics that actually sound correct.

The toggles example from the article is apt here. The lack of safety in C means that there is no isolation and every line in a system may potentially affect the behavior of every other line -- just like the toggles/switches mentioned. But now imagine the guy had first built a box around the first two switches so that only their relevant behavior is visible -- say two states -- and then added a third one. In that case, the interactions would be fine.

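[Editorial sketch, not part of the original message.] The arithmetic behind the toggles argument can be checked with a few lines of Python. This is only an illustration under assumptions: the rule inside `boxed_view` (the box reports "on" only when both switches are on) is hypothetical, chosen so that the box exposes exactly two states as in the paragraph above.

```python
from itertools import product

SWITCH = (False, True)  # a single toggle is either off or on


def raw_states(n):
    """All joint settings of n switches that can see each other directly."""
    return list(product(SWITCH, repeat=n))


def boxed_view(a, b):
    """Hypothetical box: outsiders only learn whether both switches are on."""
    return "on" if (a and b) else "off"


def boxed_states():
    """Visible states when the first two switches sit behind the box."""
    views = {boxed_view(a, b) for a, b in product(SWITCH, repeat=2)}
    return sorted(product(views, SWITCH))


unboxed = raw_states(3)   # three unboxed switches
boxed = boxed_states()    # boxed pair plus a third switch
print(len(unboxed), len(boxed))  # 8 4
```

With three exposed switches a maintainer has eight joint settings to reason about; with the first two boxed, only four remain visible, and that count stays at four however many settings the box hides internally.
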
LESSON 1: simplicity by itself is not an advantage in software development.

LESSON 2: complexity comes in many flavors, some good, some bad.

Now let's move on to DSLs. A DSL, like any abstraction, helps you reduce cost if it is well designed and meets your needs. If you don't have a need in existing code for an abstraction, don't build it. If you do have a need,
 -- understand the abstraction mechanisms of your language
 -- study the concrete cases of repetition, extensive verbiage to say things
 -- use the abstraction mechanisms of your language to create an abstraction that removes your repetitions, extensive verbiage.

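[Editorial sketch, not part of the original message.] The three steps above can be illustrated roughly as follows. Python stands in here only for convenience; in Racket the last step would more likely use macros. Every name in the block (`validate_verbose`, `in_range`, `RULES`, `validate`, the order fields) is hypothetical and invented for the illustration.

```python
# Step 2: the concrete repetition. The same shape of check is written
# out by hand for every field.
def validate_verbose(order):
    errors = []
    if "quantity" not in order:
        errors.append("quantity: missing")
    elif not (1 <= order["quantity"] <= 100):
        errors.append("quantity: out of range")
    if "discount" not in order:
        errors.append("discount: missing")
    elif not (0 <= order["discount"] <= 50):
        errors.append("discount: out of range")
    return errors


# Step 3: use the language's abstraction mechanisms (here closures and a
# dictionary) so that each rule is stated once, in domain terms.
def in_range(lo, hi):
    """Build a check that returns None on success or a problem string."""
    def check(value):
        return None if lo <= value <= hi else "out of range"
    return check


RULES = {
    "quantity": in_range(1, 100),
    "discount": in_range(0, 50),
}


def validate(order, rules=RULES):
    """Apply every rule; report missing fields and failed checks."""
    errors = []
    for field, check in rules.items():
        if field not in order:
            errors.append(f"{field}: missing")
            continue
        problem = check(order[field])
        if problem:
            errors.append(f"{field}: {problem}")
    return errors


sample = {"quantity": 0}
assert validate(sample) == validate_verbose(sample)
```

The abstraction pays off only because the repetition already existed in `validate_verbose`; with a single field to check, the table of rules would be extra machinery with no cost to remove.
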
If you work in Java, you don't have good abstraction mechanisms to eliminate domain-specific verbiage from your systems. My recent reading experience with three books on industrial DSL building tools firmly convinced me that current practice can easily flip into counter-productive architecture acrobatics. If you build these DSLs, you may make the system more costly. If you work in Racket, you have great tools for building internal DSLs and you can smoothly integrate modules written in different DSLs. As I said, these thoughts apply to any abstraction mechanism; internal DSLs are just the most powerful form of abstraction.

LESSON 3: if you have bad tools, building an abstraction may increase the cost of building a system

LESSON 4: with good tools, you're likely to reduce the cost, but ill-trained programmers can and do mess up

Let me finally address internal complexity vs external complexity. When you construct a language like Java, you are actually building an extremely complex system. But, the Java community succeeded beyond imagination and possibly beyond justification with making their language appear well-layered, well-structured and simple. Guy Steele has stated this as "you have to make things sufficiently complex to make them appear simple" and Matthew has stated similar experiences with the design of Racket again and again. The key is that very few Java programmers experience the complexity that the Java box hides; most have to cope with the complexities of the language.

Here is an example that is more straightforward. AMPL is an external DSL for writing down mathematical programs (linear, integer, binary, graphs, networks). If you are an applied math person, the only difference between AMPL programs and paper and pencil is that the former uses ASCII. (Perhaps they use Unicode these days.) It is definitely at the right level and reduces the huge cost of maintaining mathematical programs in, say, Fortran, a vastly simpler language than AMPL -- as far as the internals are concerned. AMPL is after all a complex PL, with nifty parsers and abstraction capabilities for plugging in all kinds of solvers. But it neatly separates the solver from the problem statement, for example, which while internally complex, makes it externally simple.

LESSON 5: Do not confuse external and internal complexity. A proper separation may vastly decrease the cost of creating programs.

Very last thought on 'complexity.' All of the above assumes that the time of programmers is more costly than what it takes to run the computation. I know of cases where this is not true. In that case, you might be better off sacrificing programmers on the altar of machine time. The cases I know are extremely rare and involve expensive programmers who understand a lot of continuous mathematics and who work with computers that are extremely expensive to use.

;; ---

As for Racket: yes, it is good at DSLs because we had to pay attention to building languages for our original educational goals. But for the last 10 years, the focus has widened to include a lot of industrial and quasi-industrial uses. It is no longer the TI Scheme of yore.

Your mileage may vary. -- Matthias

____________________
  Racket Users list:
  http://lists.racket-lang.org/users
