Re: [Math] Fluent API, inheritance and immutability

Gilles Thu, 08 Aug 2013 08:17:01 -0700

Hello.

Sorry for the confusion my code sample caused; it made sense in mymind.
:) I was speaking from the perspective that an abstract class is a
skeletal implementation of an interface, created so that implementing
the interface is easier. For the non-linear least squares (NLLS)packageI was thinking something like:https://gist.github.com/anonymous/6184665


Thanks for the effort of writing the example in more details.

To address a few concerns I saw,

Gilles wrote:
In this way, it does not; what I inferred from the partial codeabove was
that there would only be _one_ "create" method. But:
1. All "create" methods must be overridden at the next level of theclass
hierarchy (this is the duplication I was referring to in the first
post).
True, but concrete classes should not be extended.


Why not?

[Anyways, the duplication also occurs in the intermediate (abstract)levels.]

Adding another
abstract class to the hierarchy would mean implementing the create
method form the superclass and delegating to a new abstract create
method that includes the new parameters. The abstract class hierarchy
would have to parallel the interface hierarchy.


With a mutable instance, you avoid this; that's the point.

2. When a parameter is added somewhere in the hierarchy, all theclasses
below must be touched too.


True, but since abstract classes are just skeletal implementations of
interfaces you can't add any methods without breaking compatibility
anyway.

Adding a (concrete) method in an abstract class will not breakcompatibility.

(You would have to add the same method to the public interface
too.)


There you break compatibility; that's why we removed a lot of the
interfaces in CM, because the API is not stable, and abstract classes
allow non-breaking changes.

This does make it important to decide on a well written and
complete API before releasing it.


When the scope of the software is well circumscribed, that would be
possible. With the whole of [Math]ematics, much less so. :-}
And state-of-the-art in Java is a moving target, aimed at by changing
CM contributors with differing needs and tastes; this adds to the
unstable mix.

And, we must note, that the duplication still does not ensure "real"
immutability unless all the passed arguments are themselvesimmutable.
But:
1. We do not have control over them; e.g. in the case of theoptimizersthe "ConvergenceChecker" interface could possibly be implemented byanon-thread-safe class (I gave one example of such a thing a fewweeks
ago when a user wanted to track the optimizer's search path)
True. Thread safety is a tricky beast. I think we agree that the only
way to guarantee thread safety is to only depend on final concrete
classes that are thread safe themselves.

I don't think so. When objects are immutable, thread-safety follows(but

note that the current optimizers in CM were never thread-safe).

But thread-safety can exist even with mutable objects; it's just thatmore

care must be taken to ensure it.

This is directly at odds with
the inversion of control/dependency injection paradigm. I think a
reasonable compromise is to depend on interfaces and make sure allthe
provided implementations are thread safe.


Yes, that a way, but again easier said that done.

A simple sequential user won't
need to care about thread safety. A concurrent user will need to
understand the implications of Java threading to begin with. Accurate

documentation of which interfaces and methods are assumed to bethread

safe goes a long way here.


I don't think I'm wrong if I say that most concurrent bugs are found in
production rather than in contrived test cases.

[That's why I advocated for introducing "real" applications asuse-cases

for CM.]

2. Some users _complained_ (among other things :-) that we shouldnot
force immutability of some input (model function and Jacobian IIRC)
because in some use-cases, it would be unnecessarily costly.


I agree that copying any large matrices or arrays is prohibitively
expensive. For the NLLS package we would be copying a pointer to a
function that can generate a large matrix. I think adding some
documentation that functions should be thread safe if you want to use
them from multiple threads would be sufficient.

I you pass a "pointer" (i.e. a "reference" in Java), all bets are off:theclass is not inherently thread-safe. That's why I suggested to mandatea_deep_ "copy" method (with a stringent contract that should allow acallerto be sure that all objects owned by an instance are disconnected fromany

other objects).

Consequently, if (when creating a new instance) we assign areferencepassed to the fluent method, we cannot warrant thread-safetyanymore;which in turn poses the question of whether this false sense ofsecuritywarrants the increased code duplication and complexity (compare yourcodebelow with the same but without the constructors and "create"methods:
even a toy example is already cut by more than half).
Agreed that thread safety can only be guaranteed with the help oftheuser. The immutable+fluent combination does add an additional layerofindirection. On the other hand it is much simpler to debug andanalyze,
especially in a concurrent environment.


Immutable objects can be shared between threads.

Thread-safety is equally obtained if mutable objects are not sharedbetweenthreads, e.g. keep the optimizer confined in one thread (noimplementation

in CM features concurrency anyways).

Even if the optimizers are not immutable, it will be possible tobenefitfrom efficiency improvement brought by concurrency e.g. by ensuringthat theobjective function is thread-safe, several evaluations could beperformed inparallel. [Moreover the optimization logic is not really concurrent inmost

optimizers. So why have unnecessarily complicated code?]

Then if we really want to aim at thread-safety, I think that theapproachof mandating a "copy()" interface to all participating classes,would bea serious contender to just making all fields "final". Let's alsorecall
that immutability is not a goal but a means (the goal being
thread-safety).
I still think that the three "tools" mentioned in the subject linedo not
play along very well, unfortunately.
I was, and still am, a proponent of immutability but IMO thisdiscussionindicates that it should not be enforced at all cost. In particular,in
small and non-layered objects, it is easy to ensure thread-safety
(through
"final") but the objects that most benefit from a fluent API are big
(with
many configuration combinations), and their primary usage does not
necessarily benefit from transparent instantiation.
Given all these considerations, in the first steps for moving someCM
codes
to fluent APIs, I would aim for simplicity and _less_ code (if justto be
able to make adjustments more quickly).
Then, from "simple" code, we can more easily experiment (andcompare,
with
"diff") the merits and drawbacks of various routes towardsthread-safety.
OK?
Agreed on the goal of thread safety. Is the copy a shallow copy?


No! (cf. above.)

If it
is, then copy is a complete punt of thread-safety to the user,forcingthem to them to determine when and what they need to copy. I thinkthemutable+copy option would make it harder to understand client codeand
understand where copying is necessary.

No; my suggestion, if feasible, is that a user who needs his own,personal,

unshared instance would simply call "copy()":

public void myMethodOne(Optimizer optim) {
  // I don't trust that "optim" can be safe: I make a (deep) copy.
  final Optimizer myOptim = optim.copy();

  // By contract, "myOptim" is assumed to contain unshared fields.
  // I can pass it safely to another thread.
  myMethodTwo(myOptim);
}

private void myMethodTwo(Optimizer optim) {
  // Create a new thread, whatever...
}

But my current point is that this could be developed at a later point,

after the fluent API has been adopted (and more widely tried withinCM),

when someone has shown that e.g. the optimizer must be thread-safe to
achieve some actual use-case.

Ultimately the decision is up to
the maintainers and I think both options under discussion are a big
improvement over the current API. :)

Thanks for the great library.


Thanks for the discussion,
Gilles


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [Math] Fluent API, inheritance and immutability

Reply via email to