Re: [fpc-devel] RFC: Support for new type "tuple" v0.1

Hans-Peter Diettrich Sat, 26 Jan 2013 23:41:35 -0800

Sven Barth schrieb:

* Description
What are tuples? Tuples are an accumulation of values of different or same type where the order matters. Sounds familiar? They are in this regard similar to records, but it's only the order of an element that matters, not its name. So what does make them special? Unlike records you can only query or set all the elements of a tuple at once. They basically behave like multiple assignments. In effect they allow you to return e.g. a multivalued result value without resorting to the naming of record fields (you'll still need to declare a tuple type) or the need for out parameters. This in turn allows you to use them for example in "for-in" loops.

The lack of element names results in bloated code and runtime overhead. See below.

* Declaration:

[...]

The usage of constructors and destructors also allows a realisation of group assignment:
=== code begin ===

var
  a, b, e: Integer;
  c, d: String;
begin
  a := 42;
  c := 'Hello World';
  (b, d) := (a, c);
  a := 21;
  b := 84;

  (a, b) := (b, a); // the compiler needs to ensure the correct usage of temps here!


What will happen here?

At compile time a tuple type (integer; integer) has to be defined, and an instance must be allocated for it. Initialization and finalization information/code must be added if required.

At runtime the arguments are copied into that tuple instance, then copied into the target variables. All "copies" may be subject to type conversions and reference counting.

Consider memory usage and runtime when tuples are nested, or contain large data structures (records, static arrays...).
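
A rough sketch of those two copy steps, written with today's language. A plain record stands in for the tuple temporary; TIntPair and SwapViaTemp are names made up for this illustration, and nothing here claims to be what the compiler would actually generate:

```pascal
program TupleTemps;
{$mode objfpc}{$H+}

type
  // Hypothetical stand-in for the anonymous tuple type (Integer; Integer).
  TIntPair = record
    First, Second: Integer;
  end;

// Emulates "(x, y) := (y, x)": fill the temporary, then unpack it.
procedure SwapViaTemp(var x, y: Integer);
var
  tmp: TIntPair;
begin
  // Step 1: copy every right-hand side value into the temporary.
  tmp.First := y;
  tmp.Second := x;
  // Step 2: copy the temporary into the target variables.
  x := tmp.First;
  y := tmp.Second;
end;

var
  a, b: Integer;
begin
  a := 21;
  b := 84;
  SwapViaTemp(a, b);
  // a now holds 84 and b holds 21.
  WriteLn(a, ' ', b);
end.
```

Every element is copied twice. With managed or large element types each of those copies costs reference counting or a block move.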

  a := 42;
  (a, e) := (a * 2, a); // (a, e) should be (84, 42), not (84, 84)


Such code tends to become cryptic with larger tuples.
High level (source code) debugging will be impossible :-(


[...]

* Possible extensions

Note: This section is not completely thought through!
A possible extension would be to allow the assignment of tuples to records and/or arrays (and vice versa). [...]

Without references to distinct tuple elements the coder has to provide local variables for *all* tuple elements, then decompose the *entire* tuple, before access to a single element will be possible. This may be accomplished with less source code when a tuple can be assigned to a record variable, but then it would be simpler to use records *instead* of tuples.

When a record type is modified, during development, all *compatible* tuples and tuple types must be updated accordingly.

* Possible uses

- use for group assignments which can make the code more readable

... or unreadable (see above).

- use for multivalued return values which can make the code more readable (instead of using records or out parameters)

This IMO makes sense only when such tuples are passed along many times, before references to their elements occur. Otherwise full tuple decomposition is required when e.g. only a succ/fail indicator in the result tuple has to be examined.
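
For comparison, a minimal sketch of the record variant that works today. TDivResult and TryDivide are invented for this example; the point is only that a single named field can be examined without declaring a variable for every element:

```pascal
program RecordResult;
{$mode objfpc}{$H+}

type
  // Hypothetical multi-valued result; the field names are illustrative.
  TDivResult = record
    Ok: Boolean;
    Quotient, Remainder: Integer;
  end;

// Returns quotient and remainder together with a success flag.
function TryDivide(a, b: Integer): TDivResult;
begin
  Result.Ok := b <> 0;
  Result.Quotient := 0;
  Result.Remainder := 0;
  if Result.Ok then
  begin
    Result.Quotient := a div b;
    Result.Remainder := a mod b;
  end;
end;

begin
  // Only the success flag is examined; no decomposition is needed.
  if TryDivide(17, 5).Ok then
    WriteLn('division succeeded');
  // A single element is reachable by name.
  WriteLn(TryDivide(17, 5).Remainder);
end.
```

With a tuple result as proposed, the same check would need three local variables and a full decomposition before the flag could be tested.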

- use as result value for iterators (this way e.g. key and data of containers can be queried)

This reminds me of SQL "SELECT [fieldlist]", where *specified* record fields are copied. But I wonder how confusion can be eliminated in the order of the tuple elements. Will (k,v) or (v,k) be the right order for key and value? What are the proper types of key and value?

* Implementation notes
Tuples need to pay attention to managed types (strings, interfaces, etc.). Thus an Init RTTI will be required (which needs to be handled by fpc_initialize/fpc_finalize accordingly). It might be worthwhile to add a new node type for tuple constructors/deconstructors (one node type should be sufficient) and handle them in assignment nodes accordingly.


I'd reuse the record type node for that purpose.

* Open issues
Should anonymous tuples (together with tuple constructors) be allowed to participate in operator search as well? This would on the one hand allow the following code, but on the other hand make operator lookup rules less clear (because of assignment compatibility rules):
=== code begin ===

type
  TDoubleVector = tuple of (Double, Double, Double, Double);

operator + (aLeft, aRight: TDoubleVector): TDoubleVector;
// implement by e.g. using SSE instructions

// somewhere else
begin
  (d1, d2, d3, d4) := (d1, d2, d3, d4) + (1.0, 2.0, 3.0, 4.0);
end;

=== code end ===

SSE should be used with array types, where all elements *definitely* have the same type. Then one "+" operator can be implemented for open arrays of any size, which looks quite impossible for tuples.
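
A minimal sketch of the array-based idea, assuming a plain function instead of an overloaded operator and a scalar loop where SSE code could go. AddVectors and TDoubleArray are names chosen for this example only:

```pascal
program OpenArrayAdd;
{$mode objfpc}{$H+}

type
  TDoubleArray = array of Double;

// Element-wise sum of two equally long arrays of any size.
// The loop body is the place where a vectorised version would go.
function AddVectors(const aLeft, aRight: array of Double): TDoubleArray;
var
  i: Integer;
begin
  Result := nil;
  Assert(Length(aLeft) = Length(aRight), 'length mismatch');
  SetLength(Result, Length(aLeft));
  for i := 0 to High(aLeft) do
    Result[i] := aLeft[i] + aRight[i];
end;

var
  sum: TDoubleArray;
begin
  sum := AddVectors([1.0, 2.0, 3.0, 4.0], [1.0, 2.0, 3.0, 4.0]);
  // One routine serves every length; the element type is fixed by the signature.
  WriteLn(Length(sum));
end.
```

The same routine accepts two, four or a thousand elements, whereas each tuple arity would be a distinct type.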



Conclusion:

IMO tuples are *abstract* templates (mathematical notation) for *concrete* (record...) implementations. I see no need or purpose in the introduction of such an abstract type into any concrete language, except when that language lacks an already existing record (or equivalent) type.

Nonetheless the discussion revealed some possible improvements of record handling, like default constructors/initializers for records, outside "const" clauses.


DoDi

