Re: [Bug-apl] About the reduction of clone() calls

Juergen Sauermann Mon, 28 Apr 2014 07:25:33 -0700

Hi,

generally speaking unnecessary clone() of values should of course beavoided.

In GNU APL 1.0 and 1.1 there was a flag-based system of value ownershipwhere the last ownerwould delete the value when giving up its interest in the value. Thissystem began like thetmp flag, but then caused stale values on one hand and segfaults forvalues released too early on the

other hand all over the place.

I then changed to Value_P which is pretty much a shared pointer withreference counting. InitiallyI tried the standard shared_ptr<> class of C++ but that caused somecompiler/portability issues.

That change has brought a lot of stability into GNU APL,

As you have noticed yourself, going through the code and changing is allover the place is bad anderror-prone, and it would kind of undo the change from the flag-systemto Value_P (which was also

a huge effort as you may figure from SVN).

So the only solution remaining is rewriting the Value (or. more likely,the Value_P) class.Now, the Value class knows how many owners it has (the share_count belowalready exists, it is called owner_count).But it does not know who they are. If someone does X[Y]←Z then symbol Xwould need to clone its current valuewhen Value::index(Z) os performed, but Value::index(Z) does not knowanything about Symbol X and other owners

of the value,

I will send you the 170+ GNU APL testcases so that you can quickly checkif some change you make is breaking

something else.

Having several Values pointing to the same ravel is also a bit obscurebecause the Cells of the ravel need to bereleased (sub values and complex numbers need to be deleted) which isalmost impossible if you have nested

ravels or ravels of of different sizes.

I also doubt that you can gain 3-4 orders of magnitude because a valueis normally only cloned very few times.That does not mean that you can't prove otherwise, I can speed up +/⍳Nby 6 orders of magnitude but that does

not prove that this is a valuable optimization,

/// Jürgen


On 04/28/2014 02:57 PM, Elias Mårtenson wrote:

Hello Jürgen,
I don't know if you have given this issue any thought, but it hascertainly occupied my mind for the last few days.
It's clear that heavy array processing does far too much cloning thanshould be necessary. Especially in cases where you have lots ofoperations on smaller arrays (as opposed to few operations on largearrays).
This is because the code always performs a clone prior to adestructive operation because at that point it can never be sure thatthe array will not be used again. The (very few) exceptions to this ishandled by the "temp" system, which is really only used in the ravelfunction.
The way I see it, there are two approaches to solving this: The firstone being to go through the code with a fine-toothed comb andimplement the temp system everywhere. This is lots of work and iserror-prone.
The other solution would be to re-engineer the Value class so that itcan share underlying data with other Value instances. The Value classwould then get a counter called share_count or something, indicatinghow many other references there are to the same data. When clone() iscalled, it will simply create a new Value instance, share theunderlying data and increase share_count. Any destructive operationswould only copy the content if it's not already shared.
/Now, Value_P already implements some of these semantics. Could it bereused for this?/
/
/
While appealing, this copy-on-write solution might not be perfectthough. The assumption is that the caller would have decremented theshare count (effectively "releasing" the Value) before the calledfunction tries to modify it. Now, this "release" is similar to the the"temp" marking. Could there be a better way?
I've been experimenting with this on and off, and my interim results(as I've mentioned before) show that the potential performance boostsare massive. We're talking about 3-4 orders of magnitude. Definitelyworth quite a lot of effort, IMHO.
I'm likely going to continue pursuing this, because I personally feelfrustrated when I do something and it's not instantaneous, knowingthat I'm waiting for the interpreter to perform unnecessaryoperations. :-)
What are your thoughts on this? What would be the best approach?

Regards,
Elias

Re: [Bug-apl] About the reduction of clone() calls

Reply via email to