Re: [go-nuts] Re: An efficient runtime expression evaluation

Michael Jones Mon, 18 Jul 2016 03:26:52 -0700

Anything much faster than this needs vector operations in the interpreter so 
the “get to the OP function” overhead is once per time series rather than once 
per element in the series.

From: <golang-nuts@googlegroups.com> on behalf of Egon <egonel...@gmail.com>
Date: Monday, July 18, 2016 at 1:32 AM
To: golang-nuts <golang-nuts@googlegroups.com>
Cc: <ondrej.ko...@gmail.com>
Subject: [go-nuts] Re: An efficient runtime expression evaluation

On Monday, 18 July 2016 11:13:08 UTC+3, Egon wrote:

On Monday, 18 July 2016 10:30:14 UTC+3, Egon wrote:

On Monday, 18 July 2016 03:11:29 UTC+3, ondrej...@gmail.com wrote:

Cheers, I tried replicating my endeavours 
(https://play.golang.org/p/Qxoo2ASac6), sorry if it's still too verbose. It's 
essentially rewriting the inbuilt ast.Node into a simpler nested struct and 
then walking it.

In testing the performance, I started adding algebraic expressions, which make 
my walking more expensive, but don't change the 'native' expression evaluation 
(I guess due to constant folding).

As to your suggestion three - I do the variable lookup in the parsing stage, 
but I still need to retain the pointer, not the value itself, because I'm 
accessing an element of that given variable (time series), and this element 
(time period) changes at runtime.

https://play.golang.org/p/dd4hTpMKrp

Of course you can additionally add constant folding and similar... Additionally 
instead of working on a single float at a time, make each variable an array of 
8 floats, that are computed in parallel.

Just realized that it's trivial to write basic constant folding: 
https://play.golang.org/p/iqWX5_Mweb

Although it probably will be easier to maintain and extend as a separate pass: 
https://play.golang.org/p/xcz5mXoaOG

This brings the result to:

interpreter: 17.001ms

native: 7.0004ms

Which is approximately the best I would expect from an interpreter without JIT 
(and not computing multiple time-points at a time).

+ Egon

One performance gain I can think of is to implement some pruning through the 
abovementioned constant folding and other optimisations, but I'd rather leave 
that as the last resort. Another thing that comes to mind is that I could 
return nested closures in some way - meaning that '1+3*x' would be, in go-like 
pseudocode, add(func() { return one }, func mul(func() { return three}, func() 
{return model[x]} )), where the one/tree are values passed to the closure when 
parsing the equation; but that's just now off the top of my head.

I attached a pprof result in the header.

Thanks again.

On Friday, 8 July 2016 15:46:32 UTC+1, Egon wrote:

On Friday, 8 July 2016 16:25:40 UTC+3, Ondrej wrote:

Hi all,

I have a model with variables, let's call them a, b, c, ..., z. These are 
numerical values (time series loaded from a database) and I let the user 
specify their relationships in a JSON, say 'z = 12; x = a + 2/3 + 3*c; y = 
log(12*f) + exp(g)' etc. The syntax is trivial - it's basically just algebraic 
relationships + a few functions (log, log2, log10, exp, trigonometrics, ...; 
all 1:1 mappings to their math package equivalents).

Tip: include a working piece of code that you want to make faster, it makes it 
easier for people to see the problems and common issues.

Now, I get these relationships in a JSON and I parse them using go/parser. Then 
I walk the tree once and process it a bit - replacing keywords by pointers to 
my variable stores, replacing all the log/exp/sin with function pointers, 
leaving literals be literals etc. Each node is then a struct with a type and 
the actual contents (sadly a generic interface, because the value can be almost 
anything). The prep stage is now over.

When actually running the model, I loop through years and within each year I 
solve each variable - I walk the tree and evaluate it where needed. The only 
non-trivial action is when I get to a model variable, I need to do a bit of 
lookup (it's a time series, so I need to look up the correct time period and 
other bits). Otherwise it's just literals, operators and function calls, all of 
which is fairly straightforward.

This is all well and good. One of the issues is that it's rather slow. I 
thought it would be the recursive nature (and interface assertions), but 
converting all this into a shunting yard system didn't improve the performance 
dramatically. I've profiled the thing and removed a few hotspots, my question 
is not about profiling. I'm after a bit more general advice on how to handle 
these runtime evaluations and if there are better ways of doing so. Essentially 
some sort of a JIT (but Go does not have runtime assembly, right?), or maybe 
convert each expression into a closure or maybe a whole different algorithm 
or...?

Reduce the amount of code and indirection that you need to do, few basic ideas:

1. implement a VM https://play.golang.org/p/dlmZ2lGPY7

2. operate on vectors of variables instead of single values 
https://play.golang.org/p/25MIjIXs0D

3. try to do the lookup of all necessary variables before starting to compute 
with them; if possible

Obviously pprof is your friend. (https://blog.golang.org/profiling-go-programs)

+ Egon

-- 
You received this message because you are subscribed to the Google Groups 
"golang-nuts" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to golang-nuts+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups 
"golang-nuts" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to golang-nuts+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Re: [go-nuts] Re: An efficient runtime expression evaluation

Reply via email to