[Pharo-dev] Re: Array sum. is very slow

Nicolas Anquetil Sun, 09 Jan 2022 03:06:27 -0800


Definitly not easy to do benchmarking
I got these strange results:


n := 10000000.
floatArray := Array new: n. 

Time millisecondsToRun: [ floatArray doWithIndex: [:each :idx |
floatArray at: idx put: Random new ] ].
"-> 2871"

Time millisecondsToRun: [ floatArray doWithIndex: [:each :idx |
floatArray at: idx put: i ] ].
"-> 86"

Time millisecondsToRun: [1 to: n do: [:i | Random new ]].
"-> 829"

so
- assigning 'Random new' to 1M array elements takes 2.8 seconds.
- assigning a value to 1M array elements takes 0.08 seconds.
- computing 'Random new' 1M times takes 0.8 seconds


I wonder where the extra 2 seconds come from?
some optimization in the background?

I did the 3 of them several times in different order and the results
are similar.

nicolas

On Fri, 2022-01-07 at 15:36 +0000, Benoit St-Jean via Pharo-dev wrote:
> Can you come up with a simple "base case" so we can find the
> bottleneck/problem?
> 
> I'm not sure about what you're trying to do.
> 
> What do you get if you try this in a workspace (adjust the value of n
> to what you want, I tested it with 10 million items).
> 
> Let's get this one step at a time!
> 
> 
> 
> |  floatArray  n  rng t1 t2 t3 r1 r2 r3 |
> 
> n := 10000000.
> 
> rng := Random new.
> 
> floatArray := Array new: n. 
> floatArray doWithIndex: [:each :idx | floatArray at: idx put: rng
> next].
> 
> t1 := Time millisecondsToRun: [r1 := floatArray sum].
> t2 := Time millisecondsToRun: [| total |
>       
>                                                               total
> := 0.
>                                                               floatA
> rray do: [:each | total := total + each ].
>                                                               r2 :=
> total].
>                                                       
> t3 := Time millisecondsToRun: [r3 := floatArray inject: 0 into:  [:
> total :each | total + each ]].
> 
> Transcript cr.
> Transcript cr; show: 'Test with ', n printString, ' elements'.
> Transcript cr;show: 'Original #sum -> Time: ', t1 printString, '
> milliseconds, Total: ', r1 printString.
> Transcript cr;show: 'Naive #sum -> Time: ', t2 printString, '
> milliseconds, Total: ', r2 printString.  
> Transcript cr;show: 'Inject #sum -> Time: ', t3 printString, '
> milliseconds, Total: ', r3 printString.  
> 
> --------------------------
> 
> Here are the results I get on Squeak 5.3
> 
> Test with 10000000 elements
> Original #sum -> Time: 143 milliseconds, Total: 4.999271889099622e6
> Naive #sum -> Time: 115 milliseconds, Total: 4.999271889099622e6
> Inject #sum -> Time: 102 milliseconds, Total: 4.999271889099622e6
> 
> 
> 
> ----------------- 
> Benoît St-Jean 
> Yahoo! Messenger: bstjean 
> Twitter: @BenLeChialeux 
> Pinterest: benoitstjean 
> Instagram: Chef_Benito
> IRC: lamneth 
> GitHub: bstjean
> Blogue: endormitoire.wordpress.com 
> "A standpoint is an intellectual horizon of radius zero".  (A.
> Einstein)
> 
> 
>  On Thursday, January 6, 2022, 03:38:22 p.m. EST, Jimmie Houchin
> <[email protected]> wrote: 
> 
> 
> I have written a micro benchmark which stresses a language in areas 
> which are crucial to my application.
> 
> I have written this micro benchmark in Pharo, Crystal, Nim, Python, 
> PicoLisp, C, C++, Java and Julia.
> 
> On my i7 laptop Julia completes it in about 1 minute and 15 seconds, 
> amazing magic they have done.
> 
> Pharo takes over 2 hours. :(
> 
> In my benchmarks if I comment out the sum and average of the array. It 
> completes in 3.5 seconds.
> And when I sum the array it gives the correct results. So I can verify 
> its validity.
> 
> over the array and do calculations on each value of the array and
> update 
> the array and sum and average at each value simple to stress array 
> access and sum and average.
> 
> 28800 is simply derived from time series one minute values for 5 days,
> 4 
> weeks.
> 
> randarray := Array new: 28800.
> 
> 1 to: randarray size do: [ :i | randarray at: i put: Number random ].
> 
> here." randarray sum. randarray average ]] timeToRun.
> 
> randarrayttr. "0:00:00:36.135"
> 
> 
> I do 2 loops with 100 iterations each.
> 
> randarrayttr * 200. "0:02:00:27"
> 
> 
> I learned early on in this adventure when dealing with compiled 
> languages that if you don’t do a lot, the test may not last long enough
> to give any times.
> 
> Pharo is my preference. But this is an awful big gap in performance. 
> When doing backtesting this is huge. Does my backtest take minutes, 
> hours or days?
> 
> I am not a computer scientist nor expert in Pharo or Smalltalk. So I do
> not know if there is anything which can improve this.
> 
> 
> However I have played around with several experiments of my #sum:
> method.
> 
> This implementation reduces the time on the above randarray in half.
> 
> sum: col
> | sum |
> sum := 0.
> 1 to: col size do: [ :i |
>       sum := sum + (col at: i) ].
> ^ sum
> 
> randarrayttr2 := [ 1 to: randarray size do: [ :i | "other calculations 
> here."
>      ltsa sum: randarray. ltsa sum: randarray ]] timeToRun.
> randarrayttr2. "0:00:00:18.563"
> 
> And this one reduces it a little more.
> 
> sum10: col
> | sum |
> sum := 0.
> 1 to: ((col size quo: 10) * 10) by: 10 do: [ :i |
>       sum := sum + (col at: i) + (col at: (i + 1)) + (col at: (i + 2))
> + 
> (col at: (i + 3)) + (col at: (i + 4))
>           + (col at: (i + 5)) + (col at: (i + 6)) + (col at: (i + 7)) +
> (col at: (i + 8)) + (col at: (i + 9))].
> ((col size quo: 10) * 10 + 1) to: col size do: [ :i |
>       sum := sum + (col at: i)].
> ^ sum
> 
> randarrayttr3 := [ 1 to: randarray size do: [ :i | "other calculations 
> here."
>      ltsa sum10: randarray. ltsa sum10: randarray ]] timeToRun.
> randarrayttr3. "0:00:00:14.592"
> 
> It closes the gap with plain Python3 no numpy. But that is a pretty low
> standard.
> 
> Any ideas, thoughts, wisdom, directions to pursue.
> 
> Thanks
> 
> Jimmie
>

[Pharo-dev] Re: Array sum. is very slow

Reply via email to