Florian Klaempfl wrote:
Daniël Mantione wrote:

To get a large speedup, I think that instead of pairing up doubles you should process the pixels in parallel. In this benchmark a row is 3000 pixels wide, so make an array of 3000 doubles and perform each operation on whole arrays. With proper compiler optimization, it should be possible to reach speeds close to 2 flops per clock cycle.
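To illustrate the row-at-a-time idea, here is a rough sketch in C. It assumes the per-pixel work is a complex iteration of the form z = z*z + c, which this thread does not spell out; the function name step_row and the split into separate real and imaginary arrays are hypothetical choices for the example, not code from the benchmark or from FPC.

```c
#include <stddef.h>

/* Sketch only: assumes a per-pixel update z <- z*z + c (not stated in the thread).
   zr/zi hold the real and imaginary parts of z for a whole row,
   cr/ci hold those of c. Each array has one double per pixel. */
void step_row(size_t n, double *restrict zr, double *restrict zi,
              const double *restrict cr, const double *restrict ci)
{
    /* Iterations are independent of each other, so a vectorizing
       compiler is free to process several pixels per instruction. */
    for (size_t i = 0; i < n; i++) {
        double re = zr[i] * zr[i] - zi[i] * zi[i] + cr[i];
        double im = 2.0 * zr[i] * zi[i] + ci[i];
        zr[i] = re;
        zi[i] = im;
    }
}
```

With n = 3000 the loop walks four contiguous arrays of doubles instead of one pair of doubles at a time, which is the layout the suggestion above is asking for.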


This is planned, but currently it only spits out an internal error (IE) :)

I added the spectralnorm benchmark to svn:
http://svn.freepascal.org/svn/fpc/trunk/tests/bench/shootout/src/spectralnorm.pp

It can serve as an example where large vectors can be used. The shootout uses n=2500.
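For readers who do not know the benchmark, its core is a matrix-vector product over vectors of length n. The sketch below is in C and follows the usual shootout definition of the matrix, A(i,j) = 1/((i+j)(i+j+1)/2 + i + 1) with zero-based indices; it is not a translation of spectralnorm.pp, and the names eval_A and mul_Av are only illustrative.

```c
#include <stddef.h>

/* Entry (i, j) of the benchmark matrix, zero-based indices.
   Assumed from the usual shootout definition, not taken from the .pp file. */
static double eval_A(size_t i, size_t j)
{
    return 1.0 / (double)((i + j) * (i + j + 1) / 2 + i + 1);
}

/* out = A * v for vectors of length n. */
void mul_Av(size_t n, const double *restrict v, double *restrict out)
{
    for (size_t i = 0; i < n; i++) {
        double sum = 0.0;
        /* The inner loop runs over a long contiguous vector,
           which is where vector code generation could pay off. */
        for (size_t j = 0; j < n; j++)
            sum += eval_A(i, j) * v[j];
        out[i] = sum;
    }
}
```

With n=2500, each call performs 2500 dot products over 2500-element vectors.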

Vincent
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel
