On May 05 2011, at 16:46, David Gilbert was caught saying:
> On 5 May 2011 16:08, Måns Rullgård <m...@mansr.com> wrote:
> > David Gilbert <david.gilb...@linaro.org> writes:
> >> Not quite:
> >>   a) Neon memcpy/memset is worse on A9 than non-neon versions (better
> >> on A8 typically)
> >
> > That is not my experience at all.  On the contrary, I've seen memcpy
> > throughput on A9 roughly double with use of NEON for large copies.
> > For small copies, plain ARM is might be faster since the overhead of
> > preparing for a properly aligned NEON loop is avoided.
> >
> > What do you base your claims on?
> 
> My tests here:
> https://wiki.linaro.org/WorkingGroups/ToolChain/Benchmarks/InitialMemcpy
> 
> at the bottom of the page are sets of graphs for A9 (left) and A8 (right);
> on A9 the Neon memcpy's (red and green) top out much lower than their non-neon
> best equivalents (black and cyan).  I've seen different results for
> very non-aligned
> copies, where the vld/vst on Neon work very well.

Looking at the top part of the page, it looks like when doing large size
copies, NEON has an obvious advantage; however, I'm not sure how often
we do copies of that magnitude in the kernel (I would hope rarely) but
I don't know that we have numbers tracking average copy sizes for 
different workloads. I don't think going for a one-size-fits all
approach is the ideal and instead we should provide both build
and and runtime configurability (something similar to the RAID
code's boot-up performance tests) to allow for selection of the
appropriate memcpy implementation.

> > I don't see the connection between Thumb2 and memcpy performance.
> > Thumb2 can do anything 32-bit ARM can.
> 
> There are the purists who says write everything in Thumb2 now; however
> there is an
> interesting question of which is faster, and IMHO the ARM code is
> likely to be a bit
> faster in most cases.

Do we have numbers for this? :)

~Deepak

-- 
"People think all we need to fix our predicament is a free source of
energy, but I think we need to change out behaviour. More energy would
just deplete the Earth's lifeblood faster." - Janine Benyius

_______________________________________________
linaro-dev mailing list
linaro-dev@lists.linaro.org
http://lists.linaro.org/mailman/listinfo/linaro-dev

Reply via email to