On 6 May 2011 14:13, Julian Brown <jul...@codesourcery.com> wrote: > Hi, > > This is the second of two patches to add unaligned-access support to > the ARM backend. It builds on the first patch to provide support for > unaligned accesses when expanding block moves (i.e. for builtin memcpy > operations). It makes some effort to use load/store multiple > instructions where appropriate (when accessing sufficiently-aligned > source or destination addresses), and also makes some effort to > generate fast code (for -O1/2/3) or small code (for -Os), though some > of the heuristics may need tweaking still
Sorry it's taken me a while to get around to this one. Do you know what difference this makes to performance on some standard benchmarks on let's say an A9 and an M4 as I see that this gets triggered only when we have less than 64 bytes to copy. ? Please add a few testcases from the examples that you've shown here to be sure that ldm's are being generated in the right cases. cheers Ramana