On 04/04/2018 06:43 AM, Peter Bergner wrote: > On 4/4/18 4:06 AM, Tamar Christina wrote: >> Now that I know how the loads are done, I have a patch should be both >> correct and generate better code in most cases. >> It just calculates bitsize inside the loop and does the copying in the >> largest mode possible that's equal or less than the bits >> That are to be copied. This avoids the issue with reading too much, honors >> padding and alignment and still generates better code >> In most cases. >> >> I'm running the regression tests and should have a final version soon. > > If you give me your patch when it's ready, I'll do bootstrap and regression > testing on powerpc*-linux and verify it fixes the issues we hit. Similarly, I've got a jenkins instance here were I can get a bootstrap and regression test on the usual targets like aarch64, armv7, i686, powerpc (32, 64, 64le), s390x, sparc64, x86_64. But more interestingly it'll also do a bootstrap test on alpha, hppa, m68k, sh4 and other oddballs like aarch64-be.
A patch for the tip of the trunk is all I need. It doesn't run the testsuite, but the ability to bootstrap on the lesser used targets gives a level of code generator validation that is helpful. Takes about 24hrs to cycle through everything... jeff