On Wed, 2016-01-06 at 00:35 +0100, Hannes Frederic Sowa wrote:
>
> Tom, did you have a look if it makes sense to add a second carry
> addition train with the adcx instruction, which does not signal carry
> via the carry flag but with the overflow flag? This instruction should
> not have any dependencies with the adc instructions and could help the
> CPU to parallelize the code even more (increased instructions per cycle).

I guess adcx would (possibly) bring improvements for large areas,
but for this case the bottleneck is bringing data in from memory.

Note this topic was discussed 2 years ago and no conclusive action
was taken.

https://www.mail-archive.com/linux-kernel@vger.kernel.org/msg529610.html
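
For readers following along, the "second carry chain" idea from the quoted question can be sketched in portable C. This is only an illustration under stated assumptions: the function names (add_carry64, fold64, csum_one_chain, csum_two_chains) are hypothetical and are not the kernel's checksum code, and plain C stands in for the adc/adcx/adox instructions. In the ADX extension, adcx carries through CF and adox through OF, which is what lets two chains run without a flag dependency on each other. Whether a compiler emits those instructions for this code is not guaranteed.

```c
#include <stddef.h>
#include <stdint.h>

/* Ones' complement add on 64 bits: wrap the carry back into bit 0. */
static uint64_t add_carry64(uint64_t sum, uint64_t val)
{
    sum += val;
    return sum + (sum < val);   /* (sum < val) is 1 exactly when the add overflowed */
}

/* Fold a 64-bit ones' complement sum down to 16 bits. */
static uint16_t fold64(uint64_t sum)
{
    sum = (sum & 0xffffffffu) + (sum >> 32);
    sum = (sum & 0xffffffffu) + (sum >> 32);
    sum = (sum & 0xffffu) + (sum >> 16);
    sum = (sum & 0xffffu) + (sum >> 16);
    return (uint16_t)sum;
}

/* Single dependency chain: every add waits for the previous carry. */
uint16_t csum_one_chain(const uint64_t *w, size_t n)
{
    uint64_t s = 0;

    for (size_t i = 0; i < n; i++)
        s = add_carry64(s, w[i]);
    return fold64(s);
}

/* Two independent chains (hypothetical stand-in for an adcx/adox pair):
 * even words go to 'a', odd words to 'b', merged once at the end. */
uint16_t csum_two_chains(const uint64_t *w, size_t n)
{
    uint64_t a = 0, b = 0;
    size_t i = 0;

    for (; i + 1 < n; i += 2) {
        a = add_carry64(a, w[i]);
        b = add_carry64(b, w[i + 1]);
    }
    if (i < n)                  /* odd word count: one word left over */
        a = add_carry64(a, w[i]);
    return fold64(add_carry64(a, b));
}
```

Both functions return the same folded sum because ones' complement addition is associative and commutative; the two-chain form only shortens the dependency chain. As noted above, that helps only when the adds, rather than memory loads, are the limiting factor.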