Hello All: Memset and Memcpy calls are extensively used in many benchmarks. Inlining or expansion the memcpy and memset calls improves the performance of many performance Benchmark. I have implemented the expansion of strcmp to the optimizaed sequence of instruction In open64 compiler for AMD x86 target.
Can I suggest and propose to expand the memset and memcpy calls to the sequence Of instruction as many targets like ARM are moving implementation of memcpy and Memset in assembly instead of C. This makes it easier to expand the memcpy and Memset call in gcc. To implement this in GCC we need to expand similarly to the implementation as builtins. Let me know what do you think. Thanks & Regards Ajit