On 09/16/2013 01:07 AM, Peter Zijlstra wrote:
On Fri, Sep 13, 2013 at 11:18:21PM -0700, Andi Kleen wrote:
tip-bot for Peter Zijlstra <tip...@zytor.com> writes:
+
+ at = (struct pebs_record_nhm *)(unsigned long)ds->pebs_buffer_base;
+ top = (struct pebs_record_nhm *)(unsigned long)ds->pebs_index;
ds->pebs_index = ds->pebs_buffer_base;
+ n = (top - at) / x86_pmu.pebs_record_size;
This adds a full slow division to the PEBS hot path.
Does not seem like a improvement to me.
There already was an implicit division there, and
sizeof(pebs_record_hsw) = 176, can it really optimize that constant
division?
gcc can optimize ANY constant division, if nothing else then by
reciprocal multiplication.
-hpa
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/