Re: [i386] scalar ops that preserve the high part of a vector

Michael Zolotukhin Thu, 06 Dec 2012 22:50:10 -0800

Hi guys,
Could I ask several questions just to clarify the things up?

1) Does the root problem lay in the fact that even for scalar
additions we perform the addition on the whole vector and only then
drop the higher parts of the vector? I.e. to fix the test from the PR
we need to replace plus on vector mode with plus on scalar mode?

2) Is one of the main requirements having the same pattern for V4SF
and V2DF version?

3) I don't see vec_concat in patterns from your patches, is it
explicitly generated by some x86-expander?

Anyway, I really like the idea of having some uniformity in describing
patterns for scalar instructions, so thank you for the work!

On 6 December 2012 17:42, Kirill Yukhin <kirill.yuk...@gmail.com> wrote:
>>> Yes, the approach taken in this patch looks really good to me. There
>>> should be no code differences with your patch, but let's ask HJ for
>>> his opinion on intrinsics header changes.
>>
>> Hi Kirill,
>>
>> Can you take  a look?  Thanks.
>
> Hi guys, I like changes in intrinsics header.
>
> Thanks, K

-- 
---
Best regards,
Michael V. Zolotukhin,
Software Engineer
Intel Corporation.

Re: [i386] scalar ops that preserve the high part of a vector

Reply via email to