On Wed, Dec 20, 2017 at 2:48 PM, Richard Sandiford
<richard.sandif...@linaro.org> wrote:
> Richard Biener <richard.guent...@gmail.com> writes:
>> On Tue, Dec 12, 2017 at 4:46 PM, Richard Sandiford
>> <richard.sandif...@linaro.org> wrote:
>>> Richard Biener <richard.guent...@gmail.com> writes:
>>>> On Sun, Dec 10, 2017 at 12:20 AM, Richard Sandiford
>>>> <richard.sandif...@linaro.org> wrote:
>>>>> This patch changes vec_perm_indices from a plain vec<> to a class
>>>>> that stores a canonicalised permutation, using the same encoding
>>>>> as for VECTOR_CSTs.  This means that vec_perm_indices now carries
>>>>> information about the number of vectors being permuted (currently
>>>>> always 1 or 2) and the number of elements in each input vector.
>>>>
>>>> Before I dive into  the C++ details can you explain why it needs this
>>>> info and how it encodes it for variable-length vectors?  To interleave
>>>> two vectors you need sth like { 0, N, 1, N+1, ... }, I'm not sure we
>>>> can directly encode N here, can we?  extract even/odd should just
>>>> work as { 0, 2, 4, 6, ...} without knowledge of whether we permute
>>>> one or two vectors (the one vector case just has two times the same
>>>> vector) or how many elements each of the vectors (or the result) has.
>>>
>>> One of the later patches switches the element types to HOST_WIDE_INT,
>>> so that we can represent all ssizetypes.  Then there's a poly_int
>>> patch (not yet posted) to make that poly_int64, so that we can
>>> represent the N even for variable-length vectors.
>>>
>>> The class needs to know the number of elements because that affects
>>> the canonical representation.  E.g. extract even on fixed-length
>>> vectors with both inputs the same should be { 0, 2, 4, ..., 0, 2, 4 ... },
>>> which we can't encode as a simple series.  Interleave low with both
>>> inputs the same should be { 0, 0, 1, 1, ... } for both fixed-length and
>>> variable-length vectors.
>>
>> Huh?  extract even is { 0, 2, 4, 6, 8 ... } indexes in the selection vector
>> are referencing concat'ed input vectors.  So yes, for two same vectors
>> that's effectively { 0, 2, 4, ..., 0, 2, 4, ... } but I don't see why
>> that should
>> be the canonical form?
>
> Current practice is to use the single-input form where possible,
> if both inputs are the same (see e.g. the VEC_PERM_EXPR handling
> in fold-const.c).  It means that things like:
>
>     _1 = VEC_PERM_EXPR <a, a, { 0, 2, 4, 6, 0, 2, 4, 6 }>;
>     _2 = VEC_PERM_EXPR <a, a, { 0, 2, 4, 6, 8, 10, 12, 14 }>;
>     _3 = VEC_PERM_EXPR <a, b, { 0, 2, 4, 6, 0, 2, 4, 6 }>;
>
> get folded to the same sequence, and so can be CSEd.
>
> We could instead convert the single-input form to use the two-input
> selector, but that would be harder.  The advantage of treating the
> single-input form as canonical is that it works even for irregular
> permutes.

Ok, I see.  Maybe adding a comment along this helps.

Thanks,
Richard.

> Thanks,
> Richard
>
>>> Also, operator[] is supposed to return an in-range selector even if
>>> the selector element is only implicitly encoded.  So we need to know
>>> the number of input elements there.
>>>
>>> Separating the number of input elements into the number of inputs
>>> and the number of elements per input isn't really necessary, but made
>>> it easier to provide routines for testing whether all selected
>>> elements come from a particular input, and for rotating the selector
>>> by a whole number of inputs.
>>>
>>> Thanks,
>>> Richard

Reply via email to