在 2022/3/30 下午6:02, eop Chen 写道:


Weiwei Li <liwei...@iscas.ac.cn <mailto:liwei...@iscas.ac.cn>> 於 2022年3月30日 下午4:27 寫道:
在 2022/3/30 下午3:42, 陳約廷 写道:

Weiwei Li <liwei...@iscas.ac.cn <mailto:liwei...@iscas.ac.cn>> 於 2022年3月28日 下午7:56 寫道:


在 2022/3/7 下午3:10, ~eopxd 写道:
From: eopXD <eop.c...@sifive.com <mailto:eop.c...@sifive.com>>

Another question: max_elems is equal to total_elems when lmul >= 0.

So max_elems can be reused  here instead of caculating total_elems again.


I don’t understand your second question though. If nf = 3, there will be 3 registers involved with the instruction (namely reg, reg+1, reg+2). Why do we care about
(reg+3)?

I just consider register group here. Reg, reg+1, reg+2 and reg+3 may belong to the same register group.

Regards,

Weiwei Li


According to v-spec (under section 7.8):

    Each field will be held in successively numbered vector register
    groups. When EMUL>1
     each field will occupy a vector register group held in multiple
    successively numbered
    vector registers, and the vector register group for each field
    must follow the usual vector
    register alignment constraints (e.g., when EMUL=2 and NFIELDS=4,
    each field’s vector
    register group must start at an even vector register, but does not
    have to start at a multiple
    of 8 vector register number).


I think the spec has explained itself that NFIELDS represents the number of register groups involved in this instruction. Therefore in a register group of 4 (LMUL = m2), NFIELD should be no more than 2. The `vlmax` here would be (VLEN * 4 / EEW). In this sense, if the `vl` provided for the vector instruction is within the range 2 * vlmax / 4 <= vl <= 3 * vlmax / 4, the elements in the 4th register (namely reg+3)
will all be counted as tail elements.

I hope this answers your question.

OK, Thanks a lot. This truly answers my question, even though what I really want to know is the case for EMUL=1, and NFIELDS=3.

since NFIELDS represents the number of register groups, not take the total of EMUL * NFIELDS into one register group ,

so reg+3 should not take into tail elements for my case.

Regards,

Weiwei Li


Regards,

eop Chen


Reply via email to