Re: [vpp-dev] [v17.07.01]: vec_add2() causing crash on ARMv8

Saxena, Nitin Mon, 02 Oct 2017 09:18:07 -0700

Hi Dave, Brian,


In VPP v17.07.01. Size of uword on aarch64 machine is 8 byte. 
(src/vppinfra/types.h) and in my case value of &dq->interrupt_pending is 4 byte 
aligned address which is why I am getting SIGBUS error on my ThunderX2. Am I 
missing / or doing something error here? Brian are you using 64 bit machine? As 
I am surprised on all x86_64 and aarch64 machines uword should be 8 byte.


I have tested on ThunderX2 that __sync_lock_test_and_set(addr,new) works well 
on uint32_t data type saved at 4 byte aligned pointer. So that works well.


Please let me know whats a way forward. Using vec_add2_ha() in such case can be 
the possible solution since Dave is not in favor of changing vec_add2() with 8 
byte default alignment.


Thanks,

Nitin

________________________________
From: vpp-dev-boun...@lists.fd.io <vpp-dev-boun...@lists.fd.io> on behalf of 
Dave Barach (dbarach) <dbar...@cisco.com>
Sent: Saturday, September 30, 2017 2:24 AM
To: Brian Brooks
Cc: Narayana, Prasad Athreya; Damjan Marion (damarion); vpp-dev@lists.fd.io; 
Saxena, Nitin
Subject: Re: [vpp-dev] [v17.07.01]: vec_add2() causing crash on ARMv8

As a quick hack: try moving "u32 interrupt_pending;" to the start of the 
structure...

Thanks… Dave

-----Original Message-----
From: Brian Brooks [mailto:brian.bro...@arm.com]
Sent: Friday, September 22, 2017 12:33 PM
To: Dave Barach (dbarach) <dbar...@cisco.com>
Cc: Saxena, Nitin <nitin.sax...@cavium.com>; vpp-dev@lists.fd.io; Damjan Marion 
(damarion) <damar...@cisco.com>; Narayana, Prasad Athreya 
<prasadathreya.naray...@cavium.com>
Subject: Re: [vpp-dev] [v17.07.01]: vec_add2() causing crash on ARMv8

On 09/28 11:57:36, Dave Barach (dbarach) wrote:
> Dear Nitin,
>
> First off: exactly which LDXR / STXR instruction variant pairs is generated? 
> I begin to wonder if __sync_lock_test_and_set(...) might not be doing you any 
> favors. Given that dq->interrupt_pending is a u32, I would have expected a 
> 4-byte instruction with (at worst) a 4-byte alignment requirement.

It's true that a LDXR of 4 bytes only requires 4 byte alignment (not 8).

For the TAS, objdump vhost-user.o shows

  ldxr   w0, [x1]
  stxr   w3, w2, [x1]
  cbnz   w3, ..

These instructions are operating on 4 byte data because of the use of a
'w' register instead of a 'x' register to hold the actual value.

Nitin, can you confirm you see the same generated code? If so, is
&dq->interrupt_pending 4 byte aligned?

> Are there any alignment restrictions on the 1-byte variants LDXRB / STXRB?
>
> If not: since we use dq->interrupt_pending as a one-bit flag, declaring it as 
> a u8 - or casting &dq->interrupt_pending to (u8 *) in an arch-dependent 
> fashion - might make the pain go away.
>
> Aligning every vector in the system will waste memory, and will not legislate 
> this class of problem out of existence. So, I wouldn't want to force 8-byte 
> alignment in the way you mention.
>
> Anyhow, aligning the first vector element to an 8-byte boundary says little 
> about the layout of elements within each vector element, especially if the 
> structure is packed.
>
> If dq->interrupt_pending needs to be aligned to a specific boundary without 
> fail, the only completely reliable method would be to pack and pad the 
> structure e.g. to a multiple of 8 octets and ensure that interrupt_pending 
> lands on the required boundary. Then use vec_add2_ha (...) to manipulate the 
> vector.
>
> HTH... Dave
>
> From: vpp-dev-boun...@lists.fd.io [mailto:vpp-dev-boun...@lists.fd.io] On 
> Behalf Of Saxena, Nitin
> Sent: Thursday, September 28, 2017 4:53 AM
> To: vpp-dev@lists.fd.io
> Cc: Narayana, Prasad Athreya <prasadathreya.naray...@cavium.com>
> Subject: [vpp-dev] [v17.07.01]: vec_add2() causing crash on ARMv8
>
>
> Hi All,
>
>
>
> I got a crash with vpp v17.07.01 on ARMv8 Soc 
> @src/vnet/devices/virtio/vhost-user.c: Line no: 1852
>
>
> if (clib_smp_swap (&dq->interrupt_pending, 0) ||
>     (node->state == VLIB_NODE_STATE_POLLING)){
> }
>
> While debugging it turns out that value of (&dq->interrupt_pending) was not 8 
> byte aligned hence causing SIGBUS error on ARMv8 SoC. Further debugging tells 
> that dq was added in vector using vec_add2 (src/vnet/devices/devices.c Line 
> no: 152)
>
> vec_add2 (rt->devices_and_queues, dq, 1)
>
> which uses 0 byte alignment. Changing vec_add2 to vec_add2_aligned() fixed 
> the problem. My question is can we completely define vec_add2() as
>
> #define vec_add2(V,P,N)           vec_add2_ha(V,P,N,0,8) instead of #define 
> vec_add2(V,P,N)           vec_add2_ha(V,P,N,0,0)
>
> This can be helpful for all architecture.
>
> Thanks,
> Nitin
>
>
>

> _______________________________________________
> vpp-dev mailing list
> vpp-dev@lists.fd.io
> https://lists.fd.io/mailman/listinfo/vpp-dev

_______________________________________________
vpp-dev mailing list
vpp-dev@lists.fd.io
https://lists.fd.io/mailman/listinfo/vpp-dev

_______________________________________________
vpp-dev mailing list
vpp-dev@lists.fd.io
https://lists.fd.io/mailman/listinfo/vpp-dev

Re: [vpp-dev] [v17.07.01]: vec_add2() causing crash on ARMv8

Reply via email to