On 08/24/2017 06:02 PM, Halil Pasic wrote: > > > On 08/24/2017 05:53 PM, Farhan Ali wrote: >> >> >> On 08/24/2017 11:50 AM, Thomas Huth wrote: >>> On 24.08.2017 17:47, Halil Pasic wrote: >>>> >>>> >>>> On 08/24/2017 05:35 PM, Thomas Huth wrote: >>>>> On 24.08.2017 17:13, Cornelia Huck wrote: >>>>>> On Thu, 24 Aug 2017 11:05:08 -0400 >>>>>> Farhan Ali <al...@linux.vnet.ibm.com> wrote: >>>>>> >>>>>>> Hi, >>>>>>> >>>>>>> There is an issue in QEMU bios which is exposed by commit >>>>>>> >>>>>>> commit 198c0d1f9df8c429502cb744fc26b6ba6e71db74 >>>>>>> Author: Halil Pasic <pa...@linux.vnet.ibm.com> >>>>>>> Date: Thu Jul 27 17:48:42 2017 +0200 >>>>>>> >>>>>>> s390x/css: check ccw address validity >>>>>>> >>>>>>> According to the PoP channel command words (CCW) must be doubleword >>>>>>> aligned and 31 bit addressable for format 1 and 24 bit addressable >>>>>>> for >>>>>>> format 0 CCWs. >>>>>>> >>>>>>> If the channel subsystem encounters a ccw address which does not >>>>>>> satisfy >>>>>>> this alignment requirement a program-check condition is recognised. >>>>>>> >>>>>>> The situation with 31 bit addressable is a bit more complicated: >>>>>>> both the >>>>>>> ORB and a format 1 CCW TIC hold the address of (the rest of) the >>>>>>> channel >>>>>>> program, that is the address of the next CCW in a word, and the PoP >>>>>>> mandates that bit 0 of that word shall be zero -- or a >>>>>>> program-check >>>>>>> condition is to be recognized -- and does not belong to the field >>>>>>> holding >>>>>>> the ccw address. >>>>>>> >>>>>>> Since in code the corresponding fields span across the whole word >>>>>>> (unlike >>>>>>> in PoP where these are defined as 31 bit wide) we can check this by >>>>>>> applying a mask. The 24 addressable case isn't affecting TIC >>>>>>> because the >>>>>>> address is composed of a halfword and a byte portion (no additional >>>>>>> zero >>>>>>> bit requirements) and just slightly complicates the ORB case where >>>>>>> also >>>>>>> bits 1-7 need to be zero. >>>>>>> >>>>>>> The same requirements (especially n-bit addressability) apply to >>>>>>> the >>>>>>> ccw addresses generated while chaining. >>>>>>> >>>>>>> Let's make our CSS implementation follow the AR more closely. >>>>>>> >>>>>>> Signed-off-by: Halil Pasic <pa...@linux.vnet.ibm.com> >>>>>>> Message-Id: <20170727154842.23427-1-pa...@linux.vnet.ibm.com> >>>>>>> Reviewed-by: Dong Jia Shi <bjsdj...@linux.vnet.ibm.com> >>>>>>> Signed-off-by: Cornelia Huck <coh...@redhat.com> >>>>>>> >>>>>>> >>>>>>> It looks like the bios does not create a double word aligned CCW. >>>>>>> Looking at the bios code we the CCW1 struct is not aligned >>>>>>> >>>>>>> /* channel command word (type 1) */ >>>>>>> struct ccw1 { >>>>>>> __u8 cmd_code; >>>>>>> __u8 flags; >>>>>>> __u16 count; >>>>>>> __u32 cda; >>>>>>> } __attribute__ ((packed)); >>>>>>> >>>>>>> and it looks like the compiler does not guarantee a doubleword >>>>>>> alignment. >>>>>> >>>>>> :( >>>>>> >>>>>>> >>>>>>> The weird thing about it is I see it break in one of my system and works >>>>>>> fine in another system. Trying a simple fix of aligning the struct also >>>>>>> doesn't seem to work all the time. >>>>>> >>>>>> I have not seen this problem on any of the systems I tested on (well, I >>>>>> would not have merged this if I did...) - RHEL 7 and F26. Do we need a >>>>>> dynamic allocation to guarantee alignment? >>>>> >>>>> I guess the problem is the __attribute__((packed)) here - AFAIK GCC then >>>>> sometimes assumes that these structs can be byte-aligned. Does it work >>>>> if you remove the __attribute__((packed)) here? If yes, I think that >>>>> would be a valid fix, since there should not be any padding in this >>>>> struct at all (and if you're afraid, you could add an >>>>> assert(sizeof(struct ccw1) == 8) somewhere). >>>> >>>> I don't think this packed is doing us any good. But even >>>> with the packed removed I not sure we would end up being >>>> 8 byte aligned (dobuleword). Wouldn't it be just 4 byte >>>> aligned (according to the ELF ABI supplement for s390) >>>> as the most strictly aligned member is __u32? >>> >>> True, so that could still be an issue. Looking at the cio.h in the >>> kernel, they define the struct like this: >>> >>> struct ccw1 { >>> __u8 cmd_code; >>> __u8 flags; >>> __u16 count; >>> __u32 cda; >>> } __attribute__ ((packed,aligned(8))); >>> >>> So I guess adding the aligned(8) is the right way to go? >>> >>> Thomas >>> >> >> This was my initial fix and it works on my system. But for some reason this >> fix does not work on my colleague's system. So I am hesitant about >> submitting this fix > > That sounds mysterious. Could you debug that with your colleague > and verify that the problem is indeed the alignment? If it > is I would have to re-read the gcc doc because I can't think of > anything else than a bug either in my understanding or in their product. > > A stupid question is he testing with the current master?
FWIW, The ELF ABI mandates an 8 byte alignment (double word in IBM speak) for every stack frame, so with aligned(8) this should work.