Hi Johan,

On 07/18/2016 09:53 AM, Johan Kooijman wrote:
Hi Jeff,

was the issue ever resolved? Don't have permissions to view the bugzilla.

There are proposal patches in the bugzilla, I have requested more information about upstream status.
As soon I have updates, I will reply here.

For now, if you have the hardware and want to give a test against our latest upstream build jobs, links below:

ovirt-node 3.6:
http://jenkins.ovirt.org/job/ovirt-node_ovirt-3.6_create-iso-el7_merged/

ovirt-node 4.0 (next):
http://jenkins.ovirt.org/job/ovirt-node-ng_ovirt-4.0-snapshot_build-artifacts-fc23-x86_64/

Thanks!


On Thu, Mar 17, 2016 at 4:34 PM, Jeff Spahr <[email protected] <mailto:[email protected]>> wrote:

    I had the same issue, and I also have a support case open.  They
    referenced https://bugzilla.redhat.com/show_bug.cgi?id=1288237
    which is private.  I didn't have any success getting that bugzilla
    changed to public.  We couldn't keep waiting for the issue to be
    fixed so we replaced the NICs with Broadcom/Qlogic that we knew
    had no issues in other hosts.

    On Thu, Mar 17, 2016 at 11:27 AM, Sigbjorn Lie
    <[email protected] <mailto:[email protected]>> wrote:

        Hi,

        Is this on CentOS/RHEL 7.2?

        Log in as root as see if you can see any messages from ixgbe
        about "tx queue hung" in dmesg. I
        currently have an open support case for RHEL7.2 and the ixgbe
        driver, where there is a driver
        issue causing the network adapter to reset continuously when
        there are network traffic.


        Regards,
        Siggi



        On Thu, March 17, 2016 12:52, Nir Soffer wrote:
        > On Thu, Mar 17, 2016 at 10:49 AM, Johan Kooijman
        <[email protected] <mailto:[email protected]>> wrote:
        >
        >> Hi all,
        >>
        >>
        >> Since we upgraded to the latest ovirt node running 7.2,
        we're seeing that
        >> nodes become unavailable after a while. It's running fine,
        with a couple of VM's on it, untill it
        >> becomes non responsive. At that moment it doesn't even
        respond to ICMP. It'll come back by
        >> itself after a while, but oVirt fences the machine before
        that time and restarts VM's elsewhere.
        >>
        >>
        >> Engine tells me this message:
        >>
        >>
        >> VDSM host09 command failed: Message timeout which can be
        caused by
        >> communication issues
        >>
        >> Is anyone else experiencing these issues with ixgbe
        drivers? I'm running on
        >> Intel X540-AT2 cards.
        >>
        >
        > We will need engine and vdsm logs to understand this issue.
        >
        >
        > Can you file a bug and attach ful logs?
        >
        >
        > Nir
        > _______________________________________________
        > Users mailing list
        > [email protected] <mailto:[email protected]>
        > http://lists.ovirt.org/mailman/listinfo/users
        >
        >


        _______________________________________________
        Users mailing list
        [email protected] <mailto:[email protected]>
        http://lists.ovirt.org/mailman/listinfo/users



    _______________________________________________
    Users mailing list
    [email protected] <mailto:[email protected]>
    http://lists.ovirt.org/mailman/listinfo/users




--
Met vriendelijke groeten / With kind regards,
Johan Kooijman


_______________________________________________
Users mailing list
[email protected]
http://lists.ovirt.org/mailman/listinfo/users

_______________________________________________
Users mailing list
[email protected]
http://lists.ovirt.org/mailman/listinfo/users

Reply via email to