Make sure you have sar/sysstat enabled and changed to do 1 minute samples.

sar -d will show disk perf.  If one of the disks "blips" at the
firmware level (working on a hard to read block maybe), the util% on
that device will be significantly higher than all other disks so will
stand out.  Then you can look deeper at the smart data.

sar generically will show your cpu/system time and sar -n DEV will
show detailed network traffic, sar -n EDEV will show network errors.

With it set to 1 minute you should be able to detect most blips.

On Sun, Sep 26, 2021 at 10:26 AM Jamie Fargen <ja...@fargenable.com> wrote:
>
> Are there network switches under your control? It sounds similar to what 
> happens when MTU on the systems MTU do not match or one system MTU is set 
> above the value on the switch ports.
>
> Next time the issue occurs use ping with the do not fragment flag.
> ex $ ping -m DO -s 8972 ip.address
>
> This example should be the highest value to work in the case of MTU size 
> 9000, there is 28 byte overhead for IPv4 packets.
>
> Second, are you sure no one is attaching to the network and duplicating the 
> MAC address of your NFS server or perhaps the system that is stalled? If the 
> switches are manageable you would have to insure that the MAC addresses are 
> being learned on the correct ports.
>
> -Jamie
>
>
> On Sun, Sep 26, 2021 at 10:24 AM Tom Horsley <horsley1...@gmail.com> wrote:
>>
>> On Sun, 26 Sep 2021 10:26:19 -0300
>> George N. White III wrote:
>>
>> > If you have cron jobs that use a lot of network bandwidth it may work
>> > fine until some network issue causing lots of retransmits bogs it down.
>>
>> Which is why you should check the dumb stuff first! Has a critter
>> chewed on the ethernet cable to the server?
>> _______________________________________________
>> users mailing list -- users@lists.fedoraproject.org
>> To unsubscribe send an email to users-le...@lists.fedoraproject.org
>> Fedora Code of Conduct: 
>> https://docs.fedoraproject.org/en-US/project/code-of-conduct/
>> List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
>> List Archives: 
>> https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org
>> Do not reply to spam on the list, report it: 
>> https://pagure.io/fedora-infrastructure
>
> _______________________________________________
> users mailing list -- users@lists.fedoraproject.org
> To unsubscribe send an email to users-le...@lists.fedoraproject.org
> Fedora Code of Conduct: 
> https://docs.fedoraproject.org/en-US/project/code-of-conduct/
> List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
> List Archives: 
> https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org
> Do not reply to spam on the list, report it: 
> https://pagure.io/fedora-infrastructure
_______________________________________________
users mailing list -- users@lists.fedoraproject.org
To unsubscribe send an email to users-le...@lists.fedoraproject.org
Fedora Code of Conduct: 
https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: 
https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org
Do not reply to spam on the list, report it: 
https://pagure.io/fedora-infrastructure

Reply via email to