Hi,
I have a problem with an internal LB in a VPC network: the alert "Health checks
failed: 2 failing checks on router b-27-VM..." appears.
From what I checked in cloud.log on this router, all tests pass. Unfortunately,
apart from restarting the router (which did not change anything) and migrating
it to another host, there is nothing more we can do with it. At the same time,
all health checks pass on the VPC routers.
Is there any way to diagnose this? Will restarting the VPC network recreate
the LB VR?
Log from the CloudStack management server:
2024-07-15 12:36:46,917 DEBUG [c.c.a.t.Request]
(RouterStatusMonitor-1:ctx-a525026f) (logid:a85b756e) Seq
1-6985927446981855677: Sending { Cmd , MgmtId: 94770532512064, via:
1(sda.dco.webdisk.io), Ver: v1, Flags: 100011,
[{"com.cloud.agent.api.routing.GetRouterMonitorResultsCommand":{"performFreshChecks":"false","validateBasicTestsOnly":"true","accessDetails":{"router.name":"b-27-VM","router.ip":"169.254.231.8"},"wait":"0","bypassHostMaintenance":"false"}}]
}
"},"reconfigureAfterUpdate":"true","deleteFromProcessedCache":"true","accessDetails":{"router.name":"b-27-VM","router.health.checks.advanced.interval":"10","router.health.checks.enabled":"true","router.health.checks.basic.interval":"3","router.health.checks.excluded":"","router.ip":"169.254.231.8"},"wait":"0","bypassHostMaintenance":"false"}}]
}
2024-07-15 12:36:49,750 DEBUG [c.c.a.t.Request]
(RouterStatusMonitor-1:ctx-3d117d9c) (logid:bebffb77) Seq
1-6985927446981855682: Sending { Cmd , MgmtId: 94770532512064, via:
1(sda.dco.webdisk.io), Ver: v1, Flags: 100011,
[{"com.cloud.agent.api.routing.GetRouterMonitorResultsCommand":{"performFreshChecks":"false","validateBasicTestsOnly":"false","accessDetails":{"router.name":"b-27-VM","router.ip":"169.254.231.8"},"wait":"0","bypassHostMaintenance":"false"}}]
}
2024-07-15 12:36:50,384 WARN [c.c.a.AlertManagerImpl]
(RouterStatusMonitor-1:ctx-3d117d9c) (logid:bebffb77) alertType=[9]
dataCenterId=[1] podId=[1] clusterId=[null] message=[Health checks failed: 2
failing checks on router b-27-VM / aa329dc3-4852-432e-9f37-16819b92b916].
2024-07-15 12:36:50,393 WARN [c.c.a.AlertManagerImpl]
(RouterStatusMonitor-1:ctx-3d117d9c) (logid:bebffb77) No recipients set in
global setting 'alert.email.addresses', skipping sending alert with subject
[Health checks failed: 2 failing checks on router b-27-VM /
aa329dc3-4852-432e-9f37-16819b92b916] and content [Health checks failed: 2
failing checks on router b-27-VM / aa329dc3-4852-432e-9f37-16819b92b916].
2024-07-15 12:36:50,393 WARN [c.c.n.r.VirtualNetworkApplianceManagerImpl]
(RouterStatusMonitor-1:ctx-3d117d9c) (logid:bebffb77) Health checks failed: 2
failing checks on router b-27-VM / aa329dc3-4852-432e-9f37-16819b92b916.
Checking failed health checks to see if router needs recreate
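
To see whether the same router keeps being flagged, the alert lines can be pulled out of the management log with a short script. This is only a rough sketch: the pattern is derived from the alert message format shown in the excerpt above, and the function name find_failed_health_checks is hypothetical. It reports the failing-check count, router name and router UUID, but it cannot tell which two checks failed, since the log excerpt does not name them.

```python
import re

# Pattern based on the "Health checks failed" message format in the log excerpt above.
ALERT_RE = re.compile(
    r"Health checks failed: (\d+)\s+failing checks on router\s+(\S+)\s*/\s*([0-9a-f-]{36})"
)


def find_failed_health_checks(log_text):
    """Return unique (count, router_name, router_uuid) tuples found in log_text."""
    # Wrapped log lines are joined with single spaces so that the pattern
    # still matches when a message is split across line breaks.
    flat = " ".join(log_text.split())
    seen = []
    for match in ALERT_RE.finditer(flat):
        entry = (int(match.group(1)), match.group(2), match.group(3))
        if entry not in seen:
            seen.append(entry)
    return seen


if __name__ == "__main__":
    sample = (
        "message=[Health checks failed: 2\n"
        "failing checks on router b-27-VM / aa329dc3-4852-432e-9f37-16819b92b916]."
    )
    for count, name, uuid in find_failed_health_checks(sample):
        print(f"{name} ({uuid}): {count} failing checks")
```

Running it over the full management log instead of the sample string would show whether only b-27-VM is affected or other routers as well.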
Regards,
Piotr