These logs are from this morning.
I rebuilt my CloudStack environment and now the System VMs have
connectivity, so much so that I can access the VMs via SSH and ping the
management, public, and storage networks through it.
I solved the logs you sent me by simply rebuilding the environment (and
network) from scratch.
The SSL logs I mentioned are reported below:
2025-08-11 20:34:11,995 DEBUG [c.c.c.ClusterManagerImpl]
(Cluster-Worker-5:[ctx-f152a4c5]) (logid:099f2acb) Cluster PDU
97591085894372 -> 97591085894372 completed. time: 4ms. agent: 0, pdu
seq: 263, pdu ack seq: 0, json:
{"managementServerHostId":1,"managementServerHostUuid":"4cfdc9b0-565e-4920-bc32-d079f456fc8b","managementServerRunId":1754939509540,"collectionTime":"Aug
11, 2025, 8:34:11
PM","sessions":2,"cpuUtilization":0.0,"totalJvmMemoryBytes":1124597760,"freeJvmMemoryBytes":384206344,"maxJvmMemoryBytes":1908932607,"processJvmMemoryBytes":0,"jvmUptime":15765252,"jvmStartTime":1754939486632,"availableProcessors":32,"loadAverage":0.17,"totalInit":1591017472,"totalUsed":1012164680,"totalCommitted":1401356288,"pid":217209,"jvmName":"217209@cloudstack-onexbh","jvmVendor":"Ubuntu","jvmVersion":"17.0.15+6-Ubuntu-0ubuntu120.04","osDistribution":"Ubuntu
20.04.6
LTS","agentCount":5,"heapMemoryUsed":741131168,"heapMemoryTotal":1908932608,"threadsBlockedCount":0,"threadsDaemonCount":30,"threadsRunnableCount":23,"threadsTerminatedCount":0,"threadsTotalCount":1427,"threadsWaitingCount":1317,"systemMemoryTotal":101304950784,"systemMemoryFree":94171963392,"systemMemoryUsed":1912644,"systemMemoryVirtualSize":22671781888,"logInfo":"","systemTotalCpuCycles":43622.244,"systemLoadAverages":[0.17,0.1,0.09],"systemCyclesUsage":[652771,292009,862649072],"dbLocal":true,"usageLocal":false,"systemBootTime":"Aug
8, 2025, 5:24:32 PM","kernelVersion":"5.4.0-216-generic"}
2025-08-11 20:34:11,995 DEBUG [c.c.c.ClusterManagerImpl]
(Cluster-Worker-5:[ctx-f152a4c5]) (logid:099f2acb) Cluster PDU
97591085894372 -> 97591085894372. agent: 0, pdu seq: 263, pdu ack seq:
0, json:
{"managementServerHostId":1,"managementServerHostUuid":"4cfdc9b0-565e-4920-bc32-d079f456fc8b","managementServerRunId":1754939509540,"collectionTime":"Aug
11, 2025, 8:34:11
PM","sessions":2,"cpuUtilization":0.0,"totalJvmMemoryBytes":1124597760,"freeJvmMemoryBytes":384206344,"maxJvmMemoryBytes":1908932607,"processJvmMemoryBytes":0,"jvmUptime":15765252,"jvmStartTime":1754939486632,"availableProcessors":32,"loadAverage":0.17,"totalInit":1591017472,"totalUsed":1012164680,"totalCommitted":1401356288,"pid":217209,"jvmName":"217209@cloudstack-onexbh","jvmVendor":"Ubuntu","jvmVersion":"17.0.15+6-Ubuntu-0ubuntu120.04","osDistribution":"Ubuntu
20.04.6
LTS","agentCount":5,"heapMemoryUsed":741131168,"heapMemoryTotal":1908932608,"threadsBlockedCount":0,"threadsDaemonCount":30,"threadsRunnableCount":23,"threadsTerminatedCount":0,"threadsTotalCount":1427,"threadsWaitingCount":1317,"systemMemoryTotal":101304950784,"systemMemoryFree":94171963392,"systemMemoryUsed":1912644,"systemMemoryVirtualSize":22671781888,"logInfo":"","systemTotalCpuCycles":43622.244,"systemLoadAverages":[0.17,0.1,0.09],"systemCyclesUsage":[652771,292009,862649072],"dbLocal":true,"usageLocal":false,"systemBootTime":"Aug
8, 2025, 5:24:32 PM","kernelVersion":"5.4.0-216-generic"}
2025-08-11 20:34:11,995 DEBUG [c.c.c.ClusterServiceServletImpl]
(Cluster-Worker-5:[ctx-f152a4c5]) (logid:099f2acb) Executing
ClusterServicePdu with service URL:
https://127.0.0.1:9090/clusterservice
2025-08-11 20:34:11,998 ERROR [c.c.c.ClusterServiceServletImpl]
(Cluster-Worker-5:[ctx-f152a4c5]) (logid:099f2acb) Exception from :
https://127.0.0.1:9090/clusterservice, method : null, exception :
javax.net.ssl.SSLPeerUnverifiedException: Certificate for <127.0.0.1>
doesn't match any of the subject alternative names:
[fe80:0:0:0:5ac2:32ff:fe02:ae4, 10.31.4.50, 10.31.3.1,
cloudstack-onexbh, cloudstack.internal]
at
org.apache.http.conn.ssl.SSLConnectionSocketFactory.verifyHostname(SSLConnectionSocketFactory.java:507)
at
org.apache.http.conn.ssl.SSLConnectionSocketFactory.createLayeredSocket(SSLConnectionSocketFactory.java:437)
at
org.apache.http.conn.ssl.SSLConnectionSocketFactory.connectSocket(SSLConnectionSocketFactory.java:384)
at
org.apache.http.impl.conn.DefaultHttpClientConnectionOperator.connect(DefaultHttpClientConnectionOperator.java:142)
at
org.apache.http.impl.conn.PoolingHttpClientConnectionManager.connect(PoolingHttpClientConnectionManager.java:376)
at
org.apache.http.impl.execchain.MainClientExec.establishRoute(MainClientExec.java:393)
at
org.apache.http.impl.execchain.MainClientExec.execute(MainClientExec.java:236)
at
org.apache.http.impl.execchain.ProtocolExec.execute(ProtocolExec.java:186)
at
org.apache.http.impl.execchain.RetryExec.execute(RetryExec.java:89)
at
org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:110)
at
org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:185)
at
org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:83)
at
org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:108)
at
com.cloud.cluster.ClusterServiceServletImpl.executePostMethod(ClusterServiceServletImpl.java:143)
at
com.cloud.cluster.ClusterServiceServletImpl.execute(ClusterServiceServletImpl.java:106)
at
com.cloud.cluster.ClusterManagerImpl.onSendingClusterPdu(ClusterManagerImpl.java:275)
at
com.cloud.cluster.ClusterManagerImpl$1.runInContext(ClusterManagerImpl.java:235)
at
org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
at
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
at
org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
at
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
at
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
at java.base/java.lang.Thread.run(Thread.java:840)
2025-08-11 20:34:11,998 DEBUG [c.c.c.ClusterManagerImpl]
(Cluster-Worker-5:[ctx-f152a4c5]) (logid:099f2acb) Cluster PDU
97591085894372 -> 97591085894372 completed. time: 3ms. agent: 0, pdu
seq: 263, pdu ack seq: 0, json:
{"managementServerHostId":1,"managementServerHostUuid":"4cfdc9b0-565e-4920-bc32-d079f456fc8b","managementServerRunId":1754939509540,"collectionTime":"Aug
11, 2025, 8:34:11
PM","sessions":2,"cpuUtilization":0.0,"totalJvmMemoryBytes":1124597760,"freeJvmMemoryBytes":384206344,"maxJvmMemoryBytes":1908932607,"processJvmMemoryBytes":0,"jvmUptime":15765252,"jvmStartTime":1754939486632,"availableProcessors":32,"loadAverage":0.17,"totalInit":1591017472,"totalUsed":1012164680,"totalCommitted":1401356288,"pid":217209,"jvmName":"217209@cloudstack-onexbh","jvmVendor":"Ubuntu","jvmVersion":"17.0.15+6-Ubuntu-0ubuntu120.04","osDistribution":"Ubuntu
20.04.6
LTS","agentCount":5,"heapMemoryUsed":741131168,"heapMemoryTotal":1908932608,"threadsBlockedCount":0,"threadsDaemonCount":30,"threadsRunnableCount":23,"threadsTerminatedCount":0,"threadsTotalCount":1427,"threadsWaitingCount":1317,"systemMemoryTotal":101304950784,"systemMemoryFree":94171963392,"systemMemoryUsed":1912644,"systemMemoryVirtualSize":22671781888,"logInfo":"","systemTotalCpuCycles":43622.244,"systemLoadAverages":[0.17,0.1,0.09],"systemCyclesUsage":[652771,292009,862649072],"dbLocal":true,"usageLocal":false,"systemBootTime":"Aug
8, 2025, 5:24:32 PM","kernelVersion":"5.4.0-216-generic"}
2025-08-11 20:34:12,063 DEBUG [c.c.s.S.VmStatsCollector]
(StatsCollector-3:[ctx-e9f9863e]) (logid:3a21ac65) VmStatsCollector is
running to process VMs across 3 UP hosts
2025-08-11 20:34:12,067 DEBUG [c.c.a.m.ClusteredAgentManagerImpl]
(StatsCollector-3:[ctx-e9f9863e]) (logid:3a21ac65) Wait time setting on
com.cloud.agent.api.GetVmStatsCommand is 1800 seconds
2025-08-11 20:34:12,067 DEBUG [c.c.a.m.ClusteredDirectAgentAttache]
(StatsCollector-3:[ctx-e9f9863e]) (logid:3a21ac65) Seq
5-3902650552093245880: Routed from 97591085894372
2025-08-11 20:34:12,068 DEBUG [c.c.a.m.D.Task]
(DirectAgent-90:[ctx-4073f5c6]) (logid:8267a6c4) Seq
5-3902650552093245880: Executing request
2025-08-11 20:34:12,068 DEBUG [c.c.h.v.r.VmwareResource]
(DirectAgent-90:[ctx-4073f5c6]) (logid:3a21ac65) Executing resource
command GetVmStatsCommand: [].
I remain at your disposal and thank you for your support!!
Kayo Henrique
Analista de Infraestrutura e Redes
OneX Data Centers
Em 2025-08-12 07:03, Prashanth Reddy escreveu:
From the logs I don't see an issue with SSL certs itself, we clearly
see management servers are unable to connect to the systemVM's.
For the SSVM and CPVM to work, management servers should be able to ssh
into the systemVM's on the POD IP's ( in case of vmware) assigned. From
the logs we see this as failing as below
Logs for SSVM ( similar logs are seen for CPVM as well)
2025-08-11 11:07:53,220 DEBUG [c.c.h.v.r.VmwareResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) VM s-31-VM has been started
successfully with hostname s-31-VM.
2025-08-11 11:07:53,220 DEBUG [c.c.a.r.v.VirtualRoutingResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Trying to connect to 10.42.0.93
2025-08-11 11:07:56,276 DEBUG [c.c.a.r.v.VirtualRoutingResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Could not connect to 10.42.0.93
2025-08-11 11:08:01,276 DEBUG [c.c.a.r.v.VirtualRoutingResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Trying to connect to 10.42.0.93
2025-08-11 11:08:04,340 DEBUG [c.c.a.r.v.VirtualRoutingResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Could not connect to 10.42.0.93
2025-08-11 11:08:09,340 DEBUG [c.c.a.r.v.VirtualRoutingResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Unable to logon to 10.42.0.93
2025-08-11 11:08:09,340 DEBUG [c.c.a.r.v.VirtualRoutingResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Trying to connect to 10.42.0.93
2025-08-11 11:23:52,828 DEBUG [c.c.a.r.v.VirtualRoutingResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Trying to connect to 10.42.0.93
2025-08-11 11:23:55,892 DEBUG [c.c.a.r.v.VirtualRoutingResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Could not connect to 10.42.0.93
2025-08-11 11:24:00,892 DEBUG [c.c.a.r.v.VirtualRoutingResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Unable to logon to 10.42.0.93
2025-08-11 11:24:03,956 ERROR [c.c.u.FileUtil]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Failed to scp files to system VM due
to, No route to host
2025-08-11 11:24:07,028 ERROR [c.c.u.FileUtil]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Failed to scp files to system VM due
to, No route to host
2025-08-11 11:24:10,100 ERROR [c.c.u.FileUtil]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Failed to scp files to system VM due
to, No route to host
2025-08-11 11:24:10,100 ERROR [c.c.h.v.r.VmwareResource]
(DirectAgent-180:[ctx-d02be6b8, 10.42.0.23, job-44/job-138, cmd:
StartCommand]) (logid:2459ff24) Failed to scp files to system VM.
Patching of systemVM failed
com.cloud.utils.exception.CloudRuntimeException: Failed to scp files to
system VM due to, No route to host
2025-08-11 11:24:10,123 DEBUG [c.c.a.t.Request]
(Work-Job-Executor-91:[ctx-9e73a045, job-44/job-138, ctx-025b0546])
(logid:2459ff24) Seq 1-5696209103693548290: Received: { Ans: , MgmtId:
97591085894372, via: 1(10.42.0.23), Ver: v1, Flags: 110, { StartAnswer
} }
2025-08-11 11:24:10,130 INFO
[c.c.v.ClusteredVirtualMachineManagerImpl]
(Work-Job-Executor-91:[ctx-9e73a045, job-44/job-138, ctx-025b0546])
(logid:2459ff24) Unable to start VM on Host
{"id":1,"name":"10.42.0.23","type":"Routing","uuid":"8733a859-b04f-4341-9ceb-182b7917628f"}
due to Failed to scp files to system VM. Patching of systemVM failed
due to: Failed to scp files to system VM due to, No route to host
2025-08-11 11:24:10,139 DEBUG
[c.c.v.ClusteredVirtualMachineManagerImpl]
(Work-Job-Executor-91:[ctx-9e73a045, job-44/job-138, ctx-025b0546])
(logid:2459ff24) Cleaning up resources for the vm VM instance
{"id":31,"instanceName":"s-31-VM","state":"Starting","type":"SecondaryStorageVm","uuid":"700c9d06-fc6e-40ce-a42f-d59e657771b3"}
in Starting state
From the logs your pod network is using the following vswitch on vmware
- "name":"vSwitch_Storage,402,vmwaresvs"
You can try to ssh into the SystemVM's directly from the management
servers once they are running on vmware and see if that works - You can
ssh to systemVM's on vmware using the steps here The System VM Template
— Apache CloudStack 4.20.1.0
documentation<https://docs.cloudstack.apache.org/en/latest/adminguide/systemvm.html#accessing-system-vms>
The System VM Template — Apache CloudStack 4.20.1.0
documentation<https://docs.cloudstack.apache.org/en/latest/adminguide/systemvm.html#accessing-system-vms>
CloudStack uses several types of system Instances to perform tasks in
the cloud. In general CloudStack manages these system VMs and creates,
starts, and stops them as needed based on scale and immediate needs.
Unlike user VMs, system VMs are expunged on destroying them. However,
the administrator should be aware of them and their roles to assist in
debugging issues. The System VM Template The ...
docs.cloudstack.apache.org
If the ssh to systemVM's is not working , To isolate the issue may be
you can deploy a VM directly on vmware with a nic on the vswitch stated
above and see if you can ssh into the VM from all your management
servers once running.
Thanks
Prashanth
________________________________
From: Kayo Henrique <kayo.henri...@onexdatacenter.com.br>
Sent: Tuesday, August 12, 2025 1:15 AM
To: Users <users@cloudstack.apache.org>
Subject: POSSIBLE SSL ERROR ON SYSTEM VMS - POSSÍVEL ERRO DE SSL NAS
SYSTEM VMS
*IN ENGLISH*
Hello,
I've rebuilt my CloudStack environment to VMware and I'm having a
problem.
It appears my System VMs are powered on, have connectivity, and are
pinging all networks, but the System VM services (SSVM and CPVM) aren't
working.
The evidence images and the management server log file are available at
the link below:
https://drive.onexdatacenter.com.br/s/gRreLjZ4bg5KPHM
I did some research and discovered that it might be related to the SSL
certificate, but I don't quite understand how it works!
I'm here to help!
//
*EM PORTUGUÊS*
Olá,
Refiz meu ambiente de CloudStack para VMware e estou com um problema.
Aparentemente minhas System VMs estão ligadas, com conectividade,
pingando todas as redes, mas os serviços das System VMs (SSVM e CPVM)
não funcionam.
As imagens de evidência e o arquivo de logs do management server estão
presentes no link abaixo:
https://drive.onexdatacenter.com.br/s/gRreLjZ4bg5KPHM
Pesquisei um pouco sobre e descobri que pode estar relacionado ao
certificado SSL, mas não entendi muito bem como funcionaria isso!
Fico à disposição!!
--
Atenciosamente,
Kayo Henrique
Analista de Infraestrutura e Redes
OneX Data Centers