An update on this.

I finally succeeded in getting my hosts out of alert state by reverting to an 
earlier version of the kernel (see other thread).  Unfortunately when it came 
up I realized I had installed the 4.3 systemvm template and not the 4.4 one so 
I just reverted to my 4.1.1 installation and did the upgrade to 4.3 which 
appears to work.  I am seeing some errors in the log which I’ll post to a 
separate thread.  I’ll wait to test 4.4 when it is released.

With regards to the db.properties file.  I looked into it in more detail and 
the problem was that the file had somehow gotten re-ordered in a random fashion 
which is why I could not make sense out of the diff during the install.  Not 
sure how that happened but my concern mentioned below is probably a non-issue.

regards,
-Carlos


On Jun 30, 2014, at 5:11 PM, Carlos Reátegui <create...@gmail.com> wrote:

> I set encryption to none in db.properties and updated the passwords in 
> host_details to unencrypted versions so I could make progress. 
> 
> I don’t know what exactly the problem was but this is probably something that 
> needs better testing.  I’m pretty sure I had all the encryption stuff correct 
> in the db.properties file but could not get it to work.
> 
> It would be nice if there was a specialized merging utility for the 
> db.properties given the change in the organization of the file.  I am 
> guessing if the file had not been reorganized it would have been more obvious 
> how to merge the 2 and I may have avoided this issue. 
> 
> Now my hosts come up in an alert state and I’m not sure where to go from 
> here.  Please note I am not using bridge mode because I wanted to to a 4 nic 
> bridge which bridge does not allow (only 2 nics).  This was working fine in 
> 4.1 so hopefully this is not a requirement for 4.4.  I am not using security 
> groups which was my understanding is what requires bridge networking:
> 
> The error in the log is this:
> 2014-06-30 14:06:50,073 WARN  [c.c.h.x.r.CitrixResourceBase] 
> (DirectAgent-1:ctx-35941dc7) Failed to configure brige firewall
> 2014-06-30 14:06:50,073 WARN  [c.c.h.x.r.CitrixResourceBase] 
> (DirectAgent-1:ctx-35941dc7) Check host 172.30.45.32 for CSP is installed or 
> not and check network mode for bridge
> 2014-06-30 14:06:50,074 DEBUG [c.c.a.m.DirectAgentAttache] 
> (DirectAgent-1:ctx-35941dc7) Seq 2-6232418934327345153: Response Received: 
> 2014-06-30 14:06:50,075 DEBUG [c.c.a.t.Request] (DirectAgent-1:ctx-35941dc7) 
> Seq 2-6232418934327345153: Processing:  { Ans: , MgmtId: 233845174730253, 
> via: 2, Ver: v1, Flags: 110, [{"com.cloud.agent.api
> .SetupAnswer":{"_reconnect":true,"result":false,"details":"Failed to 
> configure brige firewall","wait":0}}] }
> 2014-06-30 14:06:50,075 DEBUG [c.c.a.t.Request] 
> (AgentTaskPool-2:ctx-b360d1bb) Seq 2-6232418934327345153: Received:  { Ans: , 
> MgmtId: 233845174730253, via: 2, Ver: v1, Flags: 110, { SetupAnswer } }
> 2014-06-30 14:06:50,076 DEBUG [c.c.a.m.AgentAttache] 
> (DirectAgent-1:ctx-35941dc7) Seq 2-6232418934327345153: No more commands found
> 2014-06-30 14:06:50,076 WARN  [c.c.h.x.d.XcpServerDiscoverer] 
> (AgentTaskPool-2:ctx-b360d1bb) Unable to setup agent 2 due to Failed to 
> configure brige firewall
> 2014-06-30 14:06:50,079 INFO  [c.c.u.e.CSExceptionErrorCode] 
> (AgentTaskPool-2:ctx-b360d1bb) Could not find exception: 
> com.cloud.exception.ConnectionException in error code list for exceptions
> 2014-06-30 14:06:50,079 WARN  [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Monitor XcpServerDiscoverer says there is an 
> error in the connect process for 2 due to Reinitialize agent after se
> tup.
> 2014-06-30 14:06:50,079 INFO  [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Host 2 is disconnecting with event 
> AgentDisconnected
> 2014-06-30 14:06:50,085 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) The next status of agent 2is Alert, current 
> status is Connecting
> 2014-06-30 14:06:50,085 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Deregistering link for 2 with state Alert
> 2014-06-30 14:06:50,085 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Remove Agent : 2
> 2014-06-30 14:06:50,085 DEBUG [c.c.a.m.DirectAgentAttache] 
> (AgentTaskPool-2:ctx-b360d1bb) Processing disconnect 2(srvengxen02)
> 2014-06-30 14:06:50,085 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.hypervisor.xen.discoverer.XcpServerDiscoverer
> 2014-06-30 14:06:50,085 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.hypervisor.hyperv.discoverer.HypervServerDiscoverer
> 2014-06-30 14:06:50,085 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator
> 2014-06-30 14:06:50,086 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.network.security.SecurityGroupListener
> 2014-06-30 14:06:50,086 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.vm.ClusteredVirtualMachineManagerImpl
> 2014-06-30 14:06:50,086 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.storage.secondary.SecondaryStorageListener
> 2014-06-30 14:06:50,086 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.deploy.DeploymentPlanningManagerImpl
> 2014-06-30 14:06:50,086 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.storage.listener.StoragePoolMonitor
> 2014-06-30 14:06:50,086 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.storage.download.DownloadListener
> 2014-06-30 14:06:50,086 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.network.SshKeysDistriMonitor
> 2014-06-30 14:06:50,086 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.network.router.VirtualNetworkApplianceManagerImpl
> 2014-06-30 14:06:50,086 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.consoleproxy.ConsoleProxyListener
> 2014-06-30 14:06:50,091 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.network.SshKeysDistriMonitor
> 2014-06-30 14:06:50,091 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.network.router.VpcVirtualNetworkApplianceManagerImpl
> 2014-06-30 14:06:50,091 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.storage.LocalStoragePoolListener
> 2014-06-30 14:06:50,091 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.storage.upload.UploadListener
> 2014-06-30 14:06:50,093 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.capacity.StorageCapacityListener
> 2014-06-30 14:06:50,093 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.capacity.ComputeCapacityListener
> 2014-06-30 14:06:50,093 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.network.NetworkUsageManagerImpl$DirectNetworkStatsListener
> 2014-06-30 14:06:50,093 DEBUG [c.c.n.NetworkUsageManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Disconnected called on 2 with status Alert
> 2014-06-30 14:06:50,093 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Sending Disconnect to listener: 
> com.cloud.agent.manager.AgentManagerImpl$BehindOnPingListener
> 2014-06-30 14:06:50,093 DEBUG [c.c.h.Status] (AgentTaskPool-2:ctx-b360d1bb) 
> Transition:[Resource state = Enabled, Agent event = AgentDisconnected, Host 
> id = 2, name = srvengxen02]
> 2014-06-30 14:06:50,102 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Notifying other nodes of to disconnect
> 2014-06-30 14:06:50,108 DEBUG [c.c.h.x.r.CitrixResourceBase] 
> (DirectAgent-2:ctx-59808af2) Copying 
> /usr/share/cloudstack-management/webapps/client/WEB-INF/classes/scripts/vm/hypervisor/xenserver/xenserver60/../../../../network/domr//router_proxy.sh
>  to /opt/cloud/bin on 172.30.45.32 with permission 0755
> 2014-06-30 14:06:50,108 DEBUG [c.c.h.x.r.CitrixResourceBase] 
> (DirectAgent-2:ctx-59808af2) Unable to create destination path: 
> /opt/cloud/bin on 172.30.45.32 but trying anyway
> 2014-06-30 14:06:50,110 WARN  [c.c.r.ResourceManagerImpl] 
> (AgentTaskPool-2:ctx-b360d1bb) Unable to connect due to 
> com.cloud.exception.ConnectionException: Reinitialize agent after setup.
>        at 
> com.cloud.hypervisor.xen.discoverer.XcpServerDiscoverer.processConnect(XcpServerDiscoverer.java:656)
>        at 
> com.cloud.agent.manager.AgentManagerImpl.notifyMonitorsOfConnection(AgentManagerImpl.java:514)
>        at 
> com.cloud.agent.manager.AgentManagerImpl.handleDirectConnectAgent(AgentManagerImpl.java:1427)
>        at 
> com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1765)
>        at 
> com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1891)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>        at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>        at java.lang.reflect.Method.invoke(Method.java:606)
>        at 
> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:317)
> 
> 
> 
> 
> On Jun 30, 2014, at 1:57 PM, Carlos Reátegui <create...@gmail.com> wrote:
> 
>> Making a little progress but still stuck…
>> 
>> I realized that when I did the upgrade it had asked me if to keep the old 
>> dp.properties or use the new one.  The structure of the file seemed 
>> different enough and I did not recall using anything but the defaults so I 
>> went ahead and told it to use the new one.  Seems this was not the right 
>> thing to do.
>> 
>> I have updated the password/ecryption settings to match the old file but it 
>> is still not working.  Now I am getting stuck here:
>> 
>> 2014-06-30 13:50:32,139 DEBUG [c.c.s.d.VMTemplateDaoImpl] (main:null) Found 
>> parameter routing unique name null
>> 2014-06-30 13:50:32,139 DEBUG [c.c.s.d.VMTemplateDaoImpl] (main:null) Use 
>> console proxy template : routing
>> 2014-06-30 13:50:32,143 INFO  [c.c.u.d.GenericDaoBase] (main:null) Cache 
>> created: [ name = HostPodDaoImpl status = STATUS_ALIVE eternal = false 
>> overflowToDisk = false maxEntriesLocalHeap = 50 maxEntriesLocalDisk = 0 
>> memoryStoreEvictionPolicy = LRU timeToLiveSeconds = 600 timeToIdleSeconds = 
>> 300 persistence = none diskExpiryThreadIntervalSeconds = 120 
>> cacheEventListeners: net.sf.ehcache.statistics.LiveCacheStatisticsWrapper  
>> hitCount = 0 memoryStoreHitCount = 0 diskStoreHitCount = 0 missCountNotFound 
>> = 0 missCountExpired = 0 maxBytesLocalHeap = 0 overflowToOffHeap = false 
>> maxBytesLocalOffHeap = 0 maxBytesLocalDisk = 0 pinned = false ]
>> 2014-06-30 13:50:32,157 INFO  [c.c.u.d.GenericDaoBase] (main:null) Cache 
>> created: [ name = DedicatedResourceDaoImpl status = STATUS_ALIVE eternal = 
>> false overflowToDisk = false maxEntriesLocalHeap = 30 maxEntriesLocalDisk = 
>> 0 memoryStoreEvictionPolicy = LRU timeToLiveSeconds = 3600 timeToIdleSeconds 
>> = 300 persistence = none diskExpiryThreadIntervalSeconds = 120 
>> cacheEventListeners: net.sf.ehcache.statistics.LiveCacheStatisticsWrapper  
>> hitCount = 0 memoryStoreHitCount = 0 diskStoreHitCount = 0 missCountNotFound 
>> = 0 missCountExpired = 0 maxBytesLocalHeap = 0 overflowToOffHeap = false 
>> maxBytesLocalOffHeap = 0 maxBytesLocalDisk = 0 pinned = false ]
>> 2014-06-30 13:50:32,168 INFO  [c.c.u.d.GenericDaoBase] (main:null) Cache 
>> created: [ name = HypervisorCapabilitiesDaoImpl status = STATUS_ALIVE 
>> eternal = false overflowToDisk = false maxEntriesLocalHeap = 100 
>> maxEntriesLocalDisk = 0 memoryStoreEvictionPolicy = LRU timeToLiveSeconds = 
>> 600 timeToIdleSeconds = 300 persistence = none 
>> diskExpiryThreadIntervalSeconds = 120 cacheEventListeners: 
>> net.sf.ehcache.statistics.LiveCacheStatisticsWrapper  hitCount = 0 
>> memoryStoreHitCount = 0 diskStoreHitCount = 0 missCountNotFound = 0 
>> missCountExpired = 0 maxBytesLocalHeap = 0 overflowToOffHeap = false 
>> maxBytesLocalOffHeap = 0 maxBytesLocalDisk = 0 pinned = false ]
>> 2014-06-30 13:50:32,175 INFO  [c.c.u.d.GenericDaoBase] (main:null) Cache 
>> created: [ name = UserDaoImpl status = STATUS_ALIVE eternal = false 
>> overflowToDisk = false maxEntriesLocalHeap = 5000 maxEntriesLocalDisk = 0 
>> memoryStoreEvictionPolicy = LRU timeToLiveSeconds = 300 timeToIdleSeconds = 
>> 300 persistence = none diskExpiryThreadIntervalSeconds = 120 
>> cacheEventListeners: net.sf.ehcache.statistics.LiveCacheStatisticsWrapper  
>> hitCount = 0 memoryStoreHitCount = 0 diskStoreHitCount = 0 missCountNotFound 
>> = 0 missCountExpired = 0 maxBytesLocalHeap = 0 overflowToOffHeap = false 
>> maxBytesLocalOffHeap = 0 maxBytesLocalDisk = 0 pinned = false ]
>> 2014-06-30 13:50:32,180 INFO  [c.c.u.d.GenericDaoBase] (main:null) Cache 
>> created: [ name = ServiceOfferingDaoImpl status = STATUS_ALIVE eternal = 
>> false overflowToDisk = false maxEntriesLocalHeap = 50 maxEntriesLocalDisk = 
>> 0 memoryStoreEvictionPolicy = LRU timeToLiveSeconds = 600 timeToIdleSeconds 
>> = 300 persistence = none diskExpiryThreadIntervalSeconds = 120 
>> cacheEventListeners: net.sf.ehcache.statistics.LiveCacheStatisticsWrapper  
>> hitCount = 0 memoryStoreHitCount = 0 diskStoreHitCount = 0 missCountNotFound 
>> = 0 missCountExpired = 0 maxBytesLocalHeap = 0 overflowToOffHeap = false 
>> maxBytesLocalOffHeap = 0 maxBytesLocalDisk = 0 pinned = false ]
>> 2014-06-30 13:50:32,187 INFO  [c.c.u.d.GenericDaoBase] (main:null) Cache 
>> created: [ name = DataCenterDaoImpl status = STATUS_ALIVE eternal = false 
>> overflowToDisk = false maxEntriesLocalHeap = 50 maxEntriesLocalDisk = 0 
>> memoryStoreEvictionPolicy = LRU timeToLiveSeconds = 600 timeToIdleSeconds = 
>> 300 persistence = none diskExpiryThreadIntervalSeconds = 120 
>> cacheEventListeners: net.sf.ehcache.statistics.LiveCacheStatisticsWrapper  
>> hitCount = 0 memoryStoreHitCount = 0 diskStoreHitCount = 0 missCountNotFound 
>> = 0 missCountExpired = 0 maxBytesLocalHeap = 0 overflowToOffHeap = false 
>> maxBytesLocalOffHeap = 0 maxBytesLocalDisk = 0 pinned = false ]
>> 2014-06-30 13:50:32,188 INFO  [c.c.u.d.GenericDaoBase] (main:null) Cache 
>> created: [ name = Ip Alloc status = STATUS_ALIVE eternal = false 
>> overflowToDisk = false maxEntriesLocalHeap = 50 maxEntriesLocalDisk = 0 
>> memoryStoreEvictionPolicy = LRU timeToLiveSeconds = 600 timeToIdleSeconds = 
>> 300 persistence = none diskExpiryThreadIntervalSeconds = 120 
>> cacheEventListeners: net.sf.ehcache.statistics.LiveCacheStatisticsWrapper  
>> hitCount = 0 memoryStoreHitCount = 0 diskStoreHitCount = 0 missCountNotFound 
>> = 0 missCountExpired = 0 maxBytesLocalHeap = 0 overflowToOffHeap = false 
>> maxBytesLocalOffHeap = 0 maxBytesLocalDisk = 0 pinned = false ]
>> 2014-06-30 13:50:32,189 INFO  [c.c.u.d.GenericDaoBase] (main:null) Cache 
>> created: [ name = vnet Alloc status = STATUS_ALIVE eternal = false 
>> overflowToDisk = false maxEntriesLocalHeap = 50 maxEntriesLocalDisk = 0 
>> memoryStoreEvictionPolicy = LRU timeToLiveSeconds = 600 timeToIdleSeconds = 
>> 300 persistence = none diskExpiryThreadIntervalSeconds = 120 
>> cacheEventListeners: net.sf.ehcache.statistics.LiveCacheStatisticsWrapper  
>> hitCount = 0 memoryStoreHitCount = 0 diskStoreHitCount = 0 missCountNotFound 
>> = 0 missCountExpired = 0 maxBytesLocalHeap = 0 overflowToOffHeap = false 
>> maxBytesLocalOffHeap = 0 maxBytesLocalDisk = 0 pinned = false ]
>> 2014-06-30 13:50:32,198 INFO  [c.c.u.d.GenericDaoBase] (main:null) Cache 
>> created: [ name = VlanDaoImpl status = STATUS_ALIVE eternal = false 
>> overflowToDisk = false maxEntriesLocalHeap = 30 maxEntriesLocalDisk = 0 
>> memoryStoreEvictionPolicy = LRU timeToLiveSeconds = 3600 timeToIdleSeconds = 
>> 300 persistence = none diskExpiryThreadIntervalSeconds = 120 
>> cacheEventListeners: net.sf.ehcache.statistics.LiveCacheStatisticsWrapper  
>> hitCount = 0 memoryStoreHitCount = 0 diskStoreHitCount = 0 missCountNotFound 
>> = 0 missCountExpired = 0 maxBytesLocalHeap = 0 overflowToOffHeap = false 
>> maxBytesLocalOffHeap = 0 maxBytesLocalDisk = 0 pinned = false ]
>> 2014-06-30 13:50:32,232 DEBUG [c.c.u.c.DBEncryptionUtil] (main:null) Error 
>> while decrypting: true
>> 
>> The key is still the default password and I have decrypted all the ENC 
>> parameters from the db.properties file and they seem ok.  What am I missing?
>> 
>> thanks,
>> Carlos
>> 
>> 
>> On Jun 30, 2014, at 1:16 PM, Carlos Reátegui <create...@gmail.com> wrote:
>> 
>>> I found the comments in: 
>>> https://issues.apache.org/jira/browse/CLOUDSTACK-3990 useful but how do I 
>>> find out the database key so that I can set the pw.
>>> 
>>> Also in looking at my previous backups for the host_details table it seems 
>>> like the password entry changes on a regular basis.
>>> 
>>> Is there something the keeps updating the db key and re-ecrypts the host 
>>> passwords?
>>> 
>>> On Jun 30, 2014, at 1:01 PM, Carlos Reátegui <create...@gmail.com> wrote:
>>> 
>>>> Hi All,
>>>> 
>>>> I am having problems bringing my system back up.  I have not checked the 
>>>> credentials of my hosts but the upgraded management server is unable to 
>>>> connect to them.  Where is the password stored?
>>>> 
>>>> thanks.
>>>> Carlos
>>>> 
>>>> 
>>>> 2014-06-30 12:55:59,277 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] 
>>>> (ClusteredAgentManager Timer:ctx-060c8ace) Loading directly connected host 
>>>> 1(srvengxen01)
>>>> 2014-06-30 12:56:04,394 DEBUG [c.c.n.l.LBHealthCheckManagerImpl] 
>>>> (LBHealthCheck-1:ctx-c6869648) LB HealthCheck Manager is running and 
>>>> getting the updates from LB providers and updating service status
>>>> 2014-06-30 12:56:04,428 DEBUG [c.c.n.l.LBHealthCheckManagerImpl] 
>>>> (LBHealthCheck-1:ctx-c6869648) LB HealthCheck Manager is running and 
>>>> getting the updates from LB providers and updating service status
>>>> 2014-06-30 12:56:06,844 DEBUG [c.c.h.x.r.XenServerConnectionPool] 
>>>> (ClusteredAgentManager Timer:ctx-060c8ace) Unable to create master 
>>>> connection to host(172.30.45.31) , due to The credentials given by the 
>>>> user are incorrect, so access has been denied, and you have not been 
>>>> issued a session handle.
>>>> 2014-06-30 12:56:06,848 DEBUG [c.c.h.Status] (ClusteredAgentManager 
>>>> Timer:ctx-060c8ace) Transition:[Resource state = Enabled, Agent event = 
>>>> AgentDisconnected, Host id = 1, name = srvengxen01]
>>>> 2014-06-30 12:56:06,862 WARN  [c.c.a.m.ClusteredAgentManagerImpl] 
>>>> (ClusteredAgentManager Timer:ctx-060c8ace)  can not load directly 
>>>> connected host 1(srvengxen01) due to 
>>>> com.cloud.utils.exception.CloudRuntimeException: Unable to create master 
>>>> connection to host(172.30.45.31) , due to The credentials given by the 
>>>> user are incorrect, so access has been denied, and you have not been 
>>>> issued a session handle.
>>>>     at 
>>>> com.cloud.hypervisor.xen.resource.XenServerConnectionPool.getConnect(XenServerConnectionPool.java:168)
>>>>     at 
>>>> com.cloud.hypervisor.xen.resource.CitrixResourceBase.CheckXenHostInfo(CitrixResourceBase.java:5722)
>>>>     at 
>>>> com.cloud.hypervisor.xen.resource.CitrixResourceBase.configure(CitrixResourceBase.java:5705)
>>>>     at 
>>>> com.cloud.resource.DiscovererBase.reloadResource(DiscovererBase.java:157)
>>>>     at 
>>>> com.cloud.agent.manager.AgentManagerImpl.loadDirectlyConnectedHost(AgentManagerImpl.java:672)
>>>>     at 
>>>> com.cloud.agent.manager.ClusteredAgentManagerImpl.scanDirectAgentToLoad(ClusteredAgentManagerImpl.java:218)
>>>>     at 
>>>> com.cloud.agent.manager.ClusteredAgentManagerImpl.runDirectAgentScanTimerTask(ClusteredAgentManagerImpl.java:184)
>>>>     at 
>>>> com.cloud.agent.manager.ClusteredAgentManagerImpl.access$100(ClusteredAgentManagerImpl.java:98)
>>>>     at 
>>>> com.cloud.agent.manager.ClusteredAgentManagerImpl$DirectAgentScanTimerTask.runInContext(ClusteredAgentManagerImpl.java:234)
>>>>     at 
>>>> org.apache.cloudstack.managed.context.ManagedContextTimerTask$1.runInContext(ManagedContextTimerTask.java:30)
>>>>     at 
>>>> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
>>>>     at 
>>>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
>>>>     at 
>>>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
>>>>     at 
>>>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
>>>>     at 
>>>> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
>>>>     at 
>>>> org.apache.cloudstack.managed.context.ManagedContextTimerTask.run(ManagedContextTimerTask.java:27)
>>>>     at java.util.TimerThread.mainLoop(Timer.java:555)
>>>>     at java.util.TimerThread.run(Timer.java:505)
>>>> Caused by: The credentials given by the user are incorrect, so access has 
>>>> been denied, and you have not been issued a session handle.
>>>>     at com.xensource.xenapi.Types.checkResponse(Types.java:322)
>>>>     at com.xensource.xenapi.Connection.dispatch(Connection.java:350)
>>>>     at com.xensource.xenapi.Session.loginWithPassword(Session.java:537)
>>>>     at 
>>>> com.cloud.hypervisor.xen.resource.XenServerConnectionPool.loginWithPassword(XenServerConnectionPool.java:321)
>>>>     at 
>>>> com.cloud.hypervisor.xen.resource.XenServerConnectionPool.getConnect(XenServerConnectionPool.java:154)
>>>>     ... 17 more
>>>> 2014-06-30 12:56:06,864 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] 
>>>> (ClusteredAgentManager Timer:ctx-060c8ace) Loading directly connected host 
>>>> 2(srvengxen02)
>>>> 2014-06-30 12:56:09,225 DEBUG [c.c.s.StatsCollector] 
>>>> (StatsCollector-1:ctx-8458e286) HostStatsCollector is running...
>>>> 2014-06-30 12:56:09,226 DEBUG [c.c.s.StatsCollector] 
>>>> (StatsCollector-2:ctx-aa245eed) VmStatsCollector is running...
>>>> 2014-06-30 12:56:09,227 DEBUG [c.c.s.StatsCollector] 
>>>> (StatsCollector-3:ctx-19894fa1) StorageCollector is running...
>>>> 2014-06-30 12:56:09,230 DEBUG [c.c.s.StatsCollector] 
>>>> (StatsCollector-4:ctx-d66c71fb) AutoScaling Monitor is running...
>>>> 
>>>> 
>>>> 
>>>> On Jun 30, 2014, at 9:54 AM, Carlos Reátegui <create...@gmail.com> wrote:
>>>> 
>>>>> Hi Sudha,
>>>>> Thanks for checking in.  I was out for the weekend and just getting back 
>>>>> to this now.
>>>>> 
>>>>> My main question at this point is if it is ok for me to kill the system 
>>>>> vms with the xe vm-shutdown command since the script provided by 
>>>>> cloudstack does not work with ubuntu.
>>>>> 
>>>>> Also it would be great if someone could have a look at my logs to see if 
>>>>> they look normal. I am seeing a lot of HA-Worker messages but I do not 
>>>>> have an HA deployment (unless this is the thread that keeps the system 
>>>>> vas running).
>>>>> 
>>>>> thanks,
>>>>> Carlos
>>>>> 
>>>>> 
>>>>> 
>>>>> On Jun 29, 2014, at 11:51 PM, Sudha Ponnaganti 
>>>>> <sudha.ponnaga...@citrix.com> wrote:
>>>>> 
>>>>>> Hi Carlos,
>>>>>> 
>>>>>> Were you able to resolve the following? Was your upgrade successful?
>>>>>> 
>>>>>> Thanks
>>>>>> /sudha
>>>>>> 
>>>>>> -----Original Message-----
>>>>>> From: Carlos Reátegui [mailto:create...@gmail.com] 
>>>>>> Sent: Friday, June 27, 2014 8:55 PM
>>>>>> To: CloudStack-Users
>>>>>> Cc: dev@cloudstack.apache.org
>>>>>> Subject: 4.4 upgrade issues
>>>>>> 
>>>>>> I am trying out the upgrade instructions from 
>>>>>> http://docs.cloudstack.apache.org/projects/cloudstack-release-notes/en/4.3/rnotes.html#upgrade-from-4-1-x-to-4-3
>>>>>>  but going to 4.4 built from source today.
>>>>>> 
>>>>>> My setup: XenServer 6.0.2 Hosts, Management Server on Ubuntu 12.04, 
>>>>>> Primary and Secondary on NFS, Basic Network, no security groups
>>>>>> 
>>>>>> -----
>>>>>> Notes on the docs:
>>>>>> 
>>>>>> 8.4 - 8.6: This is only for hosts that use the cloudstack agent. Does 
>>>>>> not apply to KVM. In general this whole section does not do a good job 
>>>>>> of explaining what is on the MS vs the Hosts.
>>>>>> 
>>>>>> 13: This fails on ubuntu because: cloudstack-sysvmadm sources 
>>>>>> /etc/rc.d/init.d/functions which does not exist on ubuntu/debian systems.
>>>>>> 
>>>>>> 14: Copy vhf-util from where? Also the path 
>>>>>> /usr/share/cloudstack-common/scripts/vm/hypervisor/xenserver does not 
>>>>>> exist on the hosts so I am assuming this is on the MS, however the MS 
>>>>>> already has it since it is an upgrade and was put there by the original 
>>>>>> install.  Or is this a new version that needs to be grabbed from 
>>>>>> somewhere?
>>>>>> 
>>>>>> Other: earlier versions like 4.1 worked with JDK 1.6 current releases 
>>>>>> require 1.7 but the Upgrade doc does not mention that.
>>>>>> 
>>>>>> --
>>>>>> Issues:
>>>>>> 
>>>>>> Saw the following in catalina.out, not sure if it is an issues:
>>>>>> Jun 27, 2014 5:28:42 PM org.apache.catalina.loader.WebappClassLoader 
>>>>>> validateJarFile
>>>>>> INFO: 
>>>>>> validateJarFile(/usr/share/cloudstack-management/webapps/client/WEB-INF/lib/servlet-api-2.5-20081211.jar)
>>>>>>  - jar not loaded. See Servlet Spec 2.3, section 9.7.2. Offending class: 
>>>>>> javax/servlet/Servlet.class Jun 27, 2014 5:28:42 PM 
>>>>>> org.apache.catalina.loader.WebappClassLoader validateJarFile
>>>>>> INFO: 
>>>>>> validateJarFile(/usr/share/cloudstack-management/webapps/client/WEB-INF/lib/tomcat-embed-core-7.0.30.jar)
>>>>>>  - jar not loaded. See Servlet Spec 2.3, section 9.7.2. Offending class: 
>>>>>> javax/servlet/Servlet.class
>>>>>> 
>>>>>> Since the above script in step 13 did not work is it ok to do "xe 
>>>>>> vm-shutdown vm=." on each of the system vms?  Will CloudStack notice 
>>>>>> they are ton and start new ones?
>>>>>> 
>>>>>> Here are my log files (please note I stopped the service prior to 
>>>>>> capturing these logs in case you are wondering):
>>>>>> Management server log: 
>>>>>> https://www.dropbox.com/s/7xhkutt8e724il1/management-server.log
>>>>>> Catalina log: 
>>>>>> https://www.dropbox.com/s/f45ypkbazhkogyj/catalina.2014-06-27.log
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> 
>>>>> 
>>>> 
>>> 
>> 
> 

Reply via email to