We could maybe switch the system VMs to agent properties that use a
size-based roll for logging.
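
Something along these lines might do it. This is only a rough sketch,
assuming the system VM's log4j-cloud.xml is a log4j 1.2 XML file and that
the 'FILE3' appender named in the warnings quoted below is the one writing
cloud.out; I have not checked the actual file. Those warnings suggest the
size properties are currently being set on
org.apache.log4j.rolling.RollingFileAppender, which does not accept them,
while the core org.apache.log4j.RollingFileAppender does. The file path,
size, and backup count here are illustrative, not the shipped values.

```xml
<!-- Sketch only: appender name, file path, and sizes are assumptions. -->
<appender name="FILE3" class="org.apache.log4j.RollingFileAppender">
  <param name="File" value="/var/log/cloud/cloud.out"/>
  <param name="Append" value="true"/>
  <!-- Roll once the file reaches 10MB; keep at most 5 rolled files. -->
  <param name="MaxFileSize" value="10MB"/>
  <param name="MaxBackupIndex" value="5"/>
  <layout class="org.apache.log4j.PatternLayout">
    <param name="ConversionPattern" value="%d{ISO8601} %-5p [%c{3}] (%t:%x) %m%n"/>
  </layout>
</appender>
```

With values like these the appender would hold at most about 60MB (the
live file plus five backups), regardless of how often the agent logs.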

On Mon, Mar 10, 2014 at 5:30 PM, Anirban Chakraborty <abc...@juniper.net> wrote:
> Thanks for all the responses. I do not see the cloud.log and cloud.out logs 
> being zipped in /var/log and /var/log/cloud respectively. The only file that 
> was zipped was cron.log. The two largest files are:
> cloud.out.2, with the following:
> --
> + keyvalues=' root=LABEL console=tty0 xencons=ttyS0,115200 console=hvc0 
> console=hvc0 template=domP type=secstorage host=10.84.58.252 port=8250 
> name=s-1-VM zone=1 pod=1 guid=s-1-VM 
> resource=org.apache.cloudstack.storage.resource.NfsSecondaryStorageResource 
> instance=SecStorage sslcopy=true role=templateProcessor mtu=1500'
> + for i in '$CMDLINE'
> ++ cut -s -d= -f1
> ++ echo eth2ip=10.84.59.176
> + KEY=eth2ip
> ++ cut -s -d= -f2
> ++ echo eth2ip=10.84.59.176
> + VALUE=10.84.59.176
> + '[' eth2ip == '' ']'
> + case $KEY in
> + keyvalues=' root=LABEL console=tty0 xencons=ttyS0,115200 console=hvc0 
> console=hvc0 template=domP type=secstorage host=10.84.58.252 port=8250 
> name=s-1-VM zone=1 pod=1 guid=s-1-VM resource=org.apache.cloudstack.sJava 
> HotSpot(TM) Client VM warning: Insufficient space for shared memory file:
>    /tmp/hsperfdata_root/8004
> Try using the -Djava.io.tmpdir= option to select an alternate temp location.
>
> log4j:WARN No appenders could be found for logger 
> (com.cloud.agent.AgentShell).
> log4j:WARN Please initialize the log4j system properly.
> log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more 
> info.
> log4j:WARN No such property [maxFileSize] in 
> org.apache.log4j.rolling.RollingFileAppender.
> log4j:WARN No such property [maxBackupIndex] in 
> org.apache.log4j.rolling.RollingFileAppender.
> log4j:WARN Please set a rolling policy for the RollingFileAppender named 
> 'FILE3'
> 08:31:55,320  INFO AgentShell:318 - Agent started
> log4j:ERROR Failed to flush writer,
> java.io.IOException: No space left on device
> --
> and /var/log/cloud.log.1, which repeats the following:
> --
> 2014-03-01 07:05:08,607 DEBUG [cloud.utils.ProcessUtil] (main:null)   PID TTY 
>          TIME CMD
>  3938 ?        00:00:36 java
>
> 2014-03-01 07:05:08,607 ERROR [cloud.agent.AgentShell] (main:null) Unable to 
> start agent: Java process is being started twice.  If this is not true, 
> remove /var/run/agent.SecStorage.pid
> 2014-03-01 07:05:19,028 INFO  [cloud.agent.AgentShell] (main:null) Agent 
> started
> 2014-03-01 07:05:19,030 INFO  [cloud.agent.AgentShell] (main:null) 
> Implementation Version is 4.3.0-SNAPSHOT
> 2014-03-01 07:05:19,030 INFO  [cloud.agent.AgentShell] (main:null) 
> agent.properties found at /usr/local/cloud/systemvm/conf/agent.properties
> 2014-03-01 07:05:19,038 DEBUG [cloud.agent.AgentShell] (main:null) Found 
> property: instance
> 2014-03-01 07:05:19,038 DEBUG [cloud.agent.AgentShell] (main:null) Found 
> property: resource
> 2014-03-01 07:05:19,038 INFO  [cloud.agent.AgentShell] (main:null) Defaulting 
> to using properties file for storage
> 2014-03-01 07:05:19,039 INFO  [cloud.agent.AgentShell] (main:null) Defaulting 
> to the constant time backoff algorithm
> 2014-03-01 07:05:19,048 INFO  [cloud.utils.LogUtils] (main:null) log4j 
> configuration found at /usr/local/cloud/systemvm/conf/log4j-cloud.xml
> 2014-03-01 07:05:19,062 DEBUG [cloud.agent.AgentShell] (main:null) Checking 
> to see if agent.SecStorage.pid exists.
> 2014-03-01 07:05:19,064 DEBUG [cloud.utils.ProcessUtil] (main:null) 
> environment.properties could not be opened
> 2014-03-01 07:05:19,071 DEBUG [cloud.utils.ProcessUtil] (main:null) 
> Executing: bash -c ps -p 3938
> 2014-03-01 07:05:19,077 DEBUG [cloud.utils.ProcessUtil] (main:null) Execution 
> is successful.
>
> Eventually the log fills up with:
> 2014-03-10 09:54:09,420 INFO  [utils.nio.NioClient] (Agent-Selector:null) 
> Connecting to 10.84.58.252:8250
> 2014-03-10 09:54:36,916 WARN  [utils.nio.NioConnection] (Agent-Selector:null) 
> Unable to connect to remote: is there a server running on port 8250
> --
> It looks like the agent on s-1-VM could not connect to the management server 
> at some point after the system VM started, and the log then filled up with 
> the 'Unable to connect' messages above.
>
> Anirban
>
> On Mar 7, 2014, at 11:37 PM, Rajesh Battala <rajesh.batt...@citrix.com> wrote:
>
>> AFAIK, log rotation is enabled in the systemvm.
>> Can you check whether the logs are getting zipped?
>>
>> -----Original Message-----
>> From: Anirban Chakraborty [mailto:abc...@juniper.net]
>> Sent: Saturday, March 8, 2014 12:46 PM
>> To: dev@cloudstack.apache.org
>> Subject: system vm disk space issue in ACS 4.3
>>
>> Hi All,
>>
>> I am seeing that the system VM disk has no space left after running for a 
>> few days. The CloudStack UI shows the agent in v-2-VM in alert state, while 
>> the agent state of s-1-VM is blank (a hyphen in the UI).
>> Both system VMs are running and ssh-able from the host. The log in 
>> s-1-VM shows the following errors:
>>
>> root@s-1-VM:~# grep 'Exception' /var/log/cloud/*.*
>> /var/log/cloud/cloud.out.2:java.io.IOException: No space left on device
>> /var/log/cloud/cloud.out.2:java.io.IOException: No space left on device
>>
>> whereas the logs in v-1-VM show
>> /var/log/cloud/cloud.out.3:java.io.IOException: No space left on device
>> /var/log/cloud/cloud.out.3:java.io.IOException: No space left on device
>> /var/log/cloud/cloud.out.3:07:18:00,547  INFO CSExceptionErrorCode:87 - 
>> Could not find exception: com.cloud.exception.AgentControlChannelException 
>> in error code list for exceptions
>> /var/log/cloud/cloud.out.3:com.cloud.exception.AgentControlChannelException: 
>> Unable to post agent control request as link is not available
>>
>> It looks like the cloud agent is filling up the log, which leads to the 
>> disk-full state.
>>
>> Is this a known issue? Thanks.
>>
>> Anirban
>>
>>
>
>
