[
https://issues.apache.org/jira/browse/CLOUDSTACK-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13727008#comment-13727008
]
Murali Reddy commented on CLOUDSTACK-3998:
------------------------------------------
code injection based faults
http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/FaultInjectFramework.html
> explore simulator based fault injection for resiliency testing
> ---------------------------------------------------------------
>
> Key: CLOUDSTACK-3998
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-3998
> Project: CloudStack
> Issue Type: Task
> Security Level: Public(Anyone can view this level - this is the
> default.)
> Components: Management Server
> Reporter: Murali Reddy
> Assignee: Murali Reddy
> Fix For: Future
>
>
> We could inject controlled faults in to simulated hypervisors, network
> elements, storage resources, system VM's and try out tests for testing the
> resiliency of CloudStack core.
> For example, we can have test case (that runs only for Simulator) where we
> instruct a simulated hypervisor resource to not to respond to ping from
> CloudStack. Expected result would be core to treat hypervisor host as
> disconnected and trigger HA enabled VM's.
> My initial thinking is to expose set of test API from simulator plug-in that
> would help inject transient/permanent/intermittent faults into simulated
> resources like
> - host that lost network connectivity with MS
> - delayed response to agent commands (simulate overloaded hypervisor stacks
> like XAPI, vCenter) and long running tasks like snapshots
> - non responding edge service VM's
> need to think little more on the failure categories and points, and best
> abstract them test API so that can enable flexible and rich resilience tests.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira