[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-9317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15887950#comment-15887950
 ] 

ASF GitHub Bot commented on CLOUDSTACK-9317:
--------------------------------------------

Github user ProjectMoon commented on the issue:

    https://github.com/apache/cloudstack/pull/1908
  
    We have been testing this PR internally for some time now, and we have seen 
some strange `arping` issues. We have seen that `/etc/cloudstack/ips.json` on 
the router has IPs on them that aren't cleaned up. Since #1908 is now merged, 
I'm worried that the changes in this PR will not play well with #1908.
    
    Specifically, we have seen that IPs left over in the `ips.json` file can be 
stuck on `"add": true` even if the IP is no longer on the account. 
    
    Thus, the code that was added in PR 1907 concerns me:
    
    ```
     if not address['add'] and not ip.configured():
        logging.info("Skipping %s as the add flag is set to %s " % 
(address['public_ip'], address['add']))
        continue
    ```
    
    The `if` condition fails in a case we've been investigating. Because our IP 
is `"add": true`, but `configured()` returns `False`, the IP gets reconfigured 
anyway. This doesn't add the IP to the router's interface, but it _does_ run an 
`arping` command (see CsAddress.py). Essentially, if this PR is merged as-is, 
we might wind up with a regression on CLOUDSTACK-9500. 
    
    I will do a bit of manual testing combining both #1908 and #1907 to see 
what happens. It may just be that after restarting the network (and thus 
cleaning the router), these issues will vanish.


> Disabling static NAT on many IPs can leave wrong IPs on the router
> ------------------------------------------------------------------
>
>                 Key: CLOUDSTACK-9317
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-9317
>             Project: CloudStack
>          Issue Type: Bug
>      Security Level: Public(Anyone can view this level - this is the 
> default.) 
>          Components: Management Server, Virtual Router
>    Affects Versions: 4.7.0, 4.7.1, 4.7.2
>            Reporter: Jeff Hair
>
> The current behavior of enabling or disabling static NAT will call the apply 
> IP associations method in the management server. The method is not 
> thread-safe. If it's called from multiple threads, each thread will load up 
> the list of public IPs in different states (add or revoke)--correct for the 
> thread, but not correct overall. Depending on execution order on the virtual 
> router, the router can end up with public IPs assigned to it that are not 
> supposed to be on it anymore. When another account acquires the same IP, this 
> of course leads to network problems.
> The problem has been in CS since at least 4.2, and likely affects all 
> recently released versions. Affected version is set to 4.7.x because that's 
> what we verified against.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to