Re: Question: network issues of single node.

2020-06-08 Thread Sergey Chugunov
Of course I meant ticket [1] increased cluster stability in situation of blinking network. [1] https://issues.apache.org/jira/browse/IGNITE-7163 On Mon, Jun 8, 2020 at 1:51 PM Sergey Chugunov wrote: > Vladimir, > > Adding to what Alexey has said I remember that cases of short-term network > iss

Re: Question: network issues of single node.

2020-06-08 Thread Sergey Chugunov
Vladimir, Adding to what Alexey has said I remember that cases of short-term network issues (blinking network) were also a driver for this improvement. They are indeed hard to reproduce but have been seen in real world set-ups and have proven to increase cluster stability. On Sat, Jun 6, 2020 at

Re: Question: network issues of single node.

2020-06-08 Thread Alexey Goncharuk
Vladimir, Such behavior can be introduced by an erroneous firewall configuration (I can't find a link, but I remember that quite a large number of major incidents are caused by an incorrect configuration change). If such a case can be detected, we prefer Ignite to shutdown some of the nodes rather

Re: Question: network issues of single node.

2020-06-06 Thread Denis Magda
Finally, I got your question. Back in 2017-2018, there was a Discovery SPI's stabilization activity. The networking component could fail in various hard-to-reproduce scenarios affecting cluster availability and consistency. That ticket reminds me of those notorious issues that would fire once a we

Re: Question: network issues of single node.

2020-06-05 Thread Vladimir Steshin
Denis, I have no nodes that I'm unable to interconnect. This case is simulated in IgniteDiscoveryMassiveNodeFailTest.testMassiveFailSelfKill() Introduced in [1]. I’m asking if it is real or supposed problem. Where it was met? Which network configuration/issues could be? [1] https://issues.

Re: Question: network issues of single node.

2020-06-04 Thread Denis Magda
Vladimir, I'm suggesting to share the log files from the nodes that are unable to interconnect so that the community can check them for potential issues. Instead of sharing the logs from all the 5 nodes, try to start a two-nodes cluster with the nodes that fail to discover each other and attach th

Re: Question: network issues of single node.

2020-06-04 Thread Vladimir Steshin
Denis, hi.     Sorry, I didn’t catch your idea. Are you saying this can happen and suggest experiment? I’m not descripting a probable case. It is already done in [1]. I’m asking is it real, where it was met. 04.06.2020 23:33, Denis Magda пишет: Vladimir, Please do the following experiment.

Re: Question: network issues of single node.

2020-06-04 Thread Denis Magda
Vladimir, Please do the following experiment. Start a 2-nodes cluster booting node 3 and, for instance, node 5. Those won't be able to interconnect according to your description. Attach the log files from both nodes for analysis. This should be a networking issue. - Denis On Thu, Jun 4, 2020 at

Question: network issues of single node.

2020-06-04 Thread Vladimir Steshin
    Hi, Igniters.     I wanted to ask how one node may not be able to connect to another whereas rest of the cluster can. This got covered in [1]. In short: node 3 can't connect to nodes 4 and 5 but can to 1. At the same time, node 2 can connect to 4. Questions: 1) Is it real case? Where th