首页 > 解决方案 > 新节点不与 Docker Swarm 通信

问题描述

我想要一些帮助。Swarm 集群中的一个节点已经重新启动,并且在它重新加入集群后,它不会与来自其他节点的任何容器通信。

通过测试,我决定创建一台机器并加入集群,问题又出现了。

它不是防火墙,因为所有机器都有 IPTables 发布的所有端口。

我使用 Rancher 来管理 Swarm。

我得到了很多这种类型的日志:

pr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.115509948+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:jwl6mpxebzyagcwm0yefj66ue leaving:false netPeers:2 entries:4 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.115772255+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:gdbbdug3xlck2af5imjv7lkym leaving:false netPeers:2 entries:4 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.116025658+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:ufgghvl8ac8kwjzotf4af4mzi leaving:false netPeers:1 entries:3 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.116544815+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:618k5qnxoi3exmrbaalayfs40 leaving:false netPeers:1 entries:9 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.116946047+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:3pq8cumqng1b316yhh79xlz9w leaving:false netPeers:1 entries:3 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.117317571+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:acslbyqrneb292zhmuggcvss3 leaving:false netPeers:2 entries:6 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.117688429+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:50buizavpb4e15ftcxyv54je9 leaving:false netPeers:2 entries:4 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.118014507+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:jvzn1od89453s7fmv6ah4ej79 leaving:false netPeers:2 entries:4 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.616079301+01:00" level=warning msg="memberlist: Failed fallback ping: Unexpected msgType (13) from ping from=172.93.178.4:7946"
Apr 24 03:22:43 test dockerd[19266]: time="2021-04-24T03:22:43.114306895+01:00" level=info msg="memberlist: Suspect 888612b1e1ac has failed, no acks received"
Apr 24 03:22:45 test dockerd[19266]: time="2021-04-24T03:22:45.616271578+01:00" level=warning msg="memberlist: Failed fallback ping: Unexpected msgType (13) from ping from=172.93.178.4:7946"
Apr 24 03:22:46 test dockerd[19266]: time="2021-04-24T03:22:46.114555037+01:00" level=info msg="memberlist: Suspect 888612b1e1ac has failed, no acks received"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.495997298+01:00" level=info msg="memberlist: Marking 888612b1e1ac as failed, suspect timeout reached (1 peer confirmations)"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.496257637+01:00" level=info msg="Node 888612b1e1ac/172.93.178.4, left gossip cluster"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.496416687+01:00" level=info msg="Node 888612b1e1ac change state NodeActive --> NodeFailed"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.496487533+01:00" level=info msg="Node 888612b1e1ac/172.93.178.4, added to failed nodes list"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.616828709+01:00" level=warning msg="memberlist: Failed fallback ping: Unexpected msgType (13) from ping from=172.93.178.4:7946"
Apr 24 03:22:48 test dockerd[19266]: time="2021-04-24T03:22:48.114647699+01:00" level=info msg="memberlist: Suspect 888612b1e1ac has failed, no acks received"

仍在集群中的其他两台机器完美通信。

只有另外两个不在覆盖网络上通信的新的。

任何想法?谢谢!

标签: dockerdocker-swarm

解决方案


推荐阅读