docker - 新节点不与 Docker Swarm 通信
问题描述
我想要一些帮助。Swarm 集群中的一个节点已经重新启动,并且在它重新加入集群后,它不会与来自其他节点的任何容器通信。
通过测试,我决定创建一台机器并加入集群,问题又出现了。
它不是防火墙,因为所有机器都有 IPTables 发布的所有端口。
我使用 Rancher 来管理 Swarm。
我得到了很多这种类型的日志:
pr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.115509948+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:jwl6mpxebzyagcwm0yefj66ue leaving:false netPeers:2 entries:4 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.115772255+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:gdbbdug3xlck2af5imjv7lkym leaving:false netPeers:2 entries:4 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.116025658+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:ufgghvl8ac8kwjzotf4af4mzi leaving:false netPeers:1 entries:3 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.116544815+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:618k5qnxoi3exmrbaalayfs40 leaving:false netPeers:1 entries:9 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.116946047+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:3pq8cumqng1b316yhh79xlz9w leaving:false netPeers:1 entries:3 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.117317571+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:acslbyqrneb292zhmuggcvss3 leaving:false netPeers:2 entries:6 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.117688429+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:50buizavpb4e15ftcxyv54je9 leaving:false netPeers:2 entries:4 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.118014507+01:00" level=info msg="NetworkDB stats test(6f5a9f02af5c) - netID:jvzn1od89453s7fmv6ah4ej79 leaving:false netPeers:2 entries:4 Queue qLen:0 netMsg/s:0"
Apr 24 03:22:42 test dockerd[19266]: time="2021-04-24T03:22:42.616079301+01:00" level=warning msg="memberlist: Failed fallback ping: Unexpected msgType (13) from ping from=172.93.178.4:7946"
Apr 24 03:22:43 test dockerd[19266]: time="2021-04-24T03:22:43.114306895+01:00" level=info msg="memberlist: Suspect 888612b1e1ac has failed, no acks received"
Apr 24 03:22:45 test dockerd[19266]: time="2021-04-24T03:22:45.616271578+01:00" level=warning msg="memberlist: Failed fallback ping: Unexpected msgType (13) from ping from=172.93.178.4:7946"
Apr 24 03:22:46 test dockerd[19266]: time="2021-04-24T03:22:46.114555037+01:00" level=info msg="memberlist: Suspect 888612b1e1ac has failed, no acks received"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.495997298+01:00" level=info msg="memberlist: Marking 888612b1e1ac as failed, suspect timeout reached (1 peer confirmations)"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.496257637+01:00" level=info msg="Node 888612b1e1ac/172.93.178.4, left gossip cluster"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.496416687+01:00" level=info msg="Node 888612b1e1ac change state NodeActive --> NodeFailed"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.496487533+01:00" level=info msg="Node 888612b1e1ac/172.93.178.4, added to failed nodes list"
Apr 24 03:22:47 test dockerd[19266]: time="2021-04-24T03:22:47.616828709+01:00" level=warning msg="memberlist: Failed fallback ping: Unexpected msgType (13) from ping from=172.93.178.4:7946"
Apr 24 03:22:48 test dockerd[19266]: time="2021-04-24T03:22:48.114647699+01:00" level=info msg="memberlist: Suspect 888612b1e1ac has failed, no acks received"
仍在集群中的其他两台机器完美通信。
只有另外两个不在覆盖网络上通信的新的。
任何想法?谢谢!
解决方案
推荐阅读
- python - python - TypeError:需要一个类似字节的对象,而不是'str'
- javascript - nuxt.js 中的碰撞检测。代码运行没有错误。但是有些功能不起作用
- laravel - Laravel 8 - Jetstream +惯性.js - Vue开发工具不起作用
- ios - 无法使用 fastlane spaceauth 请求会话:Spaceship::AccessForbiddenError
- r - 当方程相互依赖时,替代 for 循环进行快速计算
- mongodb - Mongodb 4.2.8:无法将会话添加到缓存中,因为活动会话数过多
- javascript - 使用 Javascript 和 Moment.js 获得几天
- flutter - 应用程序在颤动中打开后立即从设备读取数据
- node.js - node:internal/modules/cjs/loader:926 抛出错误;
- omnet++ - Omnet++ 在模块中抛出错误,但我找不到它