emqx etcd 重启后无法加入集群

emqx etcd 4节点此前已经正常跑了16个月以上,今天重启1节点后无法加入集群

system os:

NAME="Ubuntu"
VERSION="16.04.4 LTS (Xenial Xerus)"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu 16.04.4 LTS"
VERSION_ID="16.04"
HOME_URL="http://www.ubuntu.com/"
SUPPORT_URL="http://help.ubuntu.com/"
BUG_REPORT_URL="http://bugs.launchpad.net/ubuntu/"
VERSION_CODENAME=xenial
UBUNTU_CODENAME=xenial

docker version:

Docker version 19.03.8, build afacb8b7f0

emqx docker image version:

emqx/emqx:v4.0.0

重启or docker rm 后debug日志 只有waiting emqx 然后timeout error , 这个在哪看具体错误的日志, emqx 有没有提供更详尽的日志用来分析为什么无法加入集群

2022-10-27 23:31:07.928 [info]     application: emqx_recon
    started_at: 'emqx5@172.17.110.217'
2022-10-27 23:31:07.928 [info] [Plugins] Started plugins: [recon,emqx_recon]
2022-10-27 23:31:07.928 [info] [Plugins] Load plugin emqx_recon successfully
2022-10-27 23:31:08.339 [info] Ekka(Membership): Node emqx4@172.17.110.215 up
['2022-10-27T23:31:10Z']:waiting emqx
['2022-10-27T23:31:12Z']:waiting emqx
['2022-10-27T23:31:13Z']:waiting emqx
['2022-10-27T23:31:15Z']:waiting emqx
['2022-10-27T23:31:17Z']:waiting emqx
2022-10-27 23:31:17.938 [info] Ekka(Membership): Node emqx4@172.17.110.215 down
2022-10-27 23:31:17.939 [info] Ekka(Membership): Mnesia emqx4@172.17.110.215 down
['2022-10-27T23:31:19Z']:waiting emqx
['2022-10-27T23:31:19Z']:timeout error

duplicated by docker emqx cluster join timeout 如何排查

启动顺序这一条很重要

目前有 emqx2 emqx3 emqx4 emqx5
首先是emqx5 挂了, docker restart emqx 但是无法启动 waiting timeout …
然后我又主动停了emqx4 导致emqx4 也无法启动加入集群

即使我docker rm emqx5 重新加入也不行

如果需要顺序重启 那如果我emqx5 挂了, 我就需要把所有的emqx1 emqx2 emqx3 emqx4 停掉然后启动emqx5 线上服务这样搞会很有问题

同问,这个启动顺序的问题要怎么解决呢