EMQX error analysis

Version: EMQX 5.1.6
2023-11-09T09:44:18.314365+00:00 [error] message=channel_error driver=tcp socket="#Port<0.52>" reason="ehostunreach" action=stopping
2023-11-09T09:53:37.417172+00:00 [error] msg: gen_rpc_error, mfa: gen_rpc_acceptor:handle_event/4, line: 231, action: stopping, driver: tcp, error: channel_error, peer: 192.168.3.14:59924, reason: etimedout, socket: #Port<0.324>

There is also this one: [error] message=channel_error driver=tcp socket="#Port<0.125355>" reason="econnreset" action=stopping

This one produced a file: 2023-11-13T08:54:30.140544+00:00 [error] crasher: initial call: disk_log:init/2, pid: <0.3561.0>, registered_name: , exit: {{{failed,{error,{file_error,"/opt/emqx/data/mnesia/emqx@192.168.3.13/PREVIOUS.LOG",enoent}}},[{disk_log,reopen,2}]},[{disk_log,do_exit,4,[{file,"disk_log.erl"},{line,1175}]},{proc_lib,init_p_do_apply,3,[{file,"proc_lib.erl"},{line,240}]}]}, ancestors: [disk_log_sup,kernel_safe_sup,kernel_sup,<0.1987.0>], message_queue_len: 0, messages: , links: [<0.2036.0>], dictionary: [{quiet,false},{write_cache_timer_is_running,true}], trap_exit: true, status: running, heap_size: 1598, stack_size: 28, reductions: 2678332; neighbours:
2023-11-13T08:54:30.143207+00:00 [error] Supervisor: {local,disk_log_sup}. Context: child_terminated. Reason: {{failed,{error,{file_error,"/opt/emqx/data/mnesia/emqx@192.168.3.13/PREVIOUS.LOG",enoent}}},[{disk_log,reopen,2}]}. Offender: id=disk_log,pid=<0.3561.0>.
2023-11-13T08:54:30.259483+00:00 [error] Mnesia('emqx@192.168.3.13'): ** ERROR ** (core dumped to file: "/opt/emqx/MnesiaCore.emqx@192.168.3.13_1699_865670_257626"), ** FATAL ** {error,{"Cannot rename disk_log file",latest_log,"/opt/emqx/data/mnesia/

Please post logs that are as complete as possible. With this little information the cause cannot be determined; I can only guess.

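In the post that was just deleted, I think I saw "partition network".
Taken together with "ehostunreach", this suggests that a network problem may have caused a split-brain in the cluster.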
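
To make the network reading a bit more concrete: the reason values in the channel_error lines (ehostunreach, etimedout, econnreset) look like lowercase POSIX errno names, so they can be mapped to the operating system's own descriptions. Below is a minimal Python sketch of that lookup. The function name posix_reasons and the regular expression are my own illustration, not part of EMQX, and the pattern only covers the two log layouts pasted in this thread.

```python
import errno
import os
import re

# Matches both layouts seen in this thread:
#   reason="ehostunreach"   (key=value style, straight or curly quote)
#   reason: etimedout       (key: value style)
REASON_RE = re.compile(r'reason[=:]\s*["“]?([a-z]+)')


def posix_reasons(log_text):
    """Return {reason: OS description} for each POSIX errno name in the logs."""
    found = {}
    for atom in REASON_RE.findall(log_text):
        # Erlang reports POSIX errors as lowercase atoms; Python's errno
        # module exposes the same names in upper case.
        code = getattr(errno, atom.upper(), None)
        if code is not None:
            found[atom] = os.strerror(code)
    return found


if __name__ == "__main__":
    sample = (
        'message=channel_error driver=tcp reason="ehostunreach" action=stopping\n'
        "error: channel_error, peer: 192.168.3.14:59924, reason: etimedout\n"
        'message=channel_error driver=tcp reason="econnreset" action=stopping\n'
    )
    for reason, text in posix_reasons(sample).items():
        print(f"{reason}: {text}")
```

All three are errors on the TCP connection between nodes (host unreachable, connection timed out, connection reset), which fits the guess about the network but does not prove it; the full logs are still needed.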