kubespray部署成功后,虚拟机重启后集群无法使用?
来源:3-2 Containerd全面上手实践

香泪
2021-08-15
[root@kubespray-1 ~]# kubectl get pods -o wide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
nginx-ds-b59cv 1/1 Running 0 2m44s 10.233.132.6 kubespray-3
nginx-ds-bbjq5 1/1 Running 0 2m44s 10.233.143.2 kubespray-1
nginx-ds-tfb5l 1/1 Running 0 2m44s 10.233.232.3 kubespray-2
[root@kubespray-1 ~]# ping 10.233.143.2
PING 10.233.143.2 (10.233.143.2) 56(84) bytes of data.
64 bytes from 10.233.143.2: icmp_seq=1 ttl=64 time=0.045 ms
64 bytes from 10.233.143.2: icmp_seq=2 ttl=64 time=0.030 ms
64 bytes from 10.233.143.2: icmp_seq=3 ttl=64 time=0.032 ms
64 bytes from 10.233.143.2: icmp_seq=4 ttl=64 time=0.047 ms
64 bytes from 10.233.143.2: icmp_seq=5 ttl=64 time=0.032 ms
64 bytes from 10.233.143.2: icmp_seq=6 ttl=64 time=0.031 ms
64 bytes from 10.233.143.2: icmp_seq=7 ttl=64 time=0.029 ms
64 bytes from 10.233.143.2: icmp_seq=8 ttl=64 time=0.041 ms
64 bytes from 10.233.143.2: icmp_seq=9 ttl=64 time=0.030 ms
^C
— 10.233.143.2 ping statistics —
9 packets transmitted, 9 received, 0% packet loss, time 8000ms
rtt min/avg/max/mdev = 0.029/0.035/0.047/0.007 ms
[root@kubespray-1 ~]# ping 10.233.132.6
PING 10.233.132.6 (10.233.132.6) 56(84) bytes of data.
^C
— 10.233.132.6 ping statistics —
8 packets transmitted, 0 received, 100% packet loss, time 7000ms
[root@kubespray-1 ~]#
查看etcd启动有问题,目前不知如何处理
[root@kubespray-1 ~]# etcdctl member list
{“level”:“warn”,“ts”:“2021-08-15T16:47:56.752+0800”,“caller”:“clientv3/retry_interceptor.go:62”,“msg”:“retrying of unary invoker failed”,“target”:“endpoint://client-3395f380-4f50-41f4-ae99-6fdd29b3c49d/127.0.0.1:2379”,“attempt”:0,“error”:“rpc error: code = DeadlineExceeded desc = latest balancer error: all SubConns are in TransientFailure, latest connection error: connection closed”}
Error: context deadline exceeded
[root@kubespray-1 ~]# systemctl status etcd
● etcd.service - etcd
Loaded: loaded (/etc/systemd/system/etcd.service; enabled; vendor preset: disabled)
Active: active (running) since 日 2021-08-15 16:10:40 CST; 42min ago
Main PID: 938 (etcd)
Tasks: 10
Memory: 97.9M
CGroup: /system.slice/etcd.service
└─938 /usr/local/bin/etcd
8月 15 16:33:47 kubespray-1 etcd[938]: rejected connection from “192.168.253.128:56668” (error “tls: client didn’t provide a certificate”, ServerName “”)
8月 15 16:34:27 kubespray-1 etcd[938]: rejected connection from “192.168.253.128:56794” (error “tls: client didn’t provide a certificate”, ServerName “”)
8月 15 16:44:48 kubespray-1 etcd[938]: rejected connection from “192.168.253.128:58556” (error “tls: client didn’t provide a certificate”, ServerName “”)
8月 15 16:47:51 kubespray-1 etcd[938]: rejected connection from “127.0.0.1:45148” (error “tls: first record does not look like a TLS handshake”, ServerName “”)
8月 15 16:47:52 kubespray-1 etcd[938]: rejected connection from “127.0.0.1:45150” (error “tls: first record does not look like a TLS handshake”, ServerName “”)
8月 15 16:47:54 kubespray-1 etcd[938]: rejected connection from “127.0.0.1:45156” (error “tls: first record does not look like a TLS handshake”, ServerName “”)
8月 15 16:47:56 kubespray-1 etcd[938]: rejected connection from “127.0.0.1:45158” (error “tls: first record does not look like a TLS handshake”, ServerName “”)
Warning: Journal has been rotated since unit was started. Log output is incomplete or unavailable.
1回答
-
刘果国
2021-08-15
先排查etcd问题,每个节点上看下完整的etcd启动日志。
022021-12-02
相似问题