ubuntu - 由于缺少 ~/.kube/config,oc cluster up 无法启动 kube-apiserver
问题描述
我尝试在我的桌面(Ubuntu 18)上运行 OKD。我遵循说明:https ://opensource.com/article/18/11/local-okd-cluster-linux (类似)。
- 我安装了 Docker:
$ docker version
...
Version: 19.03.12
- 不安全的注册表:
$ sudo cat /etc/docker/daemon.json
{
"insecure-registries" : [ "172.30.0.0/16" ]
}
- 重启 Docker 守护进程:
$ docker info
...
Insecure Registries:
172.30.0.0/16
127.0.0.0/8
- 禁用防火墙:
$ sudo ufw status
Status: inactive
- 下载的 OKD 客户端工具:
$ ./oc version
oc v3.11.0+0cbc58b
kubernetes v1.11.0+d4cacc0
features: Basic-Auth GSSAPI Kerberos SPNEGO
- 我尝试启动集群,但它失败了。
$ ./oc cluster up
Getting a Docker client ...
Checking if image openshift/origin-control-plane:v3.11 is available ...
Creating shared mount directory on the remote host ...
Determining server IP ...
Checking if OpenShift is already running ...
Checking for supported Docker version (=>1.22) ...
Checking if insecured registry is configured properly in Docker ...
Checking if required ports are available ...
Checking if OpenShift client is configured properly ...
Checking if image openshift/origin-control-plane:v3.11 is available ...
Starting OpenShift using openshift/origin-control-plane:v3.11 ...
I1104 16:54:53.631254 6317 config.go:40] Running "create-master-config"
I1104 16:54:56.048019 6317 config.go:46] Running "create-node-config"
I1104 16:54:57.639381 6317 flags.go:30] Running "create-kubelet-flags"
I1104 16:54:58.559780 6317 run_kubelet.go:49] Running "start-kubelet"
I1104 16:54:58.862023 6317 run_self_hosted.go:181] Waiting for the kube-apiserver to be ready ...
漫长的等待,然后:
E1104 16:59:58.864017 6317 run_self_hosted.go:571] API server error: Get https://127.0.0.1:8443/healthz?timeout=32s: dial tcp 127.0.0.1:8443: connect: connection refused ()
Error: timed out waiting for the condition
具有更高的日志记录级别(我知道我必须删除openshift.local.clusterup
目录或--base-dir
在我想要更新时通过oc cluster up
):
$ ./oc cluster up --loglevel=5
...
I1104 17:07:50.991364 14512 run_self_hosted.go:181] Waiting for the kube-apiserver to be ready
I1104 17:07:50.992053 14512 run_self_hosted.go:557] Server isn't healthy yet. Waiting a little while. Get https://127.0.0.1:8443/healthz?timeout=32s: dial tcp 127.0.0.1:8443: connect: connection refused
I1104 17:07:51.992467 14512 run_self_hosted.go:557] Server isn't healthy yet. Waiting a little while. Get https://127.0.0.1:8443/healthz?timeout=32s: dial tcp 127.0.0.1:8443: connect: connection refused
I1104 17:07:52.993484 14512 run_self_hosted.go:557] Server isn't healthy yet. Waiting a little while. Get https://127.0.0.1:8443/healthz?timeout=32s: dial tcp 127.0.0.1:8443: connect: connection refused
...
I1104 17:08:10.992682 14512 run_self_hosted.go:557] Server isn't healthy yet. Waiting a little while. Get https://127.0.0.1:8443/healthz?timeout=32s: net/http: TLS handshake timeout
...
error: unable to recognize "/namespace.yaml": Get https://127.0.0.1:8443/api?timeout=32s: dial tcp 127.0.0.1:8443: connect: connection refused
...
The connection to the server 127.0.0.1:8443 was refused - did you specify the right host or port?
...
E1104 17:08:52.435348 14512 interface.go:34] Failed to install "openshift-service-cert-signer-operator": failed to install "openshift-service-cert-signer-operator": cannot create container using image openshift/origin-cli:v3.11; caused by: cannot create container using image openshift/origin-cli:v3.11
E1104 17:08:53.087022 14512 interface.go:34] Failed to install "kube-dns": failed to install "kube-dns": cannot create container using image openshift/origin-cli:v3.11; caused by: cannot create container using image openshift/origin-cli:v3.11
I1104 17:08:53.087047 14512 interface.go:41] Finished installing "kube-proxy" "kube-dns" "openshift-service-cert-signer-operator" "openshift-apiserver"
Error: [failed to install "kube-proxy": cannot create container using image openshift/origin-cli:v3.11; caused by: cannot create container using image openshift/origin-cli:v3.11, failed to install "openshift-apiserver": cannot create container using image openshift/origin-cli:v3.11; caused by: cannot create container using image openshift/origin-cli:v3.11, failed to install "openshift-service-cert-signer-operator": cannot create container using image openshift/origin-cli:v3.11; caused by: cannot create container using image openshift/origin-cli:v3.11, failed to install "kube-dns": cannot create container using image openshift/origin-cli:v3.11; caused by: cannot create container using image openshift/origin-cli:v3.11]
我尝试检查出了什么问题,我注意到缺少配置存在问题。
$ ./oc cluster status
Error: invalid configuration: Missing or incomplete configuration info. Please login or point to an existing, complete config file:
1. Via the command-line flag --config
2. Via the KUBECONFIG environment variable
3. In your home directory as ~/.kube/config
To view or setup config directly use the 'config' command.
我没有环境变量:
$ echo $KUBECONFIG
而且我没有~/.kube/config
文件:
$ cat ~/.kube/config
cat: /home/my-username/.kube/config: No such file or directory
$ ls ~/.kube/
ls: cannot access '/home/my-username/.kube/': No such file or directory
我知道oc cluster up
应该创建~/.kube/config
. 但就我而言,它不会创建。
由于缺少配置,甚至kubectl
无法正常工作(我认为它应该适用于从二进制https://kubernetes.io/docs/tasks/tools/install-kubectl/#install-kubectl-binary-with-curl-进行的典型 kubectl 安装在 Linux 上):
$ ./kubectl version --client
error: no configuration has been provided
$ ./kubectl config view
apiVersion: v1
clusters: []
contexts: []
current-context: ""
kind: Config
preferences: {}
users: []
解决方案
我解决了这个问题。
我有禁用防火墙(ufw)的 Ubuntu。但是 iptables 仍然有效。
命令sudo iptables -L
向我展示了这四个规则:
Chain KUBE-SERVICES (1 references)
target prot opt source destination
REJECT tcp -- anywhere 172.30.237.36 /* default/router:80-tcp has no endpoints */ tcp dpt:http reject-with icmp-port-unreachable
REJECT tcp -- anywhere 172.30.1.1 /* default/docker-registry:5000-tcp has no endpoints */ tcp dpt:5000 reject-with icmp-port-unreachable
REJECT tcp -- anywhere 172.30.237.36 /* default/router:443-tcp has no endpoints */ tcp dpt:https reject-with icmp-port-unreachable
REJECT tcp -- anywhere 172.30.237.36 /* default/router:1936-tcp has no endpoints */ tcp dpt:1936 reject-with icmp-port-unreachable
我不知道它们来自哪里(我可以猜到 - 下面)。我决定删除它。
sudo iptables -L --line-numbers
sudo iptables -D KUBE-SERVICES 1
sudo iptables -D KUBE-SERVICES 1
sudo iptables -D KUBE-SERVICES 1
sudo iptables -D KUBE-SERVICES 1
并重新启动系统(确保重新加载 iptables)。
在./oc cluster up
它启动并~/.kube/config
创建之后。
Server Information ...
OpenShift server started.
The server is accessible via web console at:
https://127.0.0.1:8443
我猜规则可能来自oc cluster up
我在开始时运行一次而不添加 (我"insecure-registries" : [ "172.30.0.0/16" ]
试图/etc/docker/daemon.json
检查这是否是强制性的)
推荐阅读
- amazon-web-services - 在 k8s 中每分钟运行一个 cronjob 不起作用
- excel - VBA自动将行号增加到Excel中的新列
- r - R Shiny:是否有可能只监控输入的一个子集?
- docker - Docker Windows 容器和 NFS 卷
- python-3.x - groupby 当列包含需要添加的多维数组时
- regex - 如何在java中修剪以字符开头和结尾的特定行
- spring-boot - 如何设置 Spring Cloud Gateway 应用程序以便它可以使用 Spring Cloud Kubernetes 的服务发现?
- jpa - 当外键列为空时,jpa 标准中的多选会引发非法参数异常
- regex - 使用 bash 对文件进行 Grep 和正则表达式
- docker - 确保 docker 容器有证书