news 2026/6/23 21:33:37

Deploying metrics-server on Kubernetes


张小明

Front-end developer


metrics-server is the core component behind Kubernetes resource monitoring (for example kubectl top and HPA autoscaling). I ran into the following problems while deploying it:

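  • Image pull failure (the k8s.gcr.io registry is not reachable from mainland China);
  • Certificate verification failure (either skip TLS verification or configure proper certificates);
  • Connection problems when metrics-server scrapes the kubelets (requires the --kubelet-insecure-tls flag).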

Deployment steps:

1. Download the official manifest (to be modified in the next steps)

# Download the official YAML (you can also create it by hand)
wget https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml -O metrics-server.yaml
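The download above always fetches the latest release, while the later steps assume v0.7.0. The sketch below builds the URL for a pinned release tag instead. The function name metrics_server_manifest_url is hypothetical and introduced only for illustration; the tagged URL follows GitHub's standard release-asset layout.

```shell
# Build the components.yaml download URL for a metrics-server release.
# metrics_server_manifest_url is a hypothetical helper, not part of any tool.
# "latest" reproduces the URL used above; any other value is treated as a tag.
metrics_server_manifest_url() {
  tag="${1:-latest}"
  base="https://github.com/kubernetes-sigs/metrics-server/releases"
  if [ "$tag" = "latest" ]; then
    echo "$base/latest/download/components.yaml"
  else
    echo "$base/download/$tag/components.yaml"
  fi
}

# Example (needs network access):
#   wget "$(metrics_server_manifest_url v0.7.0)" -O metrics-server.yaml
```

Pinning the tag keeps the downloaded manifest and the mirrored image version in step 2 in agreement.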

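2. Modify the key settings in metrics-server.yaml

Open metrics-server.yaml and make the following three core changes (image, startup arguments, and optionally resource limits):

# Original image (not reachable from mainland China)
# image: k8s.gcr.io/metrics-server/metrics-server:v0.7.0
# Replace with the Aliyun mirror (matching version v0.7.0)
image: registry.cn-hangzhou.aliyuncs.com/google_containers/metrics-server:v0.7.0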
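If you prefer not to edit the file by hand, the image line can be rewritten with sed. This is a minimal sketch: the function name replace_metrics_server_image is hypothetical, and it assumes the manifest contains a single image: line whose value mentions metrics-server.

```shell
# replace_metrics_server_image FILE NEW_IMAGE
# Hypothetical helper: rewrites any "image:" line whose value mentions
# metrics-server, keeping the original indentation. The result goes to
# stdout, so the input file is left untouched.
replace_metrics_server_image() {
  file="$1"
  new_image="$2"
  sed -E "s|^([[:space:]]*(- )?image:[[:space:]]*).*metrics-server.*\$|\1${new_image}|" "$file"
}

# Example:
#   replace_metrics_server_image metrics-server.yaml \
#     registry.cn-hangzhou.aliyuncs.com/google_containers/metrics-server:v0.7.0 \
#     > metrics-server.patched.yaml
```

Writing to a new file makes it easy to diff the result against the original before applying it.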

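3. Add startup arguments (fixes the certificate and connection problems)

In the args section of the Deployment, add the following arguments (critical!):

spec:
  template:
    spec:
      containers:
      - name: metrics-server
        args:
        - --cert-dir=/tmp
        - --secure-port=4443  # keep the value from your manifest; it must match the containerPort
        # Add the following 3 arguments
        - --kubelet-insecure-tls  # skip kubelet TLS verification (fine for test environments; configure certificates in production)
        - --kubelet-preferred-address-types=InternalIP,ExternalIP,Hostname  # address types used to reach the kubelet, in order of preference
        - --metric-resolution=15s  # metrics scrape interval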
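When metrics-server is already running, the same flag can be appended without editing the file, using a JSON Patch with kubectl patch. The sketch below only generates the patch document; the helper name json_patch_add_arg is hypothetical, and the path assumes metrics-server is the first container in the pod template.

```shell
# json_patch_add_arg FLAG
# Hypothetical helper: prints a JSON Patch that appends FLAG to the args
# list of the first container in a Deployment's pod template.
json_patch_add_arg() {
  printf '[{"op":"add","path":"/spec/template/spec/containers/0/args/-","value":"%s"}]\n' "$1"
}

# Example (needs cluster access):
#   kubectl patch deployment metrics-server -n kube-system --type=json \
#     -p "$(json_patch_add_arg --kubelet-insecure-tls)"
```

A patch applied this way is lost if the original manifest is re-applied later, so the YAML file should still be updated to match.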

4. Optional: adjust resource limits (according to cluster size)

resources:
  requests:
    cpu: 100m
    memory: 100Mi
  limits:
    cpu: 500m
    memory: 512Mi

5. Deploy metrics-server

kubectl apply -f metrics-server.yaml
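Right after kubectl apply the Pod may still be starting, so it can help to block until the rollout finishes before verifying anything. A minimal sketch, assuming the Deployment is named metrics-server in kube-system as in the official manifest; the wrapper name wait_for_metrics_server and the default timeout of 120s are illustrative choices.

```shell
# wait_for_metrics_server [TIMEOUT]
# Hypothetical wrapper around "kubectl rollout status": waits until the
# metrics-server Deployment has rolled out, or fails after TIMEOUT.
wait_for_metrics_server() {
  timeout="${1:-120s}"
  if kubectl rollout status deployment/metrics-server -n kube-system --timeout="$timeout"; then
    echo "metrics-server rollout complete"
  else
    echo "metrics-server rollout did not finish within $timeout" >&2
    return 1
  fi
}

# Example (needs cluster access):
#   kubectl apply -f metrics-server.yaml && wait_for_metrics_server 180s
```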

6. Verify the deployment

kubectl get pods -n kube-system -l k8s-app=metrics-server
# Expected output (STATUS is Running):
# NAME                              READY   STATUS    RESTARTS   AGE
# metrics-server-7f987d68c4-9x8zl   1/1     Running   0          5m

Check the Pod logs (to troubleshoot startup failures)

kubectl logs -n kube-system deployment/metrics-server
# Common errors and fixes:
# - "x509: certificate signed by unknown authority" → confirm --kubelet-insecure-tls has been added
# - "unable to reach kubelet" → check the --kubelet-preferred-address-types argument
# - "image pull failed" → confirm the image address is correct (pull errors show up in the events of kubectl describe pod, not in the container logs)
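The error-to-fix list above can be turned into a small filter that reads log text and prints the matching hints. This is only a sketch: suggest_fix is a hypothetical name, and the matched strings are taken from the list above, so real log messages may be worded differently.

```shell
# suggest_fix: reads log text on stdin and prints one hint per known error.
# Hypothetical helper; patterns mirror the troubleshooting list above.
suggest_fix() {
  logs="$(cat)"
  case "$logs" in
    *"x509: certificate signed by unknown authority"*)
      echo "TLS: confirm --kubelet-insecure-tls is set" ;;
  esac
  case "$logs" in
    *"unable to reach kubelet"*)
      echo "Network: check --kubelet-preferred-address-types" ;;
  esac
  case "$logs" in
    *"image pull failed"*)
      echo "Image: confirm the image address is correct" ;;
  esac
}

# Example (needs cluster access):
#   kubectl logs -n kube-system deployment/metrics-server | suggest_fix
```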

Verify API availability (essential!)

metrics-server registers the metrics.k8s.io API; check that it is working:

# Show node resource usage
kubectl top nodes
# Sample output:
# NAME         CPU(cores)   CPU%   MEMORY(bytes)   MEMORY%
# k8s-master   123m         6%     1200Mi          30%
# k8s-node1    89m          4%     980Mi           25%

# Show Pod resource usage
kubectl top pods -n kube-system
# The output includes metrics-server's own resource usage
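If kubectl top fails, it can be useful to check the registered APIService directly. The sketch below reads the Available condition of v1beta1.metrics.k8s.io, which is the APIService name used by the official manifest; the function name metrics_api_available is hypothetical.

```shell
# metrics_api_available
# Hypothetical helper: succeeds only when the metrics.k8s.io APIService
# reports the condition Available=True.
metrics_api_available() {
  status="$(kubectl get apiservice v1beta1.metrics.k8s.io \
    -o jsonpath='{.status.conditions[?(@.type=="Available")].status}' 2>/dev/null)"
  [ "$status" = "True" ]
}

# Example (needs cluster access):
#   metrics_api_available && echo "metrics API is available"
#   # The raw API can also be queried directly:
#   kubectl get --raw /apis/metrics.k8s.io/v1beta1/nodes
```

An APIService that stays unavailable usually means the API Server cannot reach the metrics-server Service, so the Pod logs from the previous step are the next place to look.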

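II. The modified YAML file used in this deployment, ready to use as-is (note that it pulls a v0.5.0 image from a Huawei Cloud mirror rather than the v0.7.0 Aliyun image shown in step 2)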

apiVersion: v1
kind: ServiceAccount
metadata:
  labels:
    k8s-app: metrics-server
  name: metrics-server
  namespace: kube-system
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  labels:
    k8s-app: metrics-server
    rbac.authorization.k8s.io/aggregate-to-admin: "true"
    rbac.authorization.k8s.io/aggregate-to-edit: "true"
    rbac.authorization.k8s.io/aggregate-to-view: "true"
  name: system:aggregated-metrics-reader
rules:
- apiGroups:
  - metrics.k8s.io
  resources:
  - pods
  - nodes
  verbs:
  - get
  - list
  - watch
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  labels:
    k8s-app: metrics-server
  name: system:metrics-server
rules:
- apiGroups:
  - ""
  resources:
  - pods
  - nodes
  - nodes/stats
  - namespaces
  - configmaps
  verbs:
  - get
  - list
  - watch
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  labels:
    k8s-app: metrics-server
  name: metrics-server-auth-reader
  namespace: kube-system
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: extension-apiserver-authentication-reader
subjects:
- kind: ServiceAccount
  name: metrics-server
  namespace: kube-system
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  labels:
    k8s-app: metrics-server
  name: metrics-server:system:auth-delegator
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: system:auth-delegator
subjects:
- kind: ServiceAccount
  name: metrics-server
  namespace: kube-system
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  labels:
    k8s-app: metrics-server
  name: system:metrics-server
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: system:metrics-server
subjects:
- kind: ServiceAccount
  name: metrics-server
  namespace: kube-system
---
apiVersion: v1
kind: Service
metadata:
  labels:
    k8s-app: metrics-server
  name: metrics-server
  namespace: kube-system
spec:
  ports:
  - name: https
    port: 443
    protocol: TCP
    targetPort: 8443
  selector:
    k8s-app: metrics-server
---
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    k8s-app: metrics-server
  name: metrics-server
  namespace: kube-system
spec:
  selector:
    matchLabels:
      k8s-app: metrics-server
  strategy:
    rollingUpdate:
      maxUnavailable: 0
  template:
    metadata:
      labels:
        k8s-app: metrics-server
    spec:
      containers:
      - args:
        - --cert-dir=/tmp
        - --secure-port=8443
        - --kubelet-preferred-address-types=InternalIP,ExternalIP,Hostname
        - --kubelet-use-node-status-port
        - --metric-resolution=15s
        - --kubelet-insecure-tls
        - --authorization-always-allow-paths=/livez,/readyz
        image: swr.cn-east-2.myhuaweicloud.com/kuboard-dependency/metrics-server:v0.5.0
        imagePullPolicy: IfNotPresent
        livenessProbe:
          failureThreshold: 3
          httpGet:
            path: /livez
            port: https
            scheme: HTTPS
          periodSeconds: 10
        name: metrics-server
        ports:
        - containerPort: 8443
          name: https
          protocol: TCP
        readinessProbe:
          failureThreshold: 3
          httpGet:
            path: /readyz
            port: https
            scheme: HTTPS
          initialDelaySeconds: 20
          periodSeconds: 10
        resources:
          requests:
            cpu: 100m
            memory: 200Mi
        securityContext:
          readOnlyRootFilesystem: true
          runAsNonRoot: true
          runAsUser: 1000
        volumeMounts:
        - mountPath: /tmp
          name: tmp-dir
      nodeSelector:
        kubernetes.io/os: linux
      priorityClassName: system-cluster-critical
      serviceAccountName: metrics-server
      volumes:
      - emptyDir: {}
        name: tmp-dir
---
apiVersion: apiregistration.k8s.io/v1
kind: APIService
metadata:
  labels:
    k8s-app: metrics-server
  name: v1beta1.metrics.k8s.io
spec:
  group: metrics.k8s.io
  groupPriorityMinimum: 100
  insecureSkipTLSVerify: true
  service:
    name: metrics-server
    namespace: kube-system
  version: v1beta1
  versionPriority: 100