news 2026/5/5 13:49:03

k8s部署metrics-server

作者头像

张小明

前端开发工程师

1.2k 24
文章封面图
k8s部署metrics-server

k8s部署metrics-server是 Kubernetes 实现资源监控(如kubectl top、HPA 自动扩缩容)的核心组件,在部署过程中遇到过以下问题

  • 镜像拉取失败(k8s.gcr.io镜像国内无法访问);
  • 证书验证问题(需跳过 TLS 验证或配置正确证书);
  • API Server 连接问题(需指定kubelet-insecure-tls)。

部署步骤如下

1.步骤 1:下载官方部署文件(并修改)

# 下载官方 yaml(也可手动创建) wget https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml -O metrics-server.yaml

2.步骤 2:修改metrics-server.yaml关键配置

打开metrics-server.yaml,做以下 3 处核心修改:

# 原镜像(国内无法访问) # image: k8s.gcr.io/metrics-server/metrics-server:v0.7.0 # 替换为阿里云镜像(适配 v0.7.0 版本) image: registry.cn-hangzhou.aliyuncs.com/google_containers/metrics-server:v0.7.0

3. 添加启动参数(解决证书 / 连接问题)

Deploymentargs部分,新增以下参数(关键!):

spec: template: spec: containers: - name: metrics-server args: - --cert-dir=/tmp - --secure-port=4443 # 新增以下 3 个参数 - --kubelet-insecure-tls # 跳过 kubelet TLS 验证(测试环境推荐,生产建议配置证书) - --kubelet-preferred-address-types=InternalIP,ExternalIP,Hostname # 指定 kubelet 地址类型 - --metric-resolution=15s # 监控数据采集间隔

4.可选:调整资源限制(根据集群规模)

resources: requests: cpu: 100m memory: 100Mi limits: cpu: 500m memory: 512Mi

5.部署metrics-server

kubectl apply -f metrics-server.yaml

6.验证部署

kubectl get pods -n kube-system -l k8s-app=metrics-server # 正常输出(STATUS 为 Running): # NAME READY STATUS RESTARTS AGE # metrics-server-7f987d68c4-9x8zl 1/1 Running 0 5m
检查 Pod 日志(排查启动失败)
kubectl logs -n kube-system $(kubectl get pods -n kube-system -l k8s-app=metrics-server -o name) # 常见日志错误及解决: # - "x509: certificate signed by unknown authority" → 确认已加 --kubelet-insecure-tls # - "unable to reach kubelet" → 检查 --kubelet-preferred-address-types 参数 # - "image pull failed" → 确认镜像地址正确
验证 API 可用性(核心!)

metrics-server会注册metrics.k8s.ioAPI,检查是否正常:

# 查看节点资源使用 kubectl top nodes # 输出示例: # NAME CPU(cores) CPU% MEMORY(bytes) MEMORY% # k8s-master 123m 6% 1200Mi 30% # k8s-node1 89m 4% 980Mi 25% # 查看 Pod 资源使用 kubectl top pods -n kube-system # 输出包含 metrics-server 自身的资源占用

二.本次部署环境使用修改后的yaml文件如下,可直接使用

apiVersion: v1 kind: ServiceAccount metadata: labels: k8s-app: metrics-server name: metrics-server namespace: kube-system --- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRole metadata: labels: k8s-app: metrics-server rbac.authorization.k8s.io/aggregate-to-admin: "true" rbac.authorization.k8s.io/aggregate-to-edit: "true" rbac.authorization.k8s.io/aggregate-to-view: "true" name: system:aggregated-metrics-reader rules: - apiGroups: - metrics.k8s.io resources: - pods - nodes verbs: - get - list - watch --- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRole metadata: labels: k8s-app: metrics-server name: system:metrics-server rules: - apiGroups: - "" resources: - pods - nodes - nodes/stats - namespaces - configmaps verbs: - get - list - watch --- apiVersion: rbac.authorization.k8s.io/v1 kind: RoleBinding metadata: labels: k8s-app: metrics-server name: metrics-server-auth-reader namespace: kube-system roleRef: apiGroup: rbac.authorization.k8s.io kind: Role name: extension-apiserver-authentication-reader subjects: - kind: ServiceAccount name: metrics-server namespace: kube-system --- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRoleBinding metadata: labels: k8s-app: metrics-server name: metrics-server:system:auth-delegator roleRef: apiGroup: rbac.authorization.k8s.io kind: ClusterRole name: system:auth-delegator subjects: - kind: ServiceAccount name: metrics-server namespace: kube-system --- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRoleBinding metadata: labels: k8s-app: metrics-server name: system:metrics-server roleRef: apiGroup: rbac.authorization.k8s.io kind: ClusterRole name: system:metrics-server subjects: - kind: ServiceAccount name: metrics-server namespace: kube-system --- apiVersion: v1 kind: Service metadata: labels: k8s-app: metrics-server name: metrics-server namespace: kube-system spec: ports: - name: https port: 443 protocol: TCP targetPort: 8443 selector: k8s-app: metrics-server --- apiVersion: apps/v1 kind: Deployment metadata: labels: k8s-app: metrics-server name: metrics-server namespace: kube-system spec: selector: matchLabels: k8s-app: metrics-server strategy: rollingUpdate: maxUnavailable: 0 template: metadata: labels: k8s-app: metrics-server spec: containers: - args: - --cert-dir=/tmp - --secure-port=8443 - --kubelet-preferred-address-types=InternalIP,ExternalIP,Hostname - --kubelet-use-node-status-port - --metric-resolution=15s - --kubelet-insecure-tls - --authorization-always-allow-paths=/livez,/readyz image: swr.cn-east-2.myhuaweicloud.com/kuboard-dependency/metrics-server:v0.5.0 imagePullPolicy: IfNotPresent livenessProbe: failureThreshold: 3 httpGet: path: /livez port: https scheme: HTTPS periodSeconds: 10 name: metrics-server ports: - containerPort: 8443 name: https protocol: TCP readinessProbe: failureThreshold: 3 httpGet: path: /readyz port: https scheme: HTTPS initialDelaySeconds: 20 periodSeconds: 10 resources: requests: cpu: 100m memory: 200Mi securityContext: readOnlyRootFilesystem: true runAsNonRoot: true runAsUser: 1000 volumeMounts: - mountPath: /tmp name: tmp-dir nodeSelector: kubernetes.io/os: linux priorityClassName: system-cluster-critical serviceAccountName: metrics-server volumes: - emptyDir: {} name: tmp-dir --- apiVersion: apiregistration.k8s.io/v1 kind: APIService metadata: labels: k8s-app: metrics-server name: v1beta1.metrics.k8s.io spec: group: metrics.k8s.io groupPriorityMinimum: 100 insecureSkipTLSVerify: true service: name: metrics-server namespace: kube-system version: v1beta1 versionPriority: 100
版权声明: 本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若内容造成侵权/违法违规/事实不符,请联系邮箱:809451989@qq.com进行投诉反馈,一经查实,立即删除!
网站建设 2026/5/2 22:31:28

SeedVR2视频修复模型深度解析:从技术原理到实战应用

SeedVR2视频修复模型深度解析:从技术原理到实战应用 【免费下载链接】SeedVR2-7B 项目地址: https://ai.gitcode.com/hf_mirrors/ByteDance-Seed/SeedVR2-7B 在当今视频内容爆炸式增长的时代,如何高效处理低质量视频素材成为创作者面临的核心挑战…

作者头像 李华
网站建设 2026/4/26 2:40:56

多模态模型CLIP详解

论文:Learning Transferable Visual Models From Natural Language SupervisionCLIP 的全称是 Contrastive Language-Image Pre-training(对比语言-图像预训练)。它是由 OpenAI 在 2021 年提出的一个多模态人工智能模型。其核心思想是通过学习…

作者头像 李华
网站建设 2026/5/3 13:20:37

EnergyPlus建筑能耗模拟完全指南:掌握核心技术

EnergyPlus作为业界领先的建筑能源模拟解决方案,为建筑节能设计提供了全面的技术支撑。本指南将深入解析其核心功能与应用技巧,帮助您快速掌握这一强大工具。 【免费下载链接】EnergyPlus EnergyPlus™ is a whole building energy simulation program t…

作者头像 李华
网站建设 2026/4/23 22:40:12

为什么90%的Q#开发者忽略了VSCode中的覆盖率指标?

第一章:Q# 程序的 VSCode 代码覆盖率在量子计算开发中,确保 Q# 程序的质量至关重要。Visual Studio Code(VSCode)作为主流开发环境,结合扩展工具可实现对 Q# 代码的覆盖率分析,帮助开发者识别未测试的量子逻…

作者头像 李华
网站建设 2026/5/3 21:54:59

如何实现电脑音频无线投送到手机?跨设备同步终极指南

如何实现电脑音频无线投送到手机?跨设备同步终极指南 【免费下载链接】AudioShare 将Windows的音频在其他Android设备上实时播放。Share windows audio 项目地址: https://gitcode.com/gh_mirrors/audi/AudioShare 还在为设备间的音频壁垒而烦恼吗&#xff1…

作者头像 李华