1 Introduction
VPA does not directly manage a Pod's resource limits; it adjusts the Pod's resource requests (if limits are set, they are only scaled to preserve the configured limit-to-request ratio, as described in the Recommender section below).
Why use VPA:
- Pods request exactly what they need, which improves cluster node utilization;
- you no longer have to run benchmark jobs to work out suitable CPU and memory requests;
- VPA can adjust CPU and memory requests at any time without manual intervention, reducing maintenance effort.
VPA is not yet production-ready, so before using it you should understand how resource adjustment affects your applications.
2 How VPA Works
2.1 Workflow
First, the Recommender computes the resources the application is likely to need based on its current and historical resource usage. If the computed values differ from the current requests, it issues a resource adjustment recommendation.
The Updater then acts on these recommendations as follows:
- 1) The Updater sees from the recommendation that an adjustment is needed and calls the API to evict the Pod.
- 2) The evicted Pod is recreated; during recreation the VPA Admission Controller intercepts the creation request and rewrites the Pod's resource requests according to the recommendation.
- 3) The rebuilt Pod therefore comes up with the recommended resource requests.
As this flow shows, adjusting resource requests requires rebuilding the Pod, which is a disruptive operation. This is one reason VPA is not yet production-ready.
2.2 VPA Architecture
VPA consists of two main components (a minimal manifest sketch showing the fields they act on follows this list):
- 1) VPA Controller
  - Recommender: produces resource adjustment recommendations for Pods
  - Updater: compares the recommended values with the current values and evicts the Pod when they differ
- 2) VPA Admission Controller
  - rewrites the Pod's resource requests to the recommended values when the Pod is recreated
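A minimal sketch of the object these components share (the Deployment name my-app is a hypothetical placeholder; the full examples in section 4 use the same fields):
apiVersion: autoscaling.k8s.io/v1beta2
kind: VerticalPodAutoscaler
metadata:
  name: my-app-vpa
spec:
  targetRef:              # what the Recommender watches
    apiVersion: apps/v1
    kind: Deployment
    name: my-app          # hypothetical workload name
  updatePolicy:
    updateMode: "Auto"    # lets the Updater evict Pods; the Admission
                          # Controller then patches the recreated Pods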
2.2.1 VPA Recommender
- Watches resource utilization and computes target values.
- Looks at the metrics history, OOM events, and the VPA object's spec, and suggests fair requests; limits are raised/lowered according to the limit-to-request ratio defined for the container.
2.2.2 VPA Updater
- Evicts Pods that need new resource settings.
- If updateMode: Auto is defined, enforces whatever the Recommender recommends.
2.2.3 VPA Admission Controller
- Whenever the VPA Updater evicts and restarts a Pod, it changes the CPU and memory settings (via a mutating webhook) before the new Pod starts (a quick way to verify the webhook registration is shown below).
- When the VerticalPodAutoscaler is set to updateMode: Auto and a Pod's resource requests need to change, the Pod is evicted. Because of how Kubernetes is designed, the only way to modify the resource requests of a running Pod is to recreate it.
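A quick check that the mutating webhook is in place (the configuration name vpa-webhook-config is what the upstream installer registers; adjust if your installation differs):
$ kubectl get mutatingwebhookconfigurations | grep vpa
$ kubectl get mutatingwebhookconfiguration vpa-webhook-config -o yaml | grep -A5 clientConfig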
2.3 Recommender Design
The recommendation model (MVP) assumes that memory and CPU utilization are independent random variables whose distribution equals the one observed over the last N days (N = 8 is recommended, to capture weekly peaks). The two goals are also written out as probabilistic constraints after this list.
- For CPU, the goal is to keep the fraction of time during which container usage exceeds a high percentage (e.g. 95%) of the request below a threshold (e.g. 1% of the time).
  - In this model, CPU usage is defined as the average usage measured over a short interval. The shorter the measurement interval, the better the recommendation quality for spiky, latency-sensitive workloads.
  - The minimum reasonable resolution is 1/min; 1/sec is recommended.
- For memory, the goal is to keep the probability that container usage exceeds the request within a given time window below a threshold (e.g. below 1% over 24 hours).
  - The window must be long (≥ 24 h) so that OOM-induced evictions do not visibly affect (a) the availability of serving applications or (b) the progress of batch jobs (a more advanced model could let the user specify an SLO to control this).
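Written slightly more formally (a sketch of the two constraints above; the symbols $u_{\text{cpu}}(t)$, $u_{\text{mem}}(t)$ for measured usage and $r$ for the recommended request are introduced here for illustration only):

$\Pr\big[\,u_{\text{cpu}}(t) > 0.95\, r_{\text{cpu}}\,\big] \le 1\%$ over the trailing $N = 8$ days,

$\Pr\big[\,\exists\, t \in [t_0,\, t_0 + 24\text{h}] : u_{\text{mem}}(t) > r_{\text{mem}}\,\big] \le 1\%$.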
2.4 Pros and Cons of VPA
2.4.1 Pros
- Pods request exactly what they need, so cluster nodes are used efficiently.
- Pods are scheduled onto nodes that have the appropriate resources available.
- You do not have to run benchmark jobs to determine suitable CPU and memory request values.
- VPA can adjust CPU and memory requests at any time without manual intervention, reducing maintenance effort.
2.4.2 Cons
- VPA maturity is still limited. Updating the resource configuration of running Pods is an experimental VPA feature; it causes Pods to be rebuilt and restarted, and they may be rescheduled onto other nodes.
- VPA does not evict Pods that are not managed by a replica controller. Currently VPA cannot run alongside a Horizontal Pod Autoscaler (HPA) that scales on CPU or memory metrics; it can only be combined with an HPA that uses custom or external metrics (see the example HPA after this list).
- VPA uses an admission webhook as its admission controller. If there are other admission webhooks in the cluster, make sure they do not conflict with VPA. The execution order of admission controllers is defined by the API server's configuration parameters.
- VPA handles the vast majority of OOM events, but is not guaranteed to work in every scenario.
- VPA performance has not been tested on large clusters.
- VPA may set Pod resource requests to values that exceed the actually available resources, such as node capacity, free resources, or resource quotas, leaving the Pod Pending and unschedulable.
  - Using the Cluster Autoscaler alongside VPA mitigates this problem to some extent.
- Multiple VPA objects matching the same Pod results in undefined behavior.
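For reference, a sketch of an HPA that would be safe to combine with VPA, because it scales on a Pods-type custom metric rather than on CPU/memory (the metric name http_requests_per_second is hypothetical and must be served by a metrics adapter in your cluster):
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: nginx-hpa
  namespace: vpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: nginx
  minReplicas: 2
  maxReplicas: 10
  metrics:
  - type: Pods
    pods:
      metric:
        name: http_requests_per_second   # hypothetical custom metric
      target:
        type: AverageValue
        averageValue: "100"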
2.4.3 Limitations
- Cannot be used together with an HPA that scales on CPU/memory metrics.
- Only Pods managed by a replica controller are supported, e.g. workloads that belong to a Deployment or StatefulSet.
2.5 In-Place Update of Pod Resources
Today VPA has to rebuild a Pod in order to adjust resource requests, which is a significant limitation: frequently rebuilding Pods can affect service stability.
An In-Place Update of Pod Resources feature was proposed in the community back in 2019; see #1287 for the latest progress. According to the issue, the Alpha version could land in Kubernetes v1.26 at the earliest.
Once that feature is implemented it will be a major improvement for VPA, since constantly rebuilding Pods in a disruptive way always carries risk.
2.6 VPA Update Modes
- "Auto": VPA assigns resource requests when Pods are created and updates them on existing Pods using the preferred update mechanism. Currently this is equivalent to "Recreate" (see below). Once restart-free ("in-place") updates of Pod requests become available, "Auto" may use them as its preferred update mechanism. Note: this VPA feature is experimental and may cause downtime for your application. When a running Pod's resources fall short of the VPA recommendation, the Pod is evicted and redeployed with sufficient resources.
- "Recreate": VPA assigns resource requests when Pods are created and updates them on existing Pods by evicting them when the requested resources differ significantly from the new recommendation (respecting the Pod Disruption Budget, if defined). This mode should rarely be used; choose it only if you need to ensure Pods are restarted whenever the resource request changes. Otherwise prefer "Auto", which will be able to take advantage of restart-free updates once they are available. Note: this VPA feature is experimental and may cause downtime for your application.
- "Initial": VPA assigns resource requests only when Pods are created and never changes them afterwards.
- "Off": VPA does not change the Pods' resource requirements automatically. The recommendations are still calculated and can be inspected on the VPA object; this mode only produces recommendations without updating Pods. (Whichever mode you pick, it is set in updatePolicy; see the snippet after this list.)
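A minimal sketch of where the mode is configured (all other VPA fields omitted):
spec:
  updatePolicy:
    updateMode: "Off"    # one of "Auto", "Recreate", "Initial", "Off"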
3 Testing VPA
3.1 Deploy metrics-server
3.1.1 Download the deployment manifest
$ wget https://github.com/kubernetes-sigs/metrics-server/releases/download/v0.3.7/components.yaml
3.1.2 Modify components.yaml
$ cat components.yaml
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: system:aggregated-metrics-reader
  labels:
    rbac.authorization.k8s.io/aggregate-to-view: "true"
    rbac.authorization.k8s.io/aggregate-to-edit: "true"
    rbac.authorization.k8s.io/aggregate-to-admin: "true"
rules:
- apiGroups: ["metrics.k8s.io"]
  resources: ["pods", "nodes"]
  verbs: ["get", "list", "watch"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: metrics-server:system:auth-delegator
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: system:auth-delegator
subjects:
- kind: ServiceAccount
  name: metrics-server
  namespace: kube-system
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: metrics-server-auth-reader
  namespace: kube-system
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: extension-apiserver-authentication-reader
subjects:
- kind: ServiceAccount
  name: metrics-server
  namespace: kube-system
---
apiVersion: apiregistration.k8s.io/v1
kind: APIService
metadata:
  name: v1beta1.metrics.k8s.io
spec:
  service:
    name: metrics-server
    namespace: kube-system
  group: metrics.k8s.io
  version: v1beta1
  insecureSkipTLSVerify: true
  groupPriorityMinimum: 100
  versionPriority: 100
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: metrics-server
  namespace: kube-system
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: metrics-server
  namespace: kube-system
  labels:
    k8s-app: metrics-server
spec:
  selector:
    matchLabels:
      k8s-app: metrics-server
  template:
    metadata:
      name: metrics-server
      labels:
        k8s-app: metrics-server
    spec:
      serviceAccountName: metrics-server
      volumes:
      # mount in tmp so we can safely use from-scratch images and/or read-only containers
      - name: tmp-dir
        emptyDir: {}
      containers:
      - name: metrics-server
        image: registry.aliyuncs.com/google_containers/metrics-server:v0.3.7
        imagePullPolicy: IfNotPresent
        args:
        - --cert-dir=/tmp
        - --secure-port=4443
        - /metrics-server
        - --kubelet-insecure-tls
        - --kubelet-preferred-address-types=InternalIP
        ports:
        - name: main-port
          containerPort: 4443
          protocol: TCP
        securityContext:
          readOnlyRootFilesystem: true
          runAsNonRoot: true
          runAsUser: 1000
        volumeMounts:
        - name: tmp-dir
          mountPath: /tmp
      nodeSelector:
        kubernetes.io/os: linux
---
apiVersion: v1
kind: Service
metadata:
  name: metrics-server
  namespace: kube-system
  labels:
    kubernetes.io/name: "Metrics-server"
    kubernetes.io/cluster-service: "true"
spec:
  selector:
    k8s-app: metrics-server
  ports:
  - port: 443
    protocol: TCP
    targetPort: main-port
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: system:metrics-server
rules:
- apiGroups:
  - ""
  resources:
  - pods
  - nodes
  - nodes/stats
  - namespaces
  - configmaps
  verbs:
  - get
  - list
  - watch
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: system:metrics-server
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: system:metrics-server
subjects:
- kind: ServiceAccount
  name: metrics-server
  namespace: kube-system
- The image was changed to an internally reachable mirror: registry.aliyuncs.com/google_containers/metrics-server:v0.3.7
- The metrics-server startup args were modified.
3.1.3 Apply the manifest
$ kubectl apply -f components.yaml
3.1.4 Verify
$ kubectl get po -n kube-system
NAME                                       READY   STATUS    RESTARTS   AGE
calico-kube-controllers-5d4b78db86-4wpbx   1/1     Running   0          24d
calico-kube-controllers-5d4b78db86-cdcx6   1/1     Running   0          23d
calico-kube-controllers-5d4b78db86-gmvg5   1/1     Running   0          24d
calico-kube-controllers-5d4b78db86-qfmzm   1/1     Running   0          24d
calico-kube-controllers-5d4b78db86-srrxj   1/1     Running   0          24d
calico-node-f5s6w                          1/1     Running   1          24d
calico-node-f6pmk                          1/1     Running   0          24d
calico-node-jk7zc                          1/1     Running   0          24d
calico-node-p2c7d                          1/1     Running   7          24d
calico-node-v8z5x                          1/1     Running   0          24d
coredns-59d64cd4d4-85h7g                   1/1     Running   0          24d
coredns-59d64cd4d4-tll9s                   1/1     Running   0          23d
coredns-59d64cd4d4-zr4hd                   1/1     Running   0          24d
etcd-ops-master-1                          1/1     Running   8          24d
etcd-ops-master-2                          1/1     Running   3          24d
etcd-ops-master-3                          1/1     Running   0          24d
kube-apiserver-ops-master-1                1/1     Running   9          24d
kube-apiserver-ops-master-2                1/1     Running   8          24d
kube-apiserver-ops-master-3                1/1     Running   0          24d
kube-controller-manager-ops-master-1       1/1     Running   9          24d
kube-controller-manager-ops-master-2       1/1     Running   3          24d
kube-controller-manager-ops-master-3       1/1     Running   1          24d
kube-proxy-cxjz8                           1/1     Running   0          24d
kube-proxy-dhjxj                           1/1     Running   8          24d
kube-proxy-rm64j                           1/1     Running   0          24d
kube-proxy-xg6bp                           1/1     Running   0          24d
kube-proxy-zcvzs                           1/1     Running   1          24d
kube-scheduler-ops-master-1                1/1     Running   9          24d
kube-scheduler-ops-master-2                1/1     Running   4          24d
kube-scheduler-ops-master-3                1/1     Running   1          24d
metrics-server-54cc454bdd-ds4zp            1/1     Running   0          34s

$ kubectl top nodes
W0110 18:58:33.14156 top_node.go:119] Using json format to get metrics. Next release will switch to protocol-buffers, switch early by passing --use-protocol-buffers flag
NAME           CPU(cores)   CPU%   MEMORY(bytes)   MEMORY%
ops-master-1   82m          1%     1212Mi          8%
ops-master-2   98m          1%     2974Mi          19%
ops-master-3   106m         1%     2666Mi          17%
ops-worker-1   55m          0%     2014Mi          13%
ops-worker-2   59m          0%     2011Mi          13%
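Besides kubectl top, you can also query the aggregated Metrics API directly to confirm that metrics-server is serving data (a quick sanity check; the output is raw JSON):
$ kubectl get --raw "/apis/metrics.k8s.io/v1beta1/nodes"
$ kubectl get --raw "/apis/metrics.k8s.io/v1beta1/namespaces/kube-system/pods"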
3.2 Deploy vertical-pod-autoscaler
VPA compatibility with Kubernetes versions is as follows:
| VPA version | Kubernetes version |
|---|---|
| 0.12 | 1.25+ |
| 0.11 | 1.22 – 1.24 |
| 0.10 | 1.22+ |
| 0.9 | 1.16+ |
The cluster here runs Kubernetes 1.23.6, so according to the compatibility table it needs VPA 0.11.
3.2.1 Clone the autoscaler project
$ git clone -b vpa-release-0.11 https://github.com/kubernetes/autoscaler.git
3.2.2 Replace the images
For mirrors of k8s.gcr.io and gcr.io images, see https://github.com/anjia0532/gcr.io_mirror. Usage:
# Fix for images that cannot be pulled directly

# Original images
image: k8s.gcr.io/autoscaling/vpa-updater:0.10.0
image: k8s.gcr.io/autoscaling/vpa-recommender:0.10.0
image: k8s.gcr.io/autoscaling/vpa-admission-controller:0.10.0

# Pull the mirrored images
docker pull anjia0532/google-containers.autoscaling.vpa-updater:0.10.0
docker pull anjia0532/google-containers.autoscaling.vpa-recommender:0.10.0
docker pull anjia0532/google-containers.autoscaling.vpa-admission-controller:0.10.0

# Re-tag them to the original names
docker tag anjia0532/google-containers.autoscaling.vpa-admission-controller:0.10.0 k8s.gcr.io/autoscaling/vpa-admission-controller:0.10.0
docker tag anjia0532/google-containers.autoscaling.vpa-recommender:0.10.0 k8s.gcr.io/autoscaling/vpa-recommender:0.10.0
docker tag anjia0532/google-containers.autoscaling.vpa-updater:0.10.0 k8s.gcr.io/autoscaling/vpa-updater:0.10.0

# Remove the downloaded mirror tags
docker rmi anjia0532/google-containers.autoscaling.vpa-updater:0.10.0
docker rmi anjia0532/google-containers.autoscaling.vpa-recommender:0.10.0
docker rmi anjia0532/google-containers.autoscaling.vpa-admission-controller:0.10.0
Alternatively, point the manifests at a domestic registry instead of gcr:
- admission-controller-deployment.yaml: change us.gcr.io/k8s-artifacts-prod/autoscaling/vpa-admission-controller:0.8.0 to scofield/vpa-admission-controller:0.8.0
- recommender-deployment.yaml: change us.gcr.io/k8s-artifacts-prod/autoscaling/vpa-recommender:0.8.0 to scofield/vpa-recommender:0.8.0
- updater-deployment.yaml: change us.gcr.io/k8s-artifacts-prod/autoscaling/vpa-updater:0.8.0 to scofield/vpa-updater:0.8.0
3.2.3 Deploy
$ cd autoscaler/vertical-pod-autoscaler
$ ./hack/vpa-up.sh
customresourcedefinition.apiextensions.k8s.io/verticalpodautoscalers.autoscaling.k8s.io created
customresourcedefinition.apiextensions.k8s.io/verticalpodautoscalercheckpoints.autoscaling.k8s.io created
clusterrole.rbac.authorization.k8s.io/system:metrics-reader created
clusterrole.rbac.authorization.k8s.io/system:vpa-actor created
clusterrole.rbac.authorization.k8s.io/system:vpa-checkpoint-actor created
clusterrole.rbac.authorization.k8s.io/system:evictioner created
clusterrolebinding.rbac.authorization.k8s.io/system:metrics-reader created
clusterrolebinding.rbac.authorization.k8s.io/system:vpa-actor created
clusterrolebinding.rbac.authorization.k8s.io/system:vpa-checkpoint-actor created
clusterrole.rbac.authorization.k8s.io/system:vpa-target-reader created
clusterrolebinding.rbac.authorization.k8s.io/system:vpa-target-reader-binding created
clusterrolebinding.rbac.authorization.k8s.io/system:vpa-evictionter-binding created
serviceaccount/vpa-admission-controller created
clusterrole.rbac.authorization.k8s.io/system:vpa-admission-controller created
clusterrolebinding.rbac.authorization.k8s.io/system:vpa-admission-controller created
clusterrole.rbac.authorization.k8s.io/system:vpa-status-reader created
clusterrolebinding.rbac.authorization.k8s.io/system:vpa-status-reader-binding created
serviceaccount/vpa-updater created
deployment.apps/vpa-updater created
serviceaccount/vpa-recommender created
deployment.apps/vpa-recommender created
Generating certs for the VPA Admission Controller in /tmp/vpa-certs.
Generating RSA private key, 2048 bit long modulus (2 primes)
............................................................................+++++
.+++++
e is 65537 (0x010001)
Generating RSA private key, 2048 bit long modulus (2 primes)
............+++++
...........................................................................+++++
e is 65537 (0x010001)
Signature ok
subject=CN = vpa-webhook.kube-system.svc
Getting CA Private Key
Uploading certs to the cluster.
secret/vpa-tls-certs created
Deleting /tmp/vpa-certs.
deployment.apps/vpa-admission-controller created
service/vpa-webhook created
You may hit the error: ERROR: Failed to create CA certificate for self-signing. If the error is "unknown option -addext", update your openssl version or deploy VPA from the vpa-release-0.8 branch.
In that case, upgrade openssl to resolve it; for upgrade steps see: OpenSSL升级版本-CSDN博客.
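The -addext option requires OpenSSL 1.1.1 or newer, so a quick check of the installed version tells you whether the upgrade is actually needed:
$ openssl version    # should report 1.1.1 or newer for -addext support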
After upgrading openssl, run:
$ vertical-pod-autoscaler/pkg/admission-controller/gencerts.sh
3.2.4 Check the deployment
You can see that metrics-server and the VPA components are all running.
$ kubectl get po -n kube-system | grep -E "metrics-server|vpa"
metrics-server-5b58f4df77-f7nks             1/1     Running   0          35d
vpa-admission-controller-7ff888c959-tvtmk   1/1     Running   0          104m
vpa-recommender-74f69c56cb-zmzwg            1/1     Running   0          104m
vpa-updater-79b88f9c55-m4xx5                1/1     Running   0          103m
4 Validating VPA
4.1 updateMode: Off
4.1.1 Deploy an nginx test Pod
The nginx application is deployed into the namespace vpa.
$ vim nginx-deploy.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: nginx
  name: nginx
  namespace: vpa
spec:
  replicas: 2
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - image: nginx
        name: nginx
        resources:
          requests:
            cpu: 100m
            memory: 250Mi
$ kubectl apply -f nginx-deploy.yaml
$ kubectl get po -n vpa
NAME                     READY   STATUS    RESTARTS   AGE
nginx-59fdffd754-cb5dn   1/1     Running   0          8s
nginx-59fdffd754-cw8d7   1/1     Running   0          9s
4.1.2 Create a Service
Create a Service of type NodePort.
$ cat svc.yaml
apiVersion: v1
kind: Service
metadata:
name: nginx
namespace: vpa
spec:
type: NodePort
ports:
- port: 80
targetPort: 80
selector:
app: nginx
$ kubectl get svc -n vpa | grep nginx
nginx NodePort 10.255.253.166 <none> 80:30895/TCP 54s
$ curl -I 10.1.2.16:30895
HTTP/1.1 200 OK
Server: nginx/1.21.1
Date: Fri, 09 Jul 2021 09:54:58 GMT
Content-Type: text/html
Content-Length: 612
Last-Modified: Tue, 06 Jul 2021 14:59:17 GMT
Connection: keep-alive
ETag: "60e46fc5-264"
Accept-Ranges: bytes
4.1.3 Create the VPA
Use updateMode: "Off", which only produces resource recommendations and does not update the Pods.
$ cat nginx-vpa-demo.yaml
apiVersion: autoscaling.k8s.io/v1beta2
kind: VerticalPodAutoscaler
metadata:
  name: nginx-vpa
  namespace: vpa
spec:
  targetRef:
    apiVersion: "apps/v1"
    kind: Deployment
    name: nginx
  updatePolicy:
    updateMode: "Off"
  resourcePolicy:
    containerPolicies:
    - containerName: "nginx"
      minAllowed:
        cpu: "250m"
        memory: "100Mi"
      maxAllowed:
        cpu: "2000m"
        memory: "2048Mi"
$ kubectl apply -f nginx-vpa-demo.yaml
$ kubectl get vpa -n vpa
NAME        MODE   CPU   MEM   PROVIDED   AGE
nginx-vpa   Off                           7s
4.1.4 Inspect the VPA
$ kubectl describe vpa nginx-vpa -n vpa
Name:         nginx-vpa
Namespace:    vpa
Spec:
  Resource Policy:
    Container Policies:
      Container Name:  nginx
      Max Allowed:
        Cpu:     2000m
        Memory:  2048Mi
      Min Allowed:
        Cpu:     250m
        Memory:  100Mi
  Target Ref:
    API Version:  apps/v1
    Kind:         Deployment
    Name:         nginx
  Update Policy:
    Update Mode:  Off
Status:
  Conditions:
    Last Transition Time:  2021-07-09T09:59:50Z
    Status:                True
    Type:                  RecommendationProvided
  Recommendation:
    Container Recommendations:
      Container Name:  nginx
      Lower Bound:
        Cpu:     250m
        Memory:  k
      Target:
        Cpu:     250m
        Memory:  k
      Uncapped Target:
        Cpu:     25m
        Memory:  k
      Upper Bound:
        Cpu:     670m
        Memory:
Where:
- Lower Bound: the lower bound
- Target: the recommended value
- Upper Bound: the upper bound
- Uncapped Target: the recommendation that would be given if no minimum or maximum bounds were configured on the VPA
The output above shows that the uncapped recommended CPU request for the Pod is 25m (raised to 250m by minAllowed); the recommended memory request is reported in kilobytes (the exact value is not shown above).
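If you only need the recommendation itself rather than the full describe output, you can also pull it straight out of the VPA status with jsonpath (a sketch; the field path matches the status shown above):
$ kubectl get vpa nginx-vpa -n vpa -o jsonpath='{.status.recommendation.containerRecommendations[0].target}{"\n"}'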
4.1.5 Load-test nginx
# Run the load test
$ ab -c 100 -n  http://10.1.2.16:30895/
This is ApacheBench, Version 2.3 <$Revision: $>
Copyright 1996 Adam Twiss, Zeus Technology Ltd, http://www.zeustech.net/
Licensed to The Apache Software Foundation, http://www.apache.org/

Benchmarking 10.1.2.16 (be patient)
Completed  requests
Completed  requests
Completed  requests
4.1.6 Check the VPA recommendation again
$ kubectl describe vpa -n vpa nginx-vpa | tail -n 20
  Conditions:
    Last Transition Time:  2021-07-09T09:59:50Z
    Status:                True
    Type:                  RecommendationProvided
  Recommendation:
    Container Recommendations:
      Container Name:  nginx
      Lower Bound:
        Cpu:     250m
        Memory:  k
      Target:
        Cpu:     1643m
        Memory:  k
      Uncapped Target:
        Cpu:     1643m
        Memory:  k
      Upper Bound:
        Cpu:     2
        Memory:
Events:  <none>
The output shows that VPA now recommends Cpu: 1643m for the Pod. Because updateMode: "Off" is set, the Pod itself is not updated.
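To let VPA act on this recommendation you can either create a new VPA object in Auto mode, as done in the next section, or switch the existing object in place; a sketch using kubectl patch on the CRD:
$ kubectl patch vpa nginx-vpa -n vpa --type merge -p '{"spec":{"updatePolicy":{"updateMode":"Auto"}}}'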
4.2 updateMode: Auto
4.2.1 Set updateMode: Auto on the nginx VPA
$ cat nginx-vpa-demo.yaml
apiVersion: autoscaling.k8s.io/v1beta2
kind: VerticalPodAutoscaler
metadata:
  name: nginx-vpa-2
  namespace: vpa
spec:
  targetRef:
    apiVersion: "apps/v1"
    kind: Deployment
    name: nginx
  updatePolicy:
    updateMode: "Auto"
  resourcePolicy:
    containerPolicies:
    - containerName: "nginx"
      minAllowed:
        cpu: "250m"
        memory: "100Mi"
      maxAllowed:
        cpu: "2000m"
        memory: "2048Mi"
$ kubectl apply -f nginx-vpa-demo.yaml
$ kubectl get vpa -n vpa
NAME        MODE   CPU   MEM   PROVIDED   AGE
nginx-vpa   Off                           7s
4.2.2 Modify the nginx resource configuration
Change the resources to memory: 50Mi and cpu: 100m:
$ vim nginx-deploy.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: nginx
  name: nginx
  namespace: vpa
spec:
  replicas: 2
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - image: nginx
        name: nginx
        resources:
          requests:
            cpu: 100m
            memory: 50Mi
$ kubectl apply -f nginx-deploy.yaml
$ kubectl get po -n vpa
NAME                     READY   STATUS    RESTARTS   AGE
nginx-5594c66dc6-lzs67   1/1     Running   0          26s
nginx-5594c66dc6-zk6h9   1/1     Running   0          21s
4.2.3 Load-test again
$ ab -c 100 -n http://10.1.2.16:30895/
4.2.4 Inspect the VPA again
A few minutes later, check the VPA details with describe, again focusing only on Container Recommendations.
$ kubectl describe vpa nginx-vpa -n vpa | tail -n 20
  Conditions:
    Last Transition Time:  2021-07-09T09:59:50Z
    Status:                True
    Type:                  RecommendationProvided
  Recommendation:
    Container Recommendations:
      Container Name:  nginx
      Lower Bound:
        Cpu:     250m
        Memory:  k
      Target:
        Cpu:     1643m
        Memory:  k
      Uncapped Target:
        Cpu:     1643m
        Memory:  k
      Upper Bound:
        Cpu:     2
        Memory:
Events:  <none>
The Target has become Cpu: 1643m (the memory target is again reported in kilobytes).
4.2.5 Check the events
$ kubectl get event -n vpa
LAST SEEN   TYPE     REASON                 OBJECT                         MESSAGE
38s         Normal   Scheduled              pod/nginx-5594c66dc6-d8d6h     Successfully assigned vpa/nginx-5594c66dc6-d8d6h to 10.1.2.16
38s         Normal   Pulling                pod/nginx-5594c66dc6-d8d6h     Pulling image "nginx"
37s         Normal   Pulled                 pod/nginx-5594c66dc6-d8d6h     Successfully pulled image "nginx"
37s         Normal   Created                pod/nginx-5594c66dc6-d8d6h     Created container nginx
37s         Normal   Started                pod/nginx-5594c66dc6-d8d6h     Started container nginx
3m10s       Normal   Scheduled              pod/nginx-5594c66dc6-lzs67     Successfully assigned vpa/nginx-5594c66dc6-lzs67 to 10.1.2.15
3m9s        Normal   Pulling                pod/nginx-5594c66dc6-lzs67     Pulling image "nginx"
3m5s        Normal   Pulled                 pod/nginx-5594c66dc6-lzs67     Successfully pulled image "nginx"
3m5s        Normal   Created                pod/nginx-5594c66dc6-lzs67     Created container nginx
3m5s        Normal   Started                pod/nginx-5594c66dc6-lzs67     Started container nginx
99s         Normal   EvictedByVPA           pod/nginx-5594c66dc6-lzs67     Pod was evicted by VPA Updater to apply resource recommendation.
99s         Normal   Killing                pod/nginx-5594c66dc6-lzs67     Stopping container nginx
98s         Normal   Scheduled              pod/nginx-5594c66dc6-tdmnh     Successfully assigned vpa/nginx-5594c66dc6-tdmnh to 10.1.2.15
98s         Normal   Pulling                pod/nginx-5594c66dc6-tdmnh     Pulling image "nginx"
97s         Normal   Pulled                 pod/nginx-5594c66dc6-tdmnh     Successfully pulled image "nginx"
97s         Normal   Created                pod/nginx-5594c66dc6-tdmnh     Created container nginx
97s         Normal   Started                pod/nginx-5594c66dc6-tdmnh     Started container nginx
3m5s        Normal   Scheduled              pod/nginx-5594c66dc6-zk6h9     Successfully assigned vpa/nginx-5594c66dc6-zk6h9 to 10.1.2.17
3m4s        Normal   Pulling                pod/nginx-5594c66dc6-zk6h9     Pulling image "nginx"
3m          Normal   Pulled                 pod/nginx-5594c66dc6-zk6h9     Successfully pulled image "nginx"
2m59s       Normal   Created                pod/nginx-5594c66dc6-zk6h9     Created container nginx
2m59s       Normal   Started                pod/nginx-5594c66dc6-zk6h9     Started container nginx
39s         Normal   EvictedByVPA           pod/nginx-5594c66dc6-zk6h9     Pod was evicted by VPA Updater to apply resource recommendation.
39s         Normal   Killing                pod/nginx-5594c66dc6-zk6h9     Stopping container nginx
3m10s       Normal   SuccessfulCreate       replicaset/nginx-5594c66dc6    Created pod: nginx-5594c66dc6-lzs67
3m5s        Normal   SuccessfulCreate       replicaset/nginx-5594c66dc6    Created pod: nginx-5594c66dc6-zk6h9
99s         Normal   SuccessfulCreate       replicaset/nginx-5594c66dc6    Created pod: nginx-5594c66dc6-tdmnh
38s         Normal   SuccessfulCreate       replicaset/nginx-5594c66dc6    Created pod: nginx-5594c66dc6-d8d6h
35m         Normal   Scheduled              pod/nginx-59fdffd754-cb5dn     Successfully assigned vpa/nginx-59fdffd754-cb5dn to 10.1.2.16
35m         Normal   Pulling                pod/nginx-59fdffd754-cb5dn     Pulling image "nginx"
35m         Normal   Pulled                 pod/nginx-59fdffd754-cb5dn     Successfully pulled image "nginx"
35m         Normal   Created                pod/nginx-59fdffd754-cb5dn     Created container nginx
35m         Normal   Started                pod/nginx-59fdffd754-cb5dn     Started container nginx
3m5s        Normal   Killing                pod/nginx-59fdffd754-cb5dn     Stopping container nginx
35m         Normal   Scheduled              pod/nginx-59fdffd754-cw8d7     Successfully assigned vpa/nginx-59fdffd754-cw8d7 to 10.1.2.16
35m         Normal   Pulling                pod/nginx-59fdffd754-cw8d7     Pulling image "nginx"
35m         Normal   Pulled                 pod/nginx-59fdffd754-cw8d7     Successfully pulled image "nginx"
35m         Normal   Created                pod/nginx-59fdffd754-cw8d7     Created container nginx
35m         Normal   Started                pod/nginx-59fdffd754-cw8d7     Started container nginx
2m58s       Normal   Killing                pod/nginx-59fdffd754-cw8d7     Stopping container nginx
35m         Normal   SuccessfulCreate       replicaset/nginx-59fdffd754    Created pod: nginx-59fdffd754-cw8d7
35m         Normal   SuccessfulCreate       replicaset/nginx-59fdffd754    Created pod: nginx-59fdffd754-cb5dn
3m5s        Normal   SuccessfulDelete       replicaset/nginx-59fdffd754    Deleted pod: nginx-59fdffd754-cb5dn
2m58s       Normal   SuccessfulDelete       replicaset/nginx-59fdffd754    Deleted pod: nginx-59fdffd754-cw8d7
35m         Normal   ScalingReplicaSet      deployment/nginx               Scaled up replica set nginx-59fdffd754 to 2
34m         Normal   EnsuringService        service/nginx                  Deleted Loadbalancer
34m         Normal   EnsureServiceSuccess   service/nginx                  Service Sync Success. RetrunCode: S2000
3m10s       Normal   ScalingReplicaSet      deployment/nginx               Scaled up replica set nginx-5594c66dc6 to 1
3m5s        Normal   ScalingReplicaSet      deployment/nginx               Scaled down replica set nginx-59fdffd754 to 1
3m5s        Normal   ScalingReplicaSet      deployment/nginx               Scaled up replica set nginx-5594c66dc6 to 2
2m58s       Normal   ScalingReplicaSet      deployment/nginx               Scaled down replica set nginx-59fdffd754 to 0
The events show that VPA performed EvictedByVPA: it automatically stopped the old nginx Pods and started new ones with the VPA-recommended resources, which can be confirmed by describing one of the new nginx Pods.
$ kubectl describe po -n vpa nginx-5594c66dc6-d8d6h
Name:         nginx-5594c66dc6-d8d6h
Namespace:    vpa
Priority:     0
Node:         10.1.2.16/10.1.2.16
Start Time:   Fri, 09 Jul 2021 18:09:26 +0800
Labels:       app=nginx
              pod-template-hash=5594c66dc6
Annotations:  tke.cloud.tencent.com/networks-status:
                [{
                    "name": "tke-bridge",
                    "interface": "eth0",
                    "ips": [
                        "10.252.1.50"
                    ],
                    "mac": "e6:38:26:0b:c5:97",
                    "default": true,
                    "dns": {}
                }]
              vpaObservedContainers: nginx
              vpaUpdates: Pod resources updated by nginx-vpa: container 0: cpu request, memory request
Status:       Running
IP:           10.252.1.50
IPs:
  IP:           10.252.1.50
Controlled By:  ReplicaSet/nginx-5594c66dc6
Containers:
  nginx:
    Container ID:   docker://42e45f5f122ba658ed78a073cfe51534c773f9419afd6d1698ea
    Image:          nginx
    Image ID:       docker-pullable://nginx@sha256:8df46d7414eda82c2a8c9c811ae59fdda7d558b4125b
    Port:           <none>
    Host Port:      <none>
    State:          Running
      Started:      Fri, 09 Jul 2021 18:09:27 +0800
    Ready:          True
    Restart Count:  0
    Requests:
      cpu:        1643m
      memory:     k
    Environment:  <none>
    Mounts:
      /var/run/secrets/kubernetes.io/serviceaccount from default-token-m2j2z (ro)
Note the key part: Requests cpu: 1643m (memory is reported in kilobytes).
Compare this with the original deployment file:
requests:
  cpu: 100m
  memory: 50Mi
Now it is clear what VPA did. Of course, as the service load changes, the VPA recommendation keeps changing as well: whenever a running Pod's resources fall short of the VPA recommendation, the Pod is evicted and redeployed with sufficient resources.
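To keep an eye on what VPA has actually applied across the namespace, a one-liner listing every Pod's current requests is handy (a sketch using kubectl custom-columns; it only reads the first container of each Pod):
$ kubectl get po -n vpa -o custom-columns='NAME:.metadata.name,CPU_REQ:.spec.containers[0].resources.requests.cpu,MEM_REQ:.spec.containers[0].resources.requests.memory'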