[AEWS] #5์ฃผ์ฐจ HPA๋ฅผ ํ™œ์šฉํ•œ ์˜คํ† ์Šค์ผ€์ผ๋ง, ์ปค์Šคํ…€ ๋งคํŠธ๋ฆญ ํ™œ์šฉ (1)

25๋…„๋„ AWS EKS Hands-on Study ์Šคํ„ฐ๋”” ์ •๋ฆฌ ๋‚ด์šฉ์ž…๋‹ˆ๋‹ค.

 

์‚ฌ์ „ ํ™˜๊ฒฝ ์„ค์น˜

ํ”„๋กœ๋ฉ”ํ…Œ์šฐ์Šค, ๊ทธ๋ผํŒŒ๋‚˜ ์„ค์น˜

kubectl get targetgroupbindings.elbv2.k8s.aws -A
NAMESPACE     NAME                               SERVICE-NAME                       SERVICE-PORT   TARGET-TYPE   AGE
kube-system   k8s-kubesyst-kubeopsv-5ba48bb304   kube-ops-view                      8080           ip            18m
monitoring    k8s-monitori-kubeprom-1b3f611eba   kube-prometheus-stack-grafana      80             ip            17m
monitoring    k8s-monitori-kubeprom-1c15095430   kube-prometheus-stack-prometheus   9090           ip            17m

 

EKS Node Viewer

go install github.com/awslabs/eks-node-viewer/cmd/eks-node-viewer@latest

 

kubectl autoscale deployment php-apache --cpu-percent=50 --min=1 --max=10
horizontalpodautoscaler.autoscaling/php-apache autoscaled

kubectl describe hpa
Name:                                                  php-apache
Namespace:                                             default
Labels:                                                <none>
Annotations:                                           <none>
CreationTimestamp:                                     Thu, 06 Mar 2025 01:14:27 +0900
Reference:                                             Deployment/php-apache
Metrics:                                               ( current / target )
  resource cpu on pods  (as a percentage of request):  <unknown> / 50%
Min replicas:                                          1
Max replicas:                                          10
Deployment pods:                                       0 current / 0 desired
Events:                                                <none>

kubectl describe hpa
Name:                                                  php-apache
Namespace:                                             default
Labels:                                                <none>
Annotations:                                           <none>
CreationTimestamp:                                     Thu, 06 Mar 2025 01:14:27 +0900
Reference:                                             Deployment/php-apache
Metrics:                                               ( current / target )
  resource cpu on pods  (as a percentage of request):  <unknown> / 50%
Min replicas:                                          1
Max replicas:                                          10
Deployment pods:                                       0 current / 0 desired
Events:                                                <none>
[root@operator-host ~]# kubectl get hpa php-apache -o yaml | kubectl neat
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: php-apache
  namespace: default
spec:
  maxReplicas: 10
  metrics:
  - resource:
      name: cpu
      target:
        averageUtilization: 50
        type: Utilization
    type: Resource
  minReplicas: 1
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: php-apache

 

HPA๋ž€

์ฟ ๋ฒ„๋„คํ‹ฐ์Šค์—์„œ HPA(Horizontal Pod Autoscaler)๋Š” ์›Œํฌ๋กœ๋“œ์˜ ๋ถ€ํ•˜์— ๋”ฐ๋ผ ์ž๋™์œผ๋กœ ํŒŒ๋“œ(Pod)์˜ ๊ฐœ์ˆ˜๋ฅผ ์กฐ์ •ํ•˜๋Š” ์ปจํŠธ๋กค๋Ÿฌ์ด๋‹ค.

์ด๋ฅผ ํ†ตํ•ด ๋ฆฌ์†Œ์Šค๋ฅผ ํšจ์œจ์ ์œผ๋กœ ์‚ฌ์šฉํ•˜๊ณ  ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์˜ ์•ˆ์ •์„ฑ์„ ์œ ์ง€ํ•  ์ˆ˜ ์žˆ๋‹ค.

HPA๋Š” CPU ์‚ฌ์šฉ๋ฅ , ๋ฉ”๋ชจ๋ฆฌ ์‚ฌ์šฉ๋Ÿ‰ ๋˜๋Š” ์‚ฌ์šฉ์ž ์ •์˜ ๋ฉ”ํŠธ๋ฆญ(Custom Metrics)์„ ๊ธฐ๋ฐ˜์œผ๋กœ ๋™์ž‘ํ•˜๋ฉฐ ์ •์˜๋œ ๊ธฐ์ค€๊ฐ’์„ ์ดˆ๊ณผํ•˜๊ฑฐ๋‚˜ ๋ฏธ๋‹ฌํ•˜๋ฉด ํŒŒ๋“œ์˜ ๊ฐœ์ˆ˜๋ฅผ ๋™์ ์œผ๋กœ ์กฐ์ •ํ•œ๋‹ค.

 

HPA๊ฐ€ ๋™์ž‘ํ•˜๊ธฐ ์œ„ํ•ด ํ•„์š”ํ•œ ์ฃผ์š” ๊ตฌ์„ฑ ์š”์†Œ

  1. Metrics Server: ํŒŒ๋“œ์˜ ๋ฆฌ์†Œ์Šค ์‚ฌ์šฉ๋Ÿ‰์„ ์ˆ˜์ง‘ํ•˜๋Š” ์—ญํ• ์„ ํ•˜๋ฉฐ HPA๊ฐ€ ๋ฉ”ํŠธ๋ฆญ์„ ์กฐํšŒํ•  ์ˆ˜ ์žˆ๋„๋ก ๋ฐ์ดํ„ฐ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
  2. HPA ์ปจํŠธ๋กค๋Ÿฌ: ์ฟ ๋ฒ„๋„คํ‹ฐ์Šค ์ปจํŠธ๋กค๋Ÿฌ ๋งค๋‹ˆ์ € ๋‚ด์—์„œ ๋™์ž‘ํ•˜๋ฉฐ ์ฃผ๊ธฐ์ ์œผ๋กœ ๋ฉ”ํŠธ๋ฆญ์„ ์กฐํšŒํ•˜๊ณ  ์Šค์ผ€์ผ๋ง์„ ์ˆ˜ํ–‰ํ•œ๋‹ค.
  3. Target(๋Œ€์ƒ ๋ฆฌ์†Œ์Šค): Deployment, StatefulSet, ReplicaSet ๋“ฑ๊ณผ ๊ฐ™์€ ์ปจํŠธ๋กค๋Ÿฌ๊ฐ€ ๊ด€๋ฆฌํ•˜๋Š” ์›Œํฌ๋กœ๋“œ ๋ฆฌ์†Œ์Šค๋ฅผ ๋Œ€์ƒ์œผ๋กœ ํ•œ๋‹ค.

 

HPA ์‹ค์Šต

์ถœ์ฒ˜ https://blog.px.dev/autoscaling-custom-k8s-metric/

HPA(์ˆ˜ํ‰ Pod ์ž๋™ ํ™•์žฅ) ๊ฐ€ php-apache Deployment CPU์˜ ์‚ฌ์šฉ๋Ÿ‰์— ๋”ฐ๋ผ ์ž๋™์œผ๋กœ Pod ๊ฐœ์ˆ˜๋ฅผ ์กฐ์ •๋˜๋Š”์ง€ ํ™•์ธํ•ด๋ณด๋„๋กํ•œ๋‹ค.

 

์ƒ˜ํ”Œ php-apache ๋ฐฐํฌ

cat << EOF > php-apache.yaml
apiVersion: apps/v1
kind: Deployment
metadata: 
  name: php-apache
spec: 
  selector: 
    matchLabels: 
      run: php-apache
  template: 
    metadata: 
      labels: 
        run: php-apache
    spec: 
      containers: 
      - name: php-apache
        image: registry.k8s.io/hpa-example
        ports: 
        - containerPort: 80
        resources: 
          limits: 
            cpu: 500m
          requests: 
            cpu: 200m
---
apiVersion: v1
kind: Service
metadata: 
  name: php-apache
  labels: 
    run: php-apache
spec: 
  ports: 
  - port: 80
  selector: 
    run: php-apache
EOF
kubectl apply -f php-apache.yaml

 

HPA ์ ์šฉ

kubectl autoscale deployment php-apache --cpu-percent=50 --min=1 --max=10

 

CPU ์‚ฌ์šฉ๋ฅ ์ด ํŒŒ๋“œ์˜ ์š”์ฒญ ๊ฐ’ ๋Œ€๋น„ 50% ์ด์ƒ์ด ๋˜๋ฉด ์ž๋™์œผ๋กœ ํŒŒ๋“œ๋ฅผ ์ฆ์„คํ•˜๋Š” HPA ์ •์ฑ…์„ ์ƒ์„ฑํ•œ๋‹ค.

 

๋ถ€ํ•˜๋ฅผ ์ฃผ์ง€ ์•Š์•˜์„ ๋•Œ
๋ถ€ํ•˜๋ฅผ ์ฃผ์—ˆ์„ ๋•Œ

 

๋ถ€ํ•˜๋ฅผ ์ฃผ๊ธฐ ์‹œ์ž‘ํ•˜๋ฉด ํŒŒ๋“œ์˜ ๊ฐฏ์ˆ˜๊ฐ€ ๋Š˜์–ด๋‚˜๋Š”๋ฐ ์„ค์ •ํ•œ ํƒ€๊ฒŸ ๊ฐ’์ธ 50์„ ๋„˜์œผ๋ฉด ๋ ˆํ”Œ๋ฆฌ์นด ๊ฐฏ์ˆ˜๊ฐ€ ๋Š˜์–ด๋‚˜๋Š” ๊ฒƒ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค.

 

k describe hpa
Name:                                                  php-apache
Namespace:                                             default
Labels:                                                <none>
Annotations:                                           <none>
CreationTimestamp:                                     Thu, 06 Mar 2025 01:14:27 +0900
Reference:                                             Deployment/php-apache
Metrics:                                               ( current / target )
  resource cpu on pods  (as a percentage of request):  0% (1m) / 50%
Min replicas:                                          1
Max replicas:                                          10
Deployment pods:                                       6 current / 6 desired
Conditions:
  Type            Status  Reason               Message
  ----            ------  ------               -------
  AbleToScale     True    ScaleDownStabilized  recent recommendations were higher than current one, applying the highest recent recommendation
  ScalingActive   True    ValidMetricFound     the HPA was able to successfully calculate a replica count from cpu resource utilization (percentage of request)
  ScalingLimited  False   DesiredWithinRange   the desired count is within the acceptable range
Events:
  Type    Reason             Age    From                       Message
  ----    ------             ----   ----                       -------
  Normal  SuccessfulRescale  3m55s  horizontal-pod-autoscaler  New size: 4; reason: cpu resource utilization (percentage of request) above target
  Normal  SuccessfulRescale  3m40s  horizontal-pod-autoscaler  New size: 5; reason: cpu resource utilization (percentage of request) above target
  Normal  SuccessfulRescale  3m10s  horizontal-pod-autoscaler  New size: 6; reason: cpu resource utilization (percentage of request) above target

 

 

HPA๋Š” ์ตœ์†Œ 1๊ฐœ์—์„œ ์ตœ๋Œ€ 10๊ฐœ๊นŒ์ง€ ์ž๋™์œผ๋กœ Pod ๊ฐœ์ˆ˜๋ฅผ ์กฐ์ •ํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ ๋ถ€ํ•˜๋ฅผ ์ฃผ์–ด 6๊ฐœ๊นŒ์ง€ ์ฆ์„ค์ด ๋œ ๊ฒƒ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค.

๊ทธ๋ฆฌ๊ณ  ์‹œ๊ฐ„์ด ์กฐ๊ธˆ ๋” ์ง€๋‚˜๋‹ˆ ์ž๋™์œผ๋กœ ๊ฐ์†Œ๋˜์—ˆ๋‹ค

 

Events:
  Type    Reason             Age    From                       Message
  ----    ------             ----   ----                       -------
  Normal  SuccessfulRescale  13m    horizontal-pod-autoscaler  New size: 4; reason: cpu resource utilization (percentage of request) above target
  Normal  SuccessfulRescale  13m    horizontal-pod-autoscaler  New size: 5; reason: cpu resource utilization (percentage of request) above target
  Normal  SuccessfulRescale  13m    horizontal-pod-autoscaler  New size: 6; reason: cpu resource utilization (percentage of request) above target
  Normal  SuccessfulRescale  7m45s  horizontal-pod-autoscaler  New size: 3; reason: All metrics below target
  Normal  SuccessfulRescale  7m29s  horizontal-pod-autoscaler  New size: 1; reason: All metrics below target

 

์ปค์Šคํ…€ ๋งคํŠธ๋ฆญ

์ฟ ๋ฒ„๋„คํ‹ฐ์Šค์—์„œ๋Š” ๊ธฐ๋ณธ์ ์œผ๋กœ CPU ๋ฐ ๋ฉ”๋ชจ๋ฆฌ ์‚ฌ์šฉ๋Ÿ‰์„ ๊ธฐ๋ฐ˜์œผ๋กœ HPA๋ฅผ ๊ตฌ์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค.

ํ•˜์ง€๋งŒ, ๋„คํŠธ์›Œํฌ ํŠธ๋ž˜ํ”ฝ, ์š”์ฒญ ์ˆ˜, ์‚ฌ์šฉ์ž ์ •์˜ ๋ฉ”ํŠธ๋ฆญ ๋“ฑ์„ ๊ธฐ๋ฐ˜์œผ๋กœ Autoscaling์„ ์ˆ˜ํ–‰ํ•˜๋ ค๋ฉด Custom Metrics API๋ฅผ ํ™œ์šฉํ•ด์•ผํ•œ๋‹ค.

 

HPA๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ๊ฐ€์ ธ์˜ค๋Š” ๊ณผ์ •์€ ์•„๋ž˜์™€ ๊ฐ™๋‹ค.

 

  • Prometheus์—์„œ ๋ฉ”ํŠธ๋ฆญ ์ˆ˜์ง‘
    • Prometheus๊ฐ€ cadvisor ๋˜๋Š” kubelet์„ ํ†ตํ•ด container_network_receive_bytes_total ๋“ฑ์˜ ๋ฉ”ํŠธ๋ฆญ์„ ์ˆ˜์ง‘ํ•œ๋‹ค.
    • ์ด ๋ฉ”ํŠธ๋ฆญ์€ Pod๋ณ„ ์ˆ˜์‹ ๋œ ๋„คํŠธ์›Œํฌ ํŠธ๋ž˜ํ”ฝ(Bytes ๋‹จ์œ„)์ด๋ฉฐ, Counter ํƒ€์ž…์ด๋ฏ€๋กœ ๊ณ„์† ์ฆ๊ฐ€ํ•œ๋‹ค.
  • Prometheus Adapter๋ฅผ ํ†ตํ•ด Kubernetes Custom Metrics API์— ๋…ธ์ถœ
    • Prometheus Adapter๊ฐ€ config.yaml ์„ค์ •์„ ๊ธฐ๋ฐ˜์œผ๋กœ Prometheus์—์„œ container_network_receive_bytes_total ๊ฐ’์„ ๊ฐ€์ ธ์˜จ๋‹ค.
    • rate() ํ•จ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ดˆ๋‹น ์ฆ๊ฐ€๋Ÿ‰์„ ๊ณ„์‚ฐํ•˜๊ณ , /1024๋ฅผ ๋‚˜๋ˆ„์–ด KB ๋‹จ์œ„๋กœ ๋ณ€ํ™˜ํ•œ๋‹ค.
    • ๋ณ€ํ™˜๋œ ๊ฐ’์„ network-traffic-receive-per-second๋ผ๋Š” ์ด๋ฆ„์œผ๋กœ Kubernetes Custom Metrics API์— ๋…ธ์ถœํ•œ๋‹ค.
  • HPA๊ฐ€ Kubernetes Custom Metrics API์—์„œ ๋ฉ”ํŠธ๋ฆญ์„ ๊ฐ€์ ธ์˜ด
    • HPA๊ฐ€ kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1/namespaces/default/pods/*/network-traffic-receive-per-second" API๋ฅผ ํ˜ธ์ถœํ•˜์—ฌ ๊ฐ’์„ ์กฐํšŒํ•œ๋‹ค.
    • ํŠน์ • Pod์—์„œ ๋ฐœ์ƒํ•˜๋Š” ๋„คํŠธ์›Œํฌ ํŠธ๋ž˜ํ”ฝ ์–‘์„ ํ™•์ธํ•œ ํ›„, ์„ค์ •๋œ Target Value(์˜ˆ: 100KB/s)๋ฅผ ์ดˆ๊ณผํ•˜๋Š”์ง€ ํŒ๋‹จํ•œ๋‹ค.
  • HPA๊ฐ€ ์Šค์ผ€์ผ๋ง ํŒ๋‹จ
    • HPA๊ฐ€ current / target ๋น„์œจ์„ ๋น„๊ตํ•˜์—ฌ Pod ๊ฐœ์ˆ˜๋ฅผ ์กฐ์ •ํ•œ๋‹ค.
    • ๋„คํŠธ์›Œํฌ ํŠธ๋ž˜ํ”ฝ์ด ๊ธฐ์ค€๋ณด๋‹ค ๋†’์œผ๋ฉด ํŒŒ๋“œ๋ฅผ ์ฆ์„คํ•˜๊ณ  ๊ธฐ์ค€๋ณด๋‹ค ๋‚ฎ์œผ๋ฉด ํŒŒ๋“œ๋ฅผ ๊ฐ์†Œํ•œ๋‹ค.

hpa ์ˆ˜์ •

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: php-apache
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: php-apache
  minReplicas: 1
  maxReplicas: 10
  metrics:
  - type: Pods
    pods:
      metric:
        name: network-traffic-bytes-per-second
      target:
        type: AverageValue
        averageValue: 100k

 

k describe hpa php-apache
Name:                                          php-apache
Namespace:                                     default
Labels:                                        <none>
Annotations:                                   <none>
CreationTimestamp:                             Fri, 07 Mar 2025 01:24:37 +0900
Reference:                                     Deployment/php-apache
Metrics:                                       ( current / target )
  "network-traffic-bytes-per-second" on pods:  0 / 100k             # ์ปค์Šคํ…€ ๋งคํŠธ๋ฆญ ์ถ”๊ฐ€ํ•œ ๋ถ€๋ถ„
Min replicas:                                  1
Max replicas:                                  10
Deployment pods:                               1 current / 1 desired
Conditions:
  Type            Status  Reason               Message
  ----            ------  ------               -------
  AbleToScale     True    ScaleDownStabilized  recent recommendations were higher than current one, applying the highest recent recommendation
  ScalingActive   True    ValidMetricFound     the HPA was able to successfully calculate a replica count from pods metric network-traffic-bytes-per-second
  ScalingLimited  False   DesiredWithinRange   the desired count is within the acceptable range
Events:           <none>

prometheus adapter config

data:
  config.yaml: |
    rules:
    - seriesQuery: 'container_network_receive_bytes_total{namespace!="",pod!=""}'
      resources:
        overrides:
          namespace: {resource: "namespace"}
          pod: {resource: "pod"}
      name:
        matches: "^(.*)$"
        as: "network-traffic-bytes-per-second"
      metricsQuery: "sum(rate(<<.Series>>{<<.LabelMatchers>>}[1m])) by (pod)"
kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1" | jq .
{
    {
      "name": "pods/network-traffic-receive-per-second",
      "singularName": "",
      "namespaced": true,
      "kind": "MetricValueList",
      "verbs": [
        "get"
      ]
    },
    {
      "name": "namespaces/network-traffic-transmit-per-second",
      "singularName": "",
      "namespaced": false,
      "kind": "MetricValueList",
      "verbs": [
        "get"
      ]
    }
 }

 

ํ”„๋กœ๋ฉ”ํ…Œ์šฐ์Šค ์–ด๋Œ‘ํ„ฐ์— ์ถ”๊ฐ€ํ•ด์ฃผ๊ณ  ์ปค์Šคํ…€ ๋งคํŠธ๋ฆญ์ด ์ƒ์„ฑ๋˜์—ˆ๋Š”์ง€ ํ™•์ธํ•œ๋‹ค.

 

์ปค์Šคํ…€ ๋งคํŠธ๋ฆญ์ด ์กฐํšŒ๊ฐ€ ์•ˆ๋  ๋•Œ

kubectl get --raw /apis/custom.metrics.k8s.io/v1beta1 | jq .
{
  "kind": "APIResourceList",
  "apiVersion": "v1",
  "groupVersion": "custom.metrics.k8s.io/v1beta1",
  "resources": []
}

 

์ฒ˜์Œ ํ”„๋กœ๋ฉ”ํ…Œ์šฐ์Šค ์–ด๋Œ‘ํ„ฐ์— ๋ฃฐ ์ถ”๊ฐ€ํ•ด์ฃผ๊ณ  ํŒŒ๋“œ๋ฅผ ์žฌ์‹œ์ž‘ํ–ˆ๋Š”๋ฐ ์ง€์†์ ์œผ๋กœ hpa์—์„œ ์ปค์Šคํ…€ ๋งคํŠธ๋ฆญ์ด unknown์œผ๋กœ ๋ณด์ด๋Š” ์ฆ์ƒ์ด ์žˆ์—ˆ๋‹ค.

https://github.com/kubernetes-sigs/prometheus-adapter/issues/118

 

Resources list is empty at /apis/custom.metrics.k8s.io/v1beta1/ · Issue #118 · kubernetes-sigs/prometheus-adapter

Hi there! I have the following metrics in Prometheus: http_requests_total{namespace!="",pod!=""} -> http_requests_total{endpoint="http",instance="100.96.3.10:8080",job="sample-app",namespace="defau...

github.com

 

์œ— ๋ถ„์ด ๋งํ•ด์ฃผ์‹ ๋Œ€๋กœ ํŒŒ๋“œ์— ์„ค์ •๋œ ํ”„๋กœ๋ฉ”ํ…Œ์šฐ์Šค url์„ ์‚ดํŽด๋ณด์•˜๋Š”๋ฐ ๋””ํดํŠธ๊ฐ’์œผ๋กœ ์ ์šฉ์ด ์ž˜๋ชป ๋ผ์žˆ์—ˆ๋‹ค.

 

Args:
      /adapter
      --secure-port=6443
      --cert-dir=/tmp/cert
      --prometheus-url=http://prometheus.default.svc:9090
      --metrics-relist-interval=1m
      --v=4
      --config=/etc/adapter/config.yaml

 

kubectl get svc -A | grep prometheus
monitoring    kube-prometheus-stack-prometheus                 ClusterIP   10.100.76.235    <none>        9090/TCP,8080/TCP              67m

 

๋””ํ”Œ๋กœ์ด๋จผํŠธ ์ˆ˜์ •ํ•˜๊ณ  ๋‹ค์‹œ ๋กค์•„์›ƒ ํ•˜์—ฌ ๋ฐฐํฌํ•˜์˜€๋”๋‹ˆ ์ •์ƒ์ ์œผ๋กœ ๊ฐ€์ ธ์˜ฌ ์ˆ˜ ์žˆ๊ฒŒ ๋˜์—ˆ๋‹ค.