25๋ ๋ AWS EKS Hands-on Study ์คํฐ๋ ์ ๋ฆฌ ๋ด์ฉ์ ๋๋ค.
์ฌ์ ํ๊ฒฝ ์ค์น
ํ๋ก๋ฉํ ์ฐ์ค, ๊ทธ๋ผํ๋ ์ค์น
kubectl get targetgroupbindings.elbv2.k8s.aws -A
NAMESPACE NAME SERVICE-NAME SERVICE-PORT TARGET-TYPE AGE
kube-system k8s-kubesyst-kubeopsv-5ba48bb304 kube-ops-view 8080 ip 18m
monitoring k8s-monitori-kubeprom-1b3f611eba kube-prometheus-stack-grafana 80 ip 17m
monitoring k8s-monitori-kubeprom-1c15095430 kube-prometheus-stack-prometheus 9090 ip 17m
EKS Node Viewer
go install github.com/awslabs/eks-node-viewer/cmd/eks-node-viewer@latest
kubectl autoscale deployment php-apache --cpu-percent=50 --min=1 --max=10
horizontalpodautoscaler.autoscaling/php-apache autoscaled
kubectl describe hpa
Name: php-apache
Namespace: default
Labels: <none>
Annotations: <none>
CreationTimestamp: Thu, 06 Mar 2025 01:14:27 +0900
Reference: Deployment/php-apache
Metrics: ( current / target )
resource cpu on pods (as a percentage of request): <unknown> / 50%
Min replicas: 1
Max replicas: 10
Deployment pods: 0 current / 0 desired
Events: <none>
kubectl describe hpa
Name: php-apache
Namespace: default
Labels: <none>
Annotations: <none>
CreationTimestamp: Thu, 06 Mar 2025 01:14:27 +0900
Reference: Deployment/php-apache
Metrics: ( current / target )
resource cpu on pods (as a percentage of request): <unknown> / 50%
Min replicas: 1
Max replicas: 10
Deployment pods: 0 current / 0 desired
Events: <none>
[root@operator-host ~]# kubectl get hpa php-apache -o yaml | kubectl neat
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
name: php-apache
namespace: default
spec:
maxReplicas: 10
metrics:
- resource:
name: cpu
target:
averageUtilization: 50
type: Utilization
type: Resource
minReplicas: 1
scaleTargetRef:
apiVersion: apps/v1
kind: Deployment
name: php-apache
HPA๋
์ฟ ๋ฒ๋คํฐ์ค์์ HPA(Horizontal Pod Autoscaler)๋ ์ํฌ๋ก๋์ ๋ถํ์ ๋ฐ๋ผ ์๋์ผ๋ก ํ๋(Pod)์ ๊ฐ์๋ฅผ ์กฐ์ ํ๋ ์ปจํธ๋กค๋ฌ์ด๋ค.
์ด๋ฅผ ํตํด ๋ฆฌ์์ค๋ฅผ ํจ์จ์ ์ผ๋ก ์ฌ์ฉํ๊ณ ์ ํ๋ฆฌ์ผ์ด์ ์ ์์ ์ฑ์ ์ ์งํ ์ ์๋ค.
HPA๋ CPU ์ฌ์ฉ๋ฅ , ๋ฉ๋ชจ๋ฆฌ ์ฌ์ฉ๋ ๋๋ ์ฌ์ฉ์ ์ ์ ๋ฉํธ๋ฆญ(Custom Metrics)์ ๊ธฐ๋ฐ์ผ๋ก ๋์ํ๋ฉฐ ์ ์๋ ๊ธฐ์ค๊ฐ์ ์ด๊ณผํ๊ฑฐ๋ ๋ฏธ๋ฌํ๋ฉด ํ๋์ ๊ฐ์๋ฅผ ๋์ ์ผ๋ก ์กฐ์ ํ๋ค.
HPA๊ฐ ๋์ํ๊ธฐ ์ํด ํ์ํ ์ฃผ์ ๊ตฌ์ฑ ์์
- Metrics Server: ํ๋์ ๋ฆฌ์์ค ์ฌ์ฉ๋์ ์์งํ๋ ์ญํ ์ ํ๋ฉฐ HPA๊ฐ ๋ฉํธ๋ฆญ์ ์กฐํํ ์ ์๋๋ก ๋ฐ์ดํฐ๋ฅผ ์ ๊ณตํ๋ค.
- HPA ์ปจํธ๋กค๋ฌ: ์ฟ ๋ฒ๋คํฐ์ค ์ปจํธ๋กค๋ฌ ๋งค๋์ ๋ด์์ ๋์ํ๋ฉฐ ์ฃผ๊ธฐ์ ์ผ๋ก ๋ฉํธ๋ฆญ์ ์กฐํํ๊ณ ์ค์ผ์ผ๋ง์ ์ํํ๋ค.
- Target(๋์ ๋ฆฌ์์ค): Deployment, StatefulSet, ReplicaSet ๋ฑ๊ณผ ๊ฐ์ ์ปจํธ๋กค๋ฌ๊ฐ ๊ด๋ฆฌํ๋ ์ํฌ๋ก๋ ๋ฆฌ์์ค๋ฅผ ๋์์ผ๋ก ํ๋ค.
HPA ์ค์ต
HPA(์ํ Pod ์๋ ํ์ฅ) ๊ฐ php-apache Deployment CPU์ ์ฌ์ฉ๋์ ๋ฐ๋ผ ์๋์ผ๋ก Pod ๊ฐ์๋ฅผ ์กฐ์ ๋๋์ง ํ์ธํด๋ณด๋๋กํ๋ค.
์ํ php-apache ๋ฐฐํฌ
cat << EOF > php-apache.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
name: php-apache
spec:
selector:
matchLabels:
run: php-apache
template:
metadata:
labels:
run: php-apache
spec:
containers:
- name: php-apache
image: registry.k8s.io/hpa-example
ports:
- containerPort: 80
resources:
limits:
cpu: 500m
requests:
cpu: 200m
---
apiVersion: v1
kind: Service
metadata:
name: php-apache
labels:
run: php-apache
spec:
ports:
- port: 80
selector:
run: php-apache
EOF
kubectl apply -f php-apache.yaml
HPA ์ ์ฉ
kubectl autoscale deployment php-apache --cpu-percent=50 --min=1 --max=10
CPU ์ฌ์ฉ๋ฅ ์ด ํ๋์ ์์ฒญ ๊ฐ ๋๋น 50% ์ด์์ด ๋๋ฉด ์๋์ผ๋ก ํ๋๋ฅผ ์ฆ์คํ๋ HPA ์ ์ฑ ์ ์์ฑํ๋ค.
๋ถํ๋ฅผ ์ฃผ๊ธฐ ์์ํ๋ฉด ํ๋์ ๊ฐฏ์๊ฐ ๋์ด๋๋๋ฐ ์ค์ ํ ํ๊ฒ ๊ฐ์ธ 50์ ๋์ผ๋ฉด ๋ ํ๋ฆฌ์นด ๊ฐฏ์๊ฐ ๋์ด๋๋ ๊ฒ์ ํ์ธํ ์ ์๋ค.
k describe hpa
Name: php-apache
Namespace: default
Labels: <none>
Annotations: <none>
CreationTimestamp: Thu, 06 Mar 2025 01:14:27 +0900
Reference: Deployment/php-apache
Metrics: ( current / target )
resource cpu on pods (as a percentage of request): 0% (1m) / 50%
Min replicas: 1
Max replicas: 10
Deployment pods: 6 current / 6 desired
Conditions:
Type Status Reason Message
---- ------ ------ -------
AbleToScale True ScaleDownStabilized recent recommendations were higher than current one, applying the highest recent recommendation
ScalingActive True ValidMetricFound the HPA was able to successfully calculate a replica count from cpu resource utilization (percentage of request)
ScalingLimited False DesiredWithinRange the desired count is within the acceptable range
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal SuccessfulRescale 3m55s horizontal-pod-autoscaler New size: 4; reason: cpu resource utilization (percentage of request) above target
Normal SuccessfulRescale 3m40s horizontal-pod-autoscaler New size: 5; reason: cpu resource utilization (percentage of request) above target
Normal SuccessfulRescale 3m10s horizontal-pod-autoscaler New size: 6; reason: cpu resource utilization (percentage of request) above target
HPA๋ ์ต์ 1๊ฐ์์ ์ต๋ 10๊ฐ๊น์ง ์๋์ผ๋ก Pod ๊ฐ์๋ฅผ ์กฐ์ ํ ์ ์์ผ๋ฉฐ ๋ถํ๋ฅผ ์ฃผ์ด 6๊ฐ๊น์ง ์ฆ์ค์ด ๋ ๊ฒ์ ํ์ธํ ์ ์๋ค.
๊ทธ๋ฆฌ๊ณ ์๊ฐ์ด ์กฐ๊ธ ๋ ์ง๋๋ ์๋์ผ๋ก ๊ฐ์๋์๋ค
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal SuccessfulRescale 13m horizontal-pod-autoscaler New size: 4; reason: cpu resource utilization (percentage of request) above target
Normal SuccessfulRescale 13m horizontal-pod-autoscaler New size: 5; reason: cpu resource utilization (percentage of request) above target
Normal SuccessfulRescale 13m horizontal-pod-autoscaler New size: 6; reason: cpu resource utilization (percentage of request) above target
Normal SuccessfulRescale 7m45s horizontal-pod-autoscaler New size: 3; reason: All metrics below target
Normal SuccessfulRescale 7m29s horizontal-pod-autoscaler New size: 1; reason: All metrics below target
์ปค์คํ ๋งคํธ๋ฆญ
์ฟ ๋ฒ๋คํฐ์ค์์๋ ๊ธฐ๋ณธ์ ์ผ๋ก CPU ๋ฐ ๋ฉ๋ชจ๋ฆฌ ์ฌ์ฉ๋์ ๊ธฐ๋ฐ์ผ๋ก HPA๋ฅผ ๊ตฌ์ฑํ ์ ์๋ค.
ํ์ง๋ง, ๋คํธ์ํฌ ํธ๋ํฝ, ์์ฒญ ์, ์ฌ์ฉ์ ์ ์ ๋ฉํธ๋ฆญ ๋ฑ์ ๊ธฐ๋ฐ์ผ๋ก Autoscaling์ ์ํํ๋ ค๋ฉด Custom Metrics API๋ฅผ ํ์ฉํด์ผํ๋ค.
HPA๊ฐ ๋ฐ์ดํฐ๋ฅผ ๊ฐ์ ธ์ค๋ ๊ณผ์ ์ ์๋์ ๊ฐ๋ค.
- Prometheus์์ ๋ฉํธ๋ฆญ ์์ง
- Prometheus๊ฐ cadvisor ๋๋ kubelet์ ํตํด container_network_receive_bytes_total ๋ฑ์ ๋ฉํธ๋ฆญ์ ์์งํ๋ค.
- ์ด ๋ฉํธ๋ฆญ์ Pod๋ณ ์์ ๋ ๋คํธ์ํฌ ํธ๋ํฝ(Bytes ๋จ์)์ด๋ฉฐ, Counter ํ์ ์ด๋ฏ๋ก ๊ณ์ ์ฆ๊ฐํ๋ค.
- Prometheus Adapter๋ฅผ ํตํด Kubernetes Custom Metrics API์ ๋
ธ์ถ
- Prometheus Adapter๊ฐ config.yaml ์ค์ ์ ๊ธฐ๋ฐ์ผ๋ก Prometheus์์ container_network_receive_bytes_total ๊ฐ์ ๊ฐ์ ธ์จ๋ค.
- rate() ํจ์๋ฅผ ์ฌ์ฉํ์ฌ ์ด๋น ์ฆ๊ฐ๋์ ๊ณ์ฐํ๊ณ , /1024๋ฅผ ๋๋์ด KB ๋จ์๋ก ๋ณํํ๋ค.
- ๋ณํ๋ ๊ฐ์ network-traffic-receive-per-second๋ผ๋ ์ด๋ฆ์ผ๋ก Kubernetes Custom Metrics API์ ๋ ธ์ถํ๋ค.
- HPA๊ฐ Kubernetes Custom Metrics API์์ ๋ฉํธ๋ฆญ์ ๊ฐ์ ธ์ด
- HPA๊ฐ kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1/namespaces/default/pods/*/network-traffic-receive-per-second" API๋ฅผ ํธ์ถํ์ฌ ๊ฐ์ ์กฐํํ๋ค.
- ํน์ Pod์์ ๋ฐ์ํ๋ ๋คํธ์ํฌ ํธ๋ํฝ ์์ ํ์ธํ ํ, ์ค์ ๋ Target Value(์: 100KB/s)๋ฅผ ์ด๊ณผํ๋์ง ํ๋จํ๋ค.
- HPA๊ฐ ์ค์ผ์ผ๋ง ํ๋จ
- HPA๊ฐ current / target ๋น์จ์ ๋น๊ตํ์ฌ Pod ๊ฐ์๋ฅผ ์กฐ์ ํ๋ค.
- ๋คํธ์ํฌ ํธ๋ํฝ์ด ๊ธฐ์ค๋ณด๋ค ๋์ผ๋ฉด ํ๋๋ฅผ ์ฆ์คํ๊ณ ๊ธฐ์ค๋ณด๋ค ๋ฎ์ผ๋ฉด ํ๋๋ฅผ ๊ฐ์ํ๋ค.
hpa ์์
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
name: php-apache
spec:
scaleTargetRef:
apiVersion: apps/v1
kind: Deployment
name: php-apache
minReplicas: 1
maxReplicas: 10
metrics:
- type: Pods
pods:
metric:
name: network-traffic-bytes-per-second
target:
type: AverageValue
averageValue: 100k
k describe hpa php-apache
Name: php-apache
Namespace: default
Labels: <none>
Annotations: <none>
CreationTimestamp: Fri, 07 Mar 2025 01:24:37 +0900
Reference: Deployment/php-apache
Metrics: ( current / target )
"network-traffic-bytes-per-second" on pods: 0 / 100k # ์ปค์คํ
๋งคํธ๋ฆญ ์ถ๊ฐํ ๋ถ๋ถ
Min replicas: 1
Max replicas: 10
Deployment pods: 1 current / 1 desired
Conditions:
Type Status Reason Message
---- ------ ------ -------
AbleToScale True ScaleDownStabilized recent recommendations were higher than current one, applying the highest recent recommendation
ScalingActive True ValidMetricFound the HPA was able to successfully calculate a replica count from pods metric network-traffic-bytes-per-second
ScalingLimited False DesiredWithinRange the desired count is within the acceptable range
Events: <none>
prometheus adapter config
data:
config.yaml: |
rules:
- seriesQuery: 'container_network_receive_bytes_total{namespace!="",pod!=""}'
resources:
overrides:
namespace: {resource: "namespace"}
pod: {resource: "pod"}
name:
matches: "^(.*)$"
as: "network-traffic-bytes-per-second"
metricsQuery: "sum(rate(<<.Series>>{<<.LabelMatchers>>}[1m])) by (pod)"
kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1" | jq .
{
{
"name": "pods/network-traffic-receive-per-second",
"singularName": "",
"namespaced": true,
"kind": "MetricValueList",
"verbs": [
"get"
]
},
{
"name": "namespaces/network-traffic-transmit-per-second",
"singularName": "",
"namespaced": false,
"kind": "MetricValueList",
"verbs": [
"get"
]
}
}
ํ๋ก๋ฉํ ์ฐ์ค ์ด๋ํฐ์ ์ถ๊ฐํด์ฃผ๊ณ ์ปค์คํ ๋งคํธ๋ฆญ์ด ์์ฑ๋์๋์ง ํ์ธํ๋ค.
์ปค์คํ ๋งคํธ๋ฆญ์ด ์กฐํ๊ฐ ์๋ ๋
kubectl get --raw /apis/custom.metrics.k8s.io/v1beta1 | jq .
{
"kind": "APIResourceList",
"apiVersion": "v1",
"groupVersion": "custom.metrics.k8s.io/v1beta1",
"resources": []
}
์ฒ์ ํ๋ก๋ฉํ ์ฐ์ค ์ด๋ํฐ์ ๋ฃฐ ์ถ๊ฐํด์ฃผ๊ณ ํ๋๋ฅผ ์ฌ์์ํ๋๋ฐ ์ง์์ ์ผ๋ก hpa์์ ์ปค์คํ ๋งคํธ๋ฆญ์ด unknown์ผ๋ก ๋ณด์ด๋ ์ฆ์์ด ์์๋ค.
https://github.com/kubernetes-sigs/prometheus-adapter/issues/118
Resources list is empty at /apis/custom.metrics.k8s.io/v1beta1/ · Issue #118 · kubernetes-sigs/prometheus-adapter
Hi there! I have the following metrics in Prometheus: http_requests_total{namespace!="",pod!=""} -> http_requests_total{endpoint="http",instance="100.96.3.10:8080",job="sample-app",namespace="defau...
github.com
์ ๋ถ์ด ๋งํด์ฃผ์ ๋๋ก ํ๋์ ์ค์ ๋ ํ๋ก๋ฉํ ์ฐ์ค url์ ์ดํด๋ณด์๋๋ฐ ๋ํดํธ๊ฐ์ผ๋ก ์ ์ฉ์ด ์๋ชป ๋ผ์์๋ค.
Args:
/adapter
--secure-port=6443
--cert-dir=/tmp/cert
--prometheus-url=http://prometheus.default.svc:9090
--metrics-relist-interval=1m
--v=4
--config=/etc/adapter/config.yaml
kubectl get svc -A | grep prometheus
monitoring kube-prometheus-stack-prometheus ClusterIP 10.100.76.235 <none> 9090/TCP,8080/TCP 67m
๋ํ๋ก์ด๋จผํธ ์์ ํ๊ณ ๋ค์ ๋กค์์ ํ์ฌ ๋ฐฐํฌํ์๋๋ ์ ์์ ์ผ๋ก ๊ฐ์ ธ์ฌ ์ ์๊ฒ ๋์๋ค.
'Infra > AWS' ์นดํ ๊ณ ๋ฆฌ์ ๋ค๋ฅธ ๊ธ
[AEWS] #5์ฃผ์ฐจ KEDA, CAS ์ค์ต (2) (0) | 2025.03.07 |
---|---|
EKS cloudformation DELETE_IN_PROGRESS ์ํ์ผ ๋ ์กฐ์น (0) | 2025.03.07 |
[AEWS] #4์ฃผ์ฐจ ํ๋ก๋ฉํ ์ฐ์ค๋งคํธ๋ฆญ๊ณผ PromQL (4) (0) | 2025.03.02 |
[AEWS] #4์ฃผ์ฐจ ํ๋ก๋ฉํ ์ฐ์ค ์ค์ต (3) (1) | 2025.03.02 |
[AEWS] # 4์ฃผ์ฐจ kwatch ์ค์ต (2) (0) | 2025.03.01 |