As a Site Reliability Engineer with 17 years of experience, I’ve seen firsthand how crucial proper resource management is in Kubernetes environments. Today, we’re diving deep into setting resource requests and limits in Kubernetes – a fundamental skill for any SRE or DevOps engineer working with container orchestration.
Understanding Kubernetes Resource Management
Before we jump into the practical steps, let’s brush up on some key concepts:
- Pod Resource Usage: Pods consume resources from the nodes they’re scheduled on. Kubernetes needs to know how much CPU and memory a pod requires to make informed scheduling decisions.
- Requests vs. Limits:
- Requests are the amount of a resource the scheduler reserves for the container; the container is guaranteed at least this much.
- Limits are the maximum amount of a resource the container is allowed to use.
- Priority (QoS) Levels, which determine eviction order when a node runs low on resources:
- Guaranteed (high priority): Requests = Limits
- Burstable (medium priority): Requests ≠ Limits
- BestEffort (low priority): no Requests or Limits set
Under memory pressure, BestEffort pods are evicted first and Guaranteed pods last. You can confirm a pod's class with the command below.
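Once you have a pod running (we'll create one in Step 2), Kubernetes exposes the assigned class directly on the pod's status:
# Print the QoS class of the nginx pod (Guaranteed, Burstable, or BestEffort)
kubectl get pod nginx -o jsonpath='{.status.qosClass}'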
Practical Implementation
Let’s walk through the process of setting up and testing resource management in Kubernetes. We’ll use a local Kind cluster for this demonstration. I’ll provide both the direct commands and their corresponding Makefile aliases for each step.
Before we begin, make sure you have the following set up on your computer:
- Docker
- Kind (Kubernetes in Docker)
If you haven’t installed these tools yet, don’t worry! Check out the links in the video description for my tutorials on installing them on Windows, Mac, and Ubuntu.
Step 1: Create a Kind Cluster
First, let’s create a Kind cluster:
# Direct command
kind create cluster --name resources --config kind/kind-config.yaml
# Makefile alias
make create-cluster
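The kind/kind-config.yaml file comes from the companion repo. If you're following along without it, a minimal sketch of such a config (one control-plane node plus one worker, which is plenty for this demo) might look like this; the repo's actual file may differ:
# Hypothetical kind/kind-config.yaml
kind: Cluster
apiVersion: kind.x-k8s.io/v1alpha4
nodes:
- role: control-plane
- role: worker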
Step 2: Create and Apply a Pod with Resource Specifications
Let’s create a pod YAML file with resource specifications:
# Direct command
kubectl run nginx --image=nginx:latest --port=80 --dry-run=client -o yaml > deployment/app/pod.yaml
# Makefile alias
make create-pod-file
Now, modify the generated file to include resource requests and limits, and save it as deployment/app/pod-resources.yaml (the file we'll apply next). Here's an example:
apiVersion: v1
kind: Pod
metadata:
  name: nginx
spec:
  containers:
  - name: nginx
    image: nginx:latest
    resources:
      requests:
        memory: "64Mi"
        cpu: "250m"
      limits:
        memory: "128Mi"
        cpu: "500m"
In these units, 250m means 250 millicores (a quarter of a CPU core) and Mi is mebibytes. Apply the pod configuration:
# Direct command
kubectl apply -f deployment/app/pod-resources.yaml
# Makefile alias
make apply-pod
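It's worth confirming that the API server recorded the requests and limits we set:
# Print the resources block Kubernetes stored for the container
kubectl get pod nginx -o jsonpath='{.spec.containers[0].resources}'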
Step 3: Enable Metrics Server
To monitor resource usage, we need to enable the Metrics Server:
# Direct command
kubectl apply -f metrics/components.yaml
# Makefile alias
make enable-metrics
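One note from experience: on Kind, the stock Metrics Server often crash-loops because the kubelets serve self-signed certificates. The repo's metrics/components.yaml may already handle this; if yours doesn't, the common local-cluster workaround (never use this in production) is:
# Allow Metrics Server to scrape kubelets without verified TLS (local clusters only)
kubectl -n kube-system patch deployment metrics-server --type='json' \
  -p='[{"op": "add", "path": "/spec/template/spec/containers/0/args/-", "value": "--kubelet-insecure-tls"}]'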
Verify that the Metrics Server is running:
# Direct command
kubectl -n kube-system get pods | grep metrics-server
# Makefile alias
make check-metrics
Step 4: Monitor Resource Usage
Now, let’s monitor our resource usage:
# Monitor node resources
# Direct command
while true; do kubectl top nodes; sleep 2; done
# Makefile alias
make show-node-resource
# Monitor pod resources
# Direct command
while true; do kubectl top pod; sleep 2; done
# Makefile alias
make show-pod-resource
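Both loops run until you stop them with Ctrl+C. If you want a per-container breakdown instead of pod totals, kubectl top supports a flag for that:
# Show usage per container rather than per pod
kubectl top pod --containers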
Step 5: Test Resource Limits
To test our resource limits, we’ll deploy CPU and memory-intensive applications:
# Deploy CPU-intensive app
# Direct command
kubectl apply -f deployment/cpu/deployment.yaml
# Makefile alias
make create-cpu
# Deploy memory-intensive app
# Direct command
kubectl apply -f deployment/memory/deployment.yaml
# Makefile alias
make create-memory
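The repo's manifests aren't reproduced in this post. Conceptually, each is a Deployment plus a Service whose name matches the URL wrk targets later (http://cpu, http://memory). Here's a hypothetical sketch of deployment/cpu/deployment.yaml, assuming an nginx image and an app: cpu label; the repo's actual manifests may differ:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: cpu
spec:
  replicas: 1
  selector:
    matchLabels:
      app: cpu
  template:
    metadata:
      labels:
        app: cpu
    spec:
      containers:
      - name: cpu
        image: nginx:latest
        ports:
        - containerPort: 80
        resources:
          requests:
            cpu: "250m"
            memory: "64Mi"
          limits:
            cpu: "500m"
            memory: "128Mi"
---
apiVersion: v1
kind: Service
metadata:
  name: cpu
spec:
  selector:
    app: cpu
  ports:
  - port: 80
    targetPort: 80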
Now, let’s generate some load. In the wrk commands below, -c sets the number of connections, -t the number of threads, and -d the test duration in seconds:
# Create traffic generator
# Direct command
kubectl apply -f deployment/traffic/traffic-generator.yaml
# Makefile alias
make create-traffic
# Install wrk in the traffic generator
# Direct command
kubectl exec -it traffic-generator -- apk add --no-cache wrk
# Makefile alias
make add-app-traffic
# Generate load on CPU app
# Direct command
kubectl exec -it traffic-generator -- wrk -c 7 -t 7 -d 99999 -H "Connection: Close" http://cpu
# Makefile alias
make start-app-traffic-cpu
# Generate load on memory app
# Direct command
kubectl exec -it traffic-generator -- wrk -c 7 -t 7 -d 99999 -H "Connection: Close" http://memory
# Makefile alias
make start-app-traffic-memory
Monitor the resource usage as the load increases. You should see CPU and memory usage climb but never exceed the limits we set. (Each wrk run stays in the foreground for its full duration, so keep the monitoring loops from Step 4 running in a separate terminal.)
Step 6: Break the Limits
Now, let’s try to exceed our limits:
# Stress test CPU
# Direct command
kubectl exec -it traffic-generator -- wrk -c 100 -t 100 -d 99999 -H "Connection: Close" http://cpu
# Makefile alias
make break-cpu
# Stress test memory
# Direct command
kubectl exec -it traffic-generator -- wrk -c 7 -t 7 -d 99999 -H "Connection: Close" http://memory
# Makefile alias
make break-memory
Watch what happens when we hit our resource limits: the CPU app gets throttled (its usage flatlines at the limit while latency climbs), and the memory app's container is OOM-killed and restarted once it crosses its limit.
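To see the memory kill concretely, check the pods' restart counts and last termination reason. The app=memory label selector here is an assumption matching the sketch above; adjust it to whatever labels the repo's manifests actually use:
# RESTARTS climbing plus reason OOMKilled = the memory limit at work
kubectl get pods -w
# Hypothetical label selector; adjust to match the repo's manifests
kubectl get pod -l app=memory -o jsonpath='{.items[*].status.containerStatuses[*].lastState.terminated.reason}'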
Cleanup
After you’re done with the demonstration, you can clean up your environment:
# Delete the Kind cluster
# Direct command
kind delete cluster --name resources
# Makefile alias
make delete-all
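Alternatively, if you'd rather keep the cluster around for more experiments and remove only the demo workloads, kubectl can delete everything created from the manifest directory (assuming the deployment/ layout used throughout this post):
# Remove just the demo resources, leaving the cluster intact
kubectl delete -f deployment/ -R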
Conclusion
Setting appropriate resource requests and limits is crucial for maintaining a stable and efficient Kubernetes cluster. It helps prevent resource contention, ensures fair resource allocation, and can even help with cost optimization in cloud environments.
Remember:
- Always set both requests and limits for critical workloads.
- Monitor your actual usage and adjust accordingly.
- Be cautious with memory limits: exceeding them gets the container OOM-killed.
- CPU limits only throttle performance; they don't terminate pods.
By mastering resource management, you’re taking a big step toward becoming a Kubernetes expert. Happy clustering!