Klustered: Level Three

Welcome to Klustered: Level Three. The app is deployed, the cluster looks healthy at a glance, and yet nothing reaches you on port 30000. This level stacks three traps that reveal themselves one at a time, each surfacing only once you clear the one before it. Peel them back in order and the app comes alive, counting every visit.

The Goal

Open the App tab in the playground UI. Right now it's blank. You're done when the page loads and shows a visit counter that increases by one every time you refresh:

Rawkode Academy - Klustered: Level 3
Visits: 1

Refresh, and Visits should climb. That counter is backed by a persistent volume, so getting it to increment proves the whole chain is working: pod running, traffic routed, and the app able to read and write its data.

Two Ways to Play

Hard mode. Just get the counter climbing. No hints, no peeking.
Guided mode. Work through each trap step by step, with the concept behind it explained before the fix.

Pick your path below.

Getting Started

You'll be working from cplane-01. The admin kubeconfig is only readable by root, so the very first thing to do is switch users:

sudo su

Quick sanity check (you should see a node listed):

kubectl get nodes

If you get a permission or auth error, double-check you actually became root before running kubectl.

Hard Mode

You're on cplane-01. The klustered Deployment and its NodePort Service already exist, but curl http://localhost:30000 gives you nothing. There are three independent traps between you and a climbing counter, and they surface in layers: the Deployment can't get a pod running at all, then traffic won't reach the pod once it does, then the app can't read its own data.

Reach for the usual tools (kubectl get, describe, logs, get events), and don't trust the Deployment template alone: what's been admitted to the cluster may not match what you wrote. Keep the App tab open in another window.

Stop reading here if you want the full challenge.

Guided Mode

There are three planted bugs standing between you and a working counter. They surface in this order:

The Deployment can't get a pod created at all.
The pod runs, but nothing reaches it on 30000.
Traffic reaches the app, but the app can't read its data.

Each step gives you only the symptom and a question to sit with. Everything investigative (the commands to run, the concept behind the trap, and the fix) lives in the hint blocks, revealed in that order: where to look, then why, then how. Open them one at a time, and only when you're genuinely stuck.

Step 0: Get the Lay of the Land

Before changing anything, survey what exists. The usual first moves all work here:

kubectl get pods,svc,deploy
kubectl get events --sort-by=.lastTimestamp
curl -s http://localhost:30000

Read that output closely before touching anything: which objects exist, which are missing, and what is curl actually telling you? Each of the three steps below starts from one of these observations.
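Because -s also hides curl's own error messages, the exit code is often the most informative part of that last command. A minimal sketch, assuming a hypothetical helper named probe (it is not part of the lab), that prints the exit code and the HTTP status together:

```shell
# Hypothetical helper: report curl's exit code and the HTTP status for a URL.
# -m 5 caps the request at five seconds so a hung connection cannot block you.
probe() {
  code=$(curl -s -o /dev/null -m 5 -w '%{http_code}' "${1:-http://localhost:30000}")
  rc=$?
  echo "curl exit=$rc http=$code"
}
```

Call probe with no arguments to test the NodePort. Exit code 7 means the connection could not be established, 28 means the request timed out, and 0 with a 2xx status means the app answered.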

Step 1: A Deployment With No Pods

curl http://localhost:30000 is silent, and kubectl get pods comes back empty: not Pending, not CrashLoopBackOff, no pod at all. Yet the klustered Deployment is sitting right there.

Why would a healthy-looking Deployment produce zero pods, not even a pending one? Start there.

Hint 1: Where to Look

A Deployment with no pods means its ReplicaSet is trying to create them and being refused. The pod never reaches the scheduler, so there's nothing to kubectl logs or describe pod; the story lives one level up, in the ReplicaSet's own events.

kubectl get deploy,rs -l app=klustered
kubectl describe rs -l app=klustered

Look for a FailedCreate event (and a ReplicaFailure condition on the Deployment). It quotes the exact reason the API server rejected the pod. Read that message closely before opening the next hint.
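If the describe output is noisy, the same two signals can be pulled out directly. This is a sketch with two hypothetical helper names (failed_creates and replica_failure_message); it assumes you are working in the namespace that holds the klustered Deployment:

```shell
# Hypothetical helper: list only FailedCreate events, oldest first.
failed_creates() {
  kubectl get events \
    --field-selector reason=FailedCreate \
    --sort-by=.lastTimestamp
}

# Hypothetical helper: print the message of the Deployment's ReplicaFailure
# condition. Prints an empty line if the condition is not set.
replica_failure_message() {
  kubectl get deploy "${1:-klustered}" \
    -o jsonpath='{.status.conditions[?(@.type=="ReplicaFailure")].message}{"\n"}'
}
```

Run failed_creates first, then replica_failure_message; both should quote the same rejection text from the API server.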

Hint 2: The Concept

The rejection message is the whole clue:

Pod "klustered-..." is invalid: spec.containers[0].resources.requests:
  Invalid value: "128Mi": must be less than or equal to memory limit of 1Mi

The pod's memory request (128Mi, straight from the Deployment template) is larger than its limit (1Mi), and a request can never exceed its limit. But the template's limit is 256Mi; nobody wrote 1Mi. So something rewrote the pod at admission time, as it was being created.

That something is a MutatingAdmissionPolicy: it lets the API server rewrite incoming objects using CEL, no webhook server required. Find it with:

kubectl get mutatingadmissionpolicies,mutatingadmissionpolicybindings

This cluster has one called resource-enforcer, bound to the default namespace; on every pod CREATE it overwrites each container's memory limit to 1Mi while leaving the request untouched. The Deployment template is innocent: the mutation hits the pod, not the template, which is why every pod the ReplicaSet submits is rejected and you never see even a Pending one.
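The rejection itself comes down to one comparison: request must be less than or equal to limit. The sketch below reproduces that arithmetic in plain shell for the values in this step. to_bytes and request_fits_limit are hypothetical helpers, and to_bytes only understands the Ki, Mi, and Gi suffixes, not the full Kubernetes quantity grammar:

```shell
# Hypothetical helper: convert a binary memory quantity (Ki, Mi, Gi) to bytes.
# Anything without one of those suffixes is passed through as plain bytes.
to_bytes() {
  case "$1" in
    *Ki) echo $(( ${1%Ki} * 1024 )) ;;
    *Mi) echo $(( ${1%Mi} * 1024 * 1024 )) ;;
    *Gi) echo $(( ${1%Gi} * 1024 * 1024 * 1024 )) ;;
    *)   echo "$1" ;;
  esac
}

# Hypothetical helper: succeed only when request <= limit.
request_fits_limit() {
  [ "$(to_bytes "$1")" -le "$(to_bytes "$2")" ]
}

request_fits_limit 128Mi 256Mi && echo "template as written: valid"
request_fits_limit 128Mi 1Mi   || echo "after mutation: request exceeds limit, pod rejected"
```

The first call models the Deployment template as written; the second models the pod after the policy has overwritten its limit.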

Hint 3: The Fix

Remove the binding first, then the policy:

kubectl delete mutatingadmissionpolicybinding resource-enforcer-binding
kubectl delete mutatingadmissionpolicy resource-enforcer

The ReplicaSet has been retrying this whole time (with a backoff between attempts), so once the policy is gone its next attempt succeeds on its own. There is no pod to delete, because there never was one. Shortly afterwards a pod appears, keeps its real 256Mi limit, and reaches Running:

kubectl get pod -l app=klustered -o jsonpath='{.items[0].spec.containers[0].resources}{"\n"}'

(Impatient? kubectl rollout restart deploy klustered nudges the controller.) Now try curl http://localhost:30000 again: it still won't answer. On to the next layer.

Step 2: Connection Refused on 30000

The pod is finally Running, but curl http://localhost:30000 still fails, this time with Connection refused rather than silence. The traffic is reaching the pod; it's just landing somewhere nothing is listening.

If the routing works but the door is shut, which port is the app actually listening on, and how would you even find out when the container is FROM scratch (no shell, no ss, no netstat)?

Hint 1: Where to Look

Confirm the routing first, then start questioning the port:

kubectl get svc klustered -o yaml
kubectl get endpoints klustered
curl -v http://localhost:30000     # "Connection refused" = reached the pod, no listener on that port

The Service forwards to targetPort: 666, which matches the Deployment's containerPort: 666. That agreement looks reassuring, but ask yourself whether containerPort actually forces the process to bind 666.

Hint 2: The Concept (containerPort is a lie)

A Service has three port fields, and conflating them is the whole trap:

port: the port the Service itself is reachable on.
targetPort: the port on the pod that traffic is forwarded to.
nodePort: the port exposed on every node (for type: NodePort).

containerPort in the Deployment is documentation only. The kubelet does not enforce it, and the process is free to bind any port it likes. Here the binary listens on a random high port, while containerPort and targetPort were both left at 666, so kube-proxy faithfully forwards 30000 -> pod:666, a closed door. Nothing in kubectl get or describe will tell you the real port; you have to inspect the live socket.
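To see the declared ports side by side, something like the following can help. Both function names are hypothetical, and note what the output cannot tell you: neither command reports the port the process has actually bound.

```shell
# Hypothetical helper: print each Service port triple on its own line.
service_ports() {
  kubectl get svc "${1:-klustered}" \
    -o jsonpath='{range .spec.ports[*]}port={.port} targetPort={.targetPort} nodePort={.nodePort}{"\n"}{end}'
}

# Hypothetical helper: print the containerPort values declared in the pod template.
declared_container_ports() {
  kubectl get deploy "${1:-klustered}" \
    -o jsonpath='{.spec.template.spec.containers[*].ports[*].containerPort}{"\n"}'
}
```

If targetPort and containerPort agree and the connection is still refused, the declaration is consistent but the process is listening somewhere else.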

Hint 3: The Quick Way (ephemeral debug container)

You can't exec into a scratch image, but you can attach an ephemeral debug container that shares the target's network namespace, then run socket tools from there:

POD=$(kubectl get pod -l app=klustered -o jsonpath='{.items[0].metadata.name}')
kubectl debug -it "$POD" --image=nicolaka/netshoot --target=klustered -- ss -tlnp

ss shows both IPv4 and IPv6 listeners in one view, so the LISTEN row reveals the real port (it won't be 666). If the debug image can't be pulled, fall back to the node-side procfs method in the next hint, which needs no tools at all.

Hint 4: The Deep Way (procfs) and the tcp6 trap

Every listening socket is visible from the node in procfs, no tools required in the container. First find the container's host PID, then read its per-namespace socket tables:

CID=$(crictl ps --name klustered -q)
PID=$(crictl inspect "$CID" | jq -r .info.pid)
awk '$4=="0A"{print $2}' /proc/$PID/net/tcp     # state 0A = LISTEN; HEXIP:HEXPORT

Run that and you'll likely get nothing. That is the trap, and the most important lesson in this step:

A Go server started with http.ListenAndServe(":PORT", ...) binds the IPv6 wildcard [::], not 0.0.0.0. On a dual-stack Linux host that single AF_INET6 socket also accepts IPv4 connections, which is why IPv4 traffic still works, but it is listed in /proc/$PID/net/tcp6, not /proc/$PID/net/tcp. Look at only tcp and you'll wrongly conclude nothing is listening.

Always check both tables (and udp/udp6 for UDP services). This reads both and decodes the hex port to decimal:

for proto in tcp tcp6; do
  echo "== $proto =="
  awk '$4=="0A"{split($2,a,":"); print a[2]}' /proc/$PID/net/$proto \
    | while read -r hex; do printf '  listening on %d\n' "0x$hex"; done
done

tcp prints nothing; tcp6 prints the real port. Decoding by hand: in 00000000000000000000000000000000:1F90, the [::] address is all zeros and 1F90 hex = 8080. That decimal value is your targetPort.
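The hand decoding can be wrapped in a one-line function. decode_port is a hypothetical helper that takes a procfs local_address field (HEXIP:HEXPORT) and prints the port in decimal; the second call uses an IPv4-style address purely for illustration:

```shell
# Hypothetical helper: decode the port from a procfs local_address field
# (HEXIP:HEXPORT), such as the second column of /proc/<pid>/net/tcp6.
decode_port() {
  # Strip everything up to the last colon, then read the rest as hex.
  printf '%d\n' "0x${1##*:}"
}

decode_port 00000000000000000000000000000000:1F90   # the [::] example from the text
decode_port 00000000:029A                           # an IPv4 wildcard on port 666
```

Only the part after the last colon matters for the port, so the same function works for rows from both tcp and tcp6.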

Hint 5: The Fix

Point the Service's targetPort at the port you discovered (replace <PORT>):

kubectl patch svc klustered --type merge \
  -p '{"spec":{"ports":[{"port":666,"targetPort":<PORT>,"nodePort":30000}]}}'
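A merge patch replaces the whole ports list, so the command above has to restate port and nodePort correctly. As a narrower alternative, here is a sketch using a JSON patch that changes only targetPort. set_target_port is a hypothetical helper, and it assumes the Service keeps the relevant entry at index 0 of spec.ports:

```shell
# Hypothetical helper: replace only spec.ports[0].targetPort on the Service.
set_target_port() {
  # Refuse anything that is not a plain decimal number.
  case "$1" in
    ''|*[!0-9]*) echo "usage: set_target_port <numeric-port>" >&2; return 1 ;;
  esac
  kubectl patch svc klustered --type json \
    -p "[{\"op\":\"replace\",\"path\":\"/spec/ports/0/targetPort\",\"value\":$1}]"
}
```

Call it with the decimal port you discovered, for example set_target_port 8080 if that is what the socket table showed.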

Or kubectl edit svc klustered and set spec.ports[0].targetPort by hand. Confirm traffic now reaches the app:

curl -v http://localhost:30000

This time you'll get a reply from the app, but it isn't the page you wanted. One layer left.

Step 3: An Error Instead of the Page

Traffic reaches the app now, but instead of the page and counter you get a terse error:

curl -s http://localhost:30000
# Failed to read data

The app is up and reachable, yet it can't read its own data file. What identity is the process running as, and is that identity actually allowed to read the file on disk?

Hint 1: Where to Look

The data lives on a hostPath-backed PersistentVolume, and the file is definitely on disk, so this isn't a missing-file problem. Two things decide whether a read succeeds: the UID the process runs as, and the ownership/mode of the file. Look at both.

The identity comes from the container's securityContext:

kubectl get deploy klustered -o jsonpath='{.spec.template.spec.containers[0].securityContext}{"\n"}'
# {"runAsGroup":1000,"runAsUser":1000}

The ownership comes from the backing hostPath on the node:

ls -ldn /var/data/klustered
# drwx------ ... 0 0 ... /var/data/klustered   (owner UID 0, mode 700)

A process running as UID 1000 against a directory owned by UID 0 with mode 700. That can't end well.

Hint 2: The Concept

runAsUser: 1000 makes the container process run as UID 1000. With no user namespace in play, there is no UID remapping: a hostPath volume keeps whatever ownership it has on the node's disk, and the kernel checks the container's UID against those on-disk owners directly.

The data dir is root:root with mode 700 (owner-only rwx). UID 1000 is not the owner and not in the group, so it can't even traverse the directory, let alone read index.html. The app returns "Failed to read data".
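The decision the kernel makes here can be modelled in a few lines. This is a deliberately simplified sketch: can_traverse is a hypothetical helper that looks only at the owner, group, and other digits of a three-digit octal mode, and it ignores supplementary groups, ACLs, and root's capabilities.

```shell
# can_traverse UID GID OWNER GROUP MODE
# Simplified model of the check for the execute (search) bit on a directory.
can_traverse() {
  uid=$1 gid=$2 owner=$3 group=$4 mode=$5    # mode: three octal digits, e.g. 700
  if [ "$uid" -eq "$owner" ]; then
    digit=${mode%??}                         # owner digit
  elif [ "$gid" -eq "$group" ]; then
    digit=${mode#?}; digit=${digit%?}        # group digit
  else
    digit=${mode#??}                         # other digit
  fi
  [ $(( digit & 1 )) -eq 1 ]                 # bit 1 = execute/search
}

can_traverse 1000 1000 0 0 700 || echo "UID 1000 vs root:root 700: denied"
can_traverse 1000 1000 0 0 755 && echo "UID 1000 vs root:root 755: allowed"
```

Exactly one class applies to a given process: owner if the UID matches, otherwise group if the GID matches, otherwise other. UID 1000 falls into "other" here, and mode 700 gives that class nothing.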

This is one of the most common real-world Kubernetes failures: an image that assumed root, dropped onto a cluster that forces non-root (PodSecurity restricted, OpenShift's arbitrary UIDs, or a hardened securityContext), suddenly unable to touch its own volume.

Hint 3: The Fix (the right way)

The production-correct fix is to reconcile the volume's ownership with the UID the app runs as, using an init container that runs as root and chowns the data before the app starts:

kubectl patch deployment klustered --type json -p '[
  {"op":"add","path":"/spec/template/spec/initContainers","value":[
    {"name":"fix-perms","image":"busybox:1.36",
     "securityContext":{"runAsUser":0},
     "command":["sh","-c","chown -R 1000:1000 /data"],
     "volumeMounts":[{"name":"data","mountPath":"/data"}]}
  ]}
]'

The init container is root (UID 0), so it can chown the node's hostPath; the main container stays non-root and now owns its data. The Deployment rolls a new pod; once it's Running:

curl -s http://localhost:30000

You should now see the page and Visits: 1.

Hint 4: Other Fixes (and a trap)

Three more ways out, each teaching something:

fsGroup looks right but silently fails here. Adding spec.template.spec.securityContext.fsGroup: 1000 tells the kubelet to fix volume group ownership, but that mechanism does not apply to hostPath volumes; it only covers volume types that support ownership management, such as emptyDir and most CSI volumes. The read still fails, a good reminder that fsGroup is not a universal fix.
Run as root (kubectl patch deployment klustered --type json -p '[{"op":"remove","path":"/spec/template/spec/containers/0/securityContext"}]'). Works, because UID 0 owns the data, but it throws away the non-root posture. Quick, and usually the wrong call.
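Loosen perms on the node (chmod -R 755 /var/data/klustered). This clears the read error, but only when you control the node, which in the real world you usually don't. Note that 755 grants UID 1000 read and traverse access only; if the app also has to write the counter file, it still needs write access (for example via chown -R 1000:1000 on the node).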

The init-container chown in Hint 3 is the one that keeps the app non-root and doesn't depend on node access, which is why it's the production answer.

Step 4: Watch the Counter Climb

Everything's healthy: the pod runs, the Service routes to the right port, and the app can read and write its data. Flip to the App tab and refresh a few times.

curl -s http://localhost:30000
curl -s http://localhost:30000

Each request increments the counter and persists it to the volume, so Visits should climb 1, 2, 3, ... with every refresh. When the number goes up, you've won.
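If you would rather check from the terminal than eyeball the page, a sketch like this compares two consecutive responses. Both function names are hypothetical, and the parsing assumes the response contains the literal text "Visits: N" as shown in The Goal:

```shell
# Hypothetical helper: pull the first number that follows "Visits: " from stdin.
extract_visits() {
  sed -n 's/.*Visits: \([0-9][0-9]*\).*/\1/p' | head -n 1
}

# Hypothetical helper: succeed only if two consecutive requests show a rising count.
counter_climbs() {
  first=$(curl -s http://localhost:30000 | extract_visits)
  second=$(curl -s http://localhost:30000 | extract_visits)
  [ -n "$first" ] && [ -n "$second" ] && [ "$second" -gt "$first" ]
}
```

Run counter_climbs && echo "counter is climbing". A failure means either the page did not contain a count at all or the count did not increase between requests.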