Runbook: Pod OOM Killed
Alert: OpenClawPodOOMKilled
Meaning
An OpenClaw container was terminated by the Linux OOM killer because it exceeded its memory limit.
Impact
The pod restarts, causing temporary unavailability. Repeated OOM kills lead to crash-looping.
Diagnosis
# Confirm OOM kill
kubectl get pod <name>-0 -n <namespace> -o jsonpath='{.status.containerStatuses[0].lastState.terminated.reason}'
# Check current memory usage
kubectl top pod <name>-0 -n <namespace> --containers
# Check memory limits
kubectl get pod <name>-0 -n <namespace> -o jsonpath='{.spec.containers[0].resources.limits.memory}'
# Check if the Chromium sidecar is contributing to memory pressure
kubectl top pod <name>-0 -n <namespace> --containers
Mitigation
- Increase memory limits - Update
spec.resources.limits.memoryin the OpenClawInstance CR - Check Chromium sidecar - If enabled, the Chromium sidecar can be memory-hungry; set dedicated resource limits via
spec.chromium.resources - Check Ollama sidecar - LLM inference requires significant memory; ensure appropriate limits via
spec.ollama.resources - Reduce workload - Limit concurrent operations or large file processing
- Monitor trends - Use the Grafana instance dashboard to identify memory growth patterns