Exit Code 137: How My ML Pod Got Killed Before It Could Even Say Hello
📰 Medium · Python
Learn how Kubernetes probes and slow model loading can kill your ML pod before it's ready to serve, and why 'healthy at boot' isn't the same as 'ready to serve'
Action Steps
- Check Kubernetes probe configurations to ensure they align with model loading times
- Implement a readiness probe to distinguish between 'healthy at boot' and 'ready to serve'
- Optimize model loading to reduce startup time (e.g. lazy-load weights, use a faster serialization format, or bake the model into the container image)
- Test and validate probe configurations with Kubernetes debugging tools such as `kubectl describe pod` and `kubectl get events` to spot probe failures and restarts
- Give liveness probes enough slack for model loading, e.g. via a `startupProbe` or a higher `initialDelaySeconds`/`failureThreshold`
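The 'healthy at boot' vs. 'ready to serve' split from the steps above can be sketched as two separate probe endpoints in the serving process. This is a minimal illustration, not code from the article; the function names and the model-loading stand-in are assumptions:

```python
import threading

# Flag flipped once model weights are actually in memory.
model_ready = threading.Event()

def liveness() -> int:
    """Liveness probe: 200 as long as the process is alive,
    even while the model is still loading."""
    return 200

def readiness() -> int:
    """Readiness probe: 200 only after the model is loaded; 503 tells
    Kubernetes to keep the pod out of Service endpoints, without killing it."""
    return 200 if model_ready.is_set() else 503

def load_model() -> None:
    # Hypothetical stand-in for a slow load (e.g. reading a multi-GB
    # checkpoint from disk); replace with your real loading code.
    model_ready.set()
```

In a real server you would run `load_model` in a background thread at startup (`threading.Thread(target=load_model, daemon=True).start()`) so the probe endpoints answer immediately, point Kubernetes' `readinessProbe` at the readiness endpoint, and point the `livenessProbe` (or a `startupProbe`) at the liveness endpoint so a slow load never triggers a restart.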
Who Needs to Know This
DevOps and ML engineers can benefit from understanding how Kubernetes probes work and how to optimize model loading to prevent pod termination.
Key Insight
💡 'Healthy at boot' is not the same as 'ready to serve': Kubernetes probes can terminate (SIGKILL, exit code 137) pods whose models take too long to load
Share This
💡 Kubernetes probes can kill your ML pod if not configured correctly! Learn how to optimize model loading and probe configs to prevent termination #Kubernetes #ML
DeepCamp AI