Part 6 | Deploy ML Model on Kubernetes | Auto-Scaling with HPA and Monitoring with Prometheus
In this tutorial, we'll be deploying a machine learning service on Kubernetes, encompassing:
- Sentiment Analysis Model: Developed using Scikit-Learn.
- FastAPI-based REST API: For seamless model inference.
- Containerization: Using Docker or Podman.
- Kubernetes Deployment: Featuring auto-scaling with Horizontal Pod Autoscaler (HPA).
- Persistent Storage: Ensuring reliable management of model artifacts.
- Monitoring: Implemented with Prometheus for real-time insights.
This comprehensive guide is tailored for beginners eager to enhance their MLOps skills and gain practical experience in deploy…
Watch on YouTube ↗
(saves to browser)
DeepCamp AI