Déployer et adapter des modèles d'IA avec Cloud Run
AI inference is the process of using a trained machine learning model to make predictions on new, unseen data by applying learned patterns. This course is designed for developers, data scientists, and ML engineers interested in quickly deploying AI inference services on Cloud Run. It is useful for those familiar with cloud-based serverless application deployment solutions, but who may not have experience with running AI inference using Google Cloud serverless products.
The course includes examples that deploys a model for AI inference with GPUs and integrates gen AI apps with data storage services.
Watch on Coursera ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: Model Deployment
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
Python Programming Course in Delhi
Medium · Python
Choosing the Right Architecture: A Software Engineer’s Field Guide to Neural Networks
Medium · Data Science
Chandra OCR 2: When Open Source Reads What Others Miss
Medium · Machine Learning
The hidden value of teaching ML to Non-ML teams
Medium · Machine Learning
🎓
Tutor Explanation
DeepCamp AI