AI Systems Design

Design systems for LLM serving, inference optimisation, and vector databases at scale.

After this skill you can…

  • Design an LLM inference cluster with vLLM
  • Implement batching and caching strategies for LLM APIs
  • Architect a production RAG system for millions of queries

Watch (10 videos)

Architecting Scalable Cloud AI Infrastructure
Coursera · intermediate hands-on
→ Design scalable cloud AI infrastructure
→ Build resilient microservices

GenAI Model Development and Production Engineering
Coursera · advanced hands-on
→ Design production-ready AI systems
→ Transform AI prototypes into robust systems

Explore NVIDIA Metropolis AI-Powered Multi-Camera Tracking on AWS
NVIDIA Developer · intermediate hands-on
→ Deploy AI-powered tracking on cloud infrastructure
→ Configure multi-camera tracking workflow

Accelerate AI on NVIDIA RTX AI PCs with Windows ML | Microsoft Build 2025
NVIDIA Developer · intermediate hands-on
→ Build AI applications with Windows ML
→ Deploy AI models on NVIDIA RTX AI PCs

Generative AI web development with Angular
Google Cloud Tech · beginner hands-on
→ Build generative AI web apps
→ Deploy AI models to production

Dassault Systèmes 3DEXCITE accelerates product experiences with AWS cloud solutions
Amazon Web Services · advanced hands-on
→ Design cloud-based product development workflows
→ Deploy personalized customer experiences

End-to-End AI: From Development to Deployment
NVIDIA Developer · intermediate hands-on
→ Deploy AI applications at the edge
→ Manage AI systems with Fleet Command

How Nuro transformed end-to-end AI and data discovery with Google Cloud Consulting
Google Cloud · advanced hands-on
→ Design scalable infrastructure for AI
→ Implement data discovery solutions

Demo - Create Azure AD app to work with groups using Microsoft Graph
Microsoft 365 Developer · intermediate hands-on
→ Register Azure AD applications for Microsoft Graph
→ Submit requests to Microsoft Graph for group management

Deploy Resilient AI Microservices with LangChain
Coursera · intermediate hands-on
→ Design microservices architecture for AI apps
→ Deploy AI microservices with LangChain