Benchmarking Llama 3.1 405B on 8 x AMD MI300X using vLLM and KubeAI

Name: Benchmarking Llama 3.1 405B on 8 x AMD MI300X using vLLM and KubeAI
Uploaded: 2025-01-20T19:15:00+00:00
Channel: Samos123
Description: Blog: https://substratus.ai/blog/benchmarking-llama-3.1-405b-amd-mi300x Software used in video: vLLM for deploying LLMs: https://github.com/vllm-project...

Samos123 · Intermediate ·🧠 Large Language Models ·1y ago

Blog: https://substratus.ai/blog/benchmarking-llama-3.1-405b-amd-mi300x Software used in video: vLLM for deploying LLMs: https://github.com/vllm-project/vllm KubeAI for deploying vLLM on K8s: https://github.com/substratusai/kubeai

Watch on YouTube ↗ (saves to browser)

Next Up

5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems

Dave Ebbelaar (LLM Eng)