MLX Genmedia — Prince Canuma, Arcee

Name: MLX Genmedia — Prince Canuma, Arcee
Uploaded: 2026-05-11T13:00:06Z
Channel: AI Engineer
Description: MLX is an array framework for Apple Silicon, essentially PyTorch for your Mac, and this is a tour of what it can run: real-time vision models that descr...

AI Engineer · Intermediate ·📐 ML Fundamentals ·3d ago

Skills: ML Pipelines70%

MLX is an array framework for Apple Silicon, essentially PyTorch for your Mac, and this is a tour of what it can run: real-time vision models that describe the world around you, sub-100ms text-to-speech, speech-to-speech pipelines, omni models that take image and audio together, and video generation from a text prompt on 16GB of VRAM. A recent breakthrough called Turbo Quant cuts KV cache by 4x and gets 1M context running fully on device. The community projects include a native voice app, a robot speaking in real time with a cloned voice, and a system that chains video generations into a coherent story — all without a cloud call. The underlying argument: the cloud assumption doesn't hold everywhere. Not for someone in Africa on an unreliable connection. Not for a local agent that needs to stay on. Not for a robot that has to hear, see, and respond without phoning home. Speaker info: - https://x.com/Prince_Canuma - https://pl.linkedin.com/in/prince-canuma

Watch on YouTube ↗ (saves to browser)