No Cloud, No API Keys: Local Open-Source Coding with Trinity Mini, OpenCode, and MLX

Julien Simon · Intermediate ·🧠 Large Language Models ·2w ago
No API keys. No cloud. No per-token cost. Just your Mac. In this video, I show you how to run Arcee AI's Trinity Mini (26B parameters, 3B active) locally on Apple Silicon using MLX, and wire it up to OpenCode as a fully local AI coding assistant. Everything runs on-device — ideal for air-gapped environments, regulated industries, or anyone who wants a private coding AI. I cover the full setup: choosing the right quantization for your hardware, benchmarking generation speed across all quantizations, and configuring OpenCode to talk to the local model. I also check Trinity's results with Claud…
Watch on YouTube ↗ (saves to browser)
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)