770 Experiments to Squeeze 30 tok/s Out of a 35B MoE Model on a $500 GPU

📰 Dev.to · AlexChen

29.899 tokens per...

Published 2 Apr 2026