📰 Dev.to · kuroko

2 articles · Updated every 3 hours

Benchmarking Local Coding LLMs: 11 Realistic Tasks, 232 Runs, and the Bugs My Bench Found in My Agent

Dev.to · kuroko 🧠 Large Language Models ⚡ AI Lesson 2mo ago

What can a 16GB GPU and a local LLM actually do for everyday coding work? I built an 11-task...

What Happens When Local LLMs Fail at Tool Calling — Testing 7 Models with a Rust Coding Agent

Dev.to · kuroko 🤖 AI Agents & Automation 4mo ago

I tested 7 local LLMs on the same simple coding task. 4 succeeded. 3 failed — each in a different...