📰 Dev.to · Billy Bob Gurr
Articles from Dev.to · Billy Bob Gurr · 2 articles · Updated every 3 hours

Dev.to · Billy Bob Gurr
2d ago
When I started running models locally, I thought quantization meant squeezing more into RAM. Turns o...
Most people default to Q4_K_M in llama.cpp because it's the "safe" choice. But I've found the real...

Dev.to · Billy Bob Gurr
🧠 Large Language Models
⚡ AI Lesson
3d ago
Why GPU Memory Bandwidth Matters More Than VRAM for Local LLMs
You've probably read that you need a GPU with tons of VRAM to run local models. That's true, but only...