Your local LLM is 10x slower than it should be
Here's the one change that took mine from ~120 tok/s to 1200+ without a new GPU.
DeepCamp AI