Demoing a Large Language Model running locally on my laptop

Patrick Devaney · Beginner ·🧠 Large Language Models ·2y ago
Model: laser-dolphin-mixtral-2x7b-dpo.Q3_K_M Front-End App: https:/github.com/oobabooga/text-generation-webui Device: hp pavilion laptop 15-cs3xxx Ram:12gb CPU: i5-1035g1 cpu @1ghz Download the model from: https:/huggingface.co/TheBloke/laser-dolphin-mixtral-2x7b-dpo-GGUF/blob/main/laser-dolphin-mixtral-2x7b-dpo.Q3_K_M.GGUF Download the latest release of the text-generation-webui. Extract it to your desired directory. You could also clone it from Github. Downloading this zip resolved an issue with gradio not being recognized by the start executable. Now run the start_linux.sh, start_windows.bat, start_macos.sh, or start_wsl.bat script depending on your OS. Select a GPU manufacturer, in this case none. Cut and paste your model to the models directory in the text-generation webui repository. After downloading all packages for the webui and initializing with with a localhost ip, load the model from within the webui. The token generation rate for Q3_K_M is still not viable for daily use, but the verbosity and accuracy is impressive for this hardware setup.
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Big Tech firms are investing heavily in AI, driving growth and transformation, while emphasizing safety and responsible adoption
Dev.to AI
What happens when AI starts building itself
Explore the concept of AI building itself and its implications on the future of technology
Dev.to AI
Ship Your SaaS for Free: OpenRouter’s Hidden Superpower
Learn how to use OpenRouter's free API tiers to build and prototype SaaS applications without incurring costs, leveraging 200+ LLMs like Mistral 7B and Llama 3.1 8B
Dev.to AI
Shipping Multilingual Video with GPT-5.2: A Developer's Guide to VideoDubber's Translation Pipeline
Learn how to ship multilingual video content with GPT-5.2 using VideoDubber's translation pipeline for better idiom handling and tone preservation
Dev.to AI
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch →