Fix Bad OCR: Fine-Tune DeepSeek-V2 on Your Own Data (Unsloth)

Shane | LLM Implementation · Beginner ·📐 ML Fundamentals ·5mo ago

Skills: Fine-tuning LLMs95%ML Maths Basics60%

🚀 Colab tutorial: Fine-tune DeepSeek-OCR (3B) with Unsloth + LoRA to improve handwriting & document OCR. In a demo we cut CER from 23% to 6% (~74% relative) and show a brief look at a small Persian OCR set. Notebook: https://docs.unsloth.ai/new/deepseek-ocr-run-and-fine-tune What you’ll learn DeepSeek-OCR overview (layout → vision tokens; fast inference) Colab setup (Transformers, PyTorch, Unsloth) Baseline inference + CER evaluation Dataset formatting (image + instruction, user/assistant turns) LoRA/PEFT fine-tuning via FastVisionModel.get_peft_model Training with Trainer, monitoring loss (quick 60-step run) Post-training evaluation (demo sample + Persian examples) Saving/pushing LoRA adapters to Hugging Face Hub Resources Unsloth: https://github.com/unslothai/unsloth DeepSeek-OCR (HF): https://huggingface.co/unsloth/DeepSeek-OCR Persian OCR dataset: https://huggingface.co/datasets/hezARAI/parsynth-ocr-200k Chapters 00:00 Intro — What is DeepSeek-OCR? 01:04 Fine-tuning results (demo & Persian set overview) 02:06 Colab notebook walkthrough 02:18 Install dependencies (Unsloth) 02:32 Load unsloth/DeepSeek-OCR 03:00 Baseline eval (CER on sample) 03:57 Test on a custom screenshot 04:43 Prep for LoRA fine-tuning 05:14 Data prep & formatting 06:22 Train (60 steps) 07:08 Evaluate — 23%→6% CER (demo sample) 07:50 Save LoRA / push to HF Hub 08:10 Outro Note: Results shown include a single-sample demo (23%→6% CER) and a brief, small Persian OCR evaluation. Expect variability on your data/language. 💬 What should I fine-tune next? Comment below. 👍 Like & subscribe if this helped! #DeepSeekOCR #Unsloth #LoRA

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: Fine-tuning LLMs

View skill →

Fine-tuning T5 LLM for Text Generation: Complete Tutorial w/ free COLAB #coding

Fine-tuning T5 LLM for Text Generation: Complete Tutorial w/ free COLAB #coding

Train image classifier using transfer learning - Fine-tuning MobileNet with Keras

Train image classifier using transfer learning - Fine-tuning MobileNet with Keras

Advanced Fine-Tuning in Rust

Advanced Fine-Tuning in Rust

GPT-4o: Fine-tune OpenAI's Multimodal Model | Live Coding & Q&A (Oct 3rd)

GPT-4o: Fine-tune OpenAI's Multimodal Model | Live Coding & Q&A (Oct 3rd)

LLM Fine-tuning: Two Crucial Tips for New Models - LLama 2

LLM Fine-tuning: Two Crucial Tips for New Models - LLama 2

SDXL LORA STYLE Training! Get THE PERFECT RESULTS!

SDXL LORA STYLE Training! Get THE PERFECT RESULTS!

Related AI Lessons

Mathematics for Machine Learning — Part 3

Learn the basics of statistics for machine learning and why it's crucial for data analysis

Medium · Machine Learning

Mathematics for Machine Learning — Part 3

Learn the statistical foundations crucial for machine learning, including probability, distributions, and inference, to improve your ML models

Medium · Data Science

Mathematics for Machine Learning — Part 3

Learn the statistical foundations for machine learning and why they matter for building predictive models

Medium · Deep Learning

🔥 From 1 Day 100 Days. This Changed Everything.

Consistency is key to improving coding skills, as shown by earning the 100 Days Badge on LeetCode

Chapters (13)

Intro — What is DeepSeek-OCR?

1:04 Fine-tuning results (demo & Persian set overview)

2:06 Colab notebook walkthrough

2:18 Install dependencies (Unsloth)

2:32 Load unsloth/DeepSeek-OCR

3:00 Baseline eval (CER on sample)

3:57 Test on a custom screenshot

4:43 Prep for LoRA fine-tuning

5:14 Data prep & formatting

6:22 Train (60 steps)

7:08 Evaluate — 23%→6% CER (demo sample)

7:50 Save LoRA / push to HF Hub

8:10 Outro

Becoming a Better Python Developer Through Learning Rust | Real Python Podcast #292