✕ Clear all filters
65 articles

📰 Dev.to · Akhilesh

65 articles · Updated every 3 hours · View all reads

All Articles 90,962Blog Posts 109,273Tech Tutorials 22,747Research Papers 19,221News 14,835 ⚡ AI Lessons
98. RAG: Give Your AI Access to Your Documents
Dev.to · Akhilesh 🔍 RAG & Vector Search ⚡ AI Lesson 3w ago
98. RAG: Give Your AI Access to Your Documents
You ask ChatGPT about your company's internal policies. It makes something up. It sounds confident....
95. Fine-Tuning LLMs: Make a General Model Do Your Specific Job
Dev.to · Akhilesh 🧠 Large Language Models ⚡ AI Lesson 3w ago
95. Fine-Tuning LLMs: Make a General Model Do Your Specific Job
A general language model knows a little about everything. It knows some medicine. Some law. Some...
93. GPT: The Model That Predicts the Next Word Forever
Dev.to · Akhilesh 1mo ago
93. GPT: The Model That Predicts the Next Word Forever
BERT reads everything at once and understands. GPT reads left to right and predicts what comes next....
92. BERT: The Model That Reads in Both Directions
Dev.to · Akhilesh 1mo ago
92. BERT: The Model That Reads in Both Directions
GPT generates text by predicting the next word. It reads left to right. BERT does something...
89. The Claude API: Building with Anthropic's Models
Dev.to · Akhilesh 1mo ago
89. The Claude API: Building with Anthropic's Models
Every model has a philosophy behind it. OpenAI built GPT to maximize capability. Anthropic built...
86. RAG: Giving Language Models Long-Term Memory
Dev.to · Akhilesh 🧠 Large Language Models ⚡ AI Lesson 1mo ago
86. RAG: Giving Language Models Long-Term Memory
Large language models know a lot. They do not know everything. They were trained on internet text up...
85. Embeddings and Vector Search: Memory for Language Models
Dev.to · Akhilesh 1mo ago
85. Embeddings and Vector Search: Memory for Language Models
A language model has no memory. You ask it a question. It generates an answer from its pretrained...
84. Fine-Tuning LLMs: Teaching Giants New Tricks
Dev.to · Akhilesh 1mo ago
84. Fine-Tuning LLMs: Teaching Giants New Tricks
GPT-3 has 175 billion parameters. Full fine-tuning updates all 175 billion with every gradient step....
83. HuggingFace: Your Library for Every Pretrained Model
Dev.to · Akhilesh 1mo ago
83. HuggingFace: Your Library for Every Pretrained Model
You have spent the last five posts building transformers, training BERT-like models, implementing GPT...
82. GPT: The Art of Predicting the Next Word
Dev.to · Akhilesh 1mo ago
82. GPT: The Art of Predicting the Next Word
One model objective, stated simply: given all previous words, predict the next word. That is the...
81. BERT: Understanding Language Deeply
Dev.to · Akhilesh 1mo ago
81. BERT: Understanding Language Deeply
Google Search used to work by matching keywords. You type "jaguar speed." You get pages about the...
80. The Transformer: The Architecture That Changed Everything
Dev.to · Akhilesh 1mo ago
80. The Transformer: The Architecture That Changed Everything
In 2017, eight researchers at Google published a paper called "Attention Is All You Need." The title...
78. Word Embeddings: Words as Numbers That Actually Mean Something
Dev.to · Akhilesh 1mo ago
78. Word Embeddings: Words as Numbers That Actually Mean Something
The tokenizer gave you integers. "cat" is 2345. "dog" is 7891. Your model sees these numbers and...
70. Hyperparameter Tuning: Finding the Best Settings.
Dev.to · Akhilesh 1mo ago
70. Hyperparameter Tuning: Finding the Best Settings.
You picked a model. You trained it. You got decent accuracy. Then someone asks: did you tune the...
69. Feature Engineering: Building Better Inputs
Dev.to · Akhilesh 1mo ago
69. Feature Engineering: Building Better Inputs
You've tried three different algorithms. None of them break 78% accuracy. You add dropout, tune...
68. PCA: Shrinking Data Without Losing Information
Dev.to · Akhilesh 📐 ML Fundamentals ⚡ AI Lesson 1mo ago
68. PCA: Shrinking Data Without Losing Information
You have 100 features. Most of them are correlated. Training is slow. Visualization is impossible....
67. DBSCAN: Clustering That Handles Messy Data
Dev.to · Akhilesh 1mo ago
67. DBSCAN: Clustering That Handles Messy Data
Last post K-Means failed on crescent-shaped data. It cut across the natural curves instead of...
64. Precision and Recall: Beyond Accuracy
Dev.to · Akhilesh 1mo ago
64. Precision and Recall: Beyond Accuracy
Last post you saw that accuracy can be 95% while your model catches zero fraud. Precision and recall...
63. Confusion Matrix: What Your Model Got Wrong and Why
Dev.to · Akhilesh 1mo ago
63. Confusion Matrix: What Your Model Got Wrong and Why
Your model has 95% accuracy. You ship it. Three weeks later someone tells you it's missing 40% of...
62. Naive Bayes: Fast, Simple, Surprisingly Effective
Dev.to · Akhilesh 1mo ago
62. Naive Bayes: Fast, Simple, Surprisingly Effective
Your email spam filter makes a decision in milliseconds. Thousands of words. Instant...