Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

1695 videos
The evolution of LLM evaluation and Japan’s cutting-edge benchmarks on the Nejumi leaderboard
Weights & Biases Advanced 2mo ago
Training Dashboards with Trackio + Hugging Face
Hugging Face Advanced 2mo ago
The RL Irony in LLMs (and its insane new meta)
bycloud Advanced 2mo ago
AgentCPM-Explore Tutorial
OpenBMB Advanced 2mo ago
Hierarchical Reasoning HRM 2.0: NEW Attractor Dynamics in AI
Discover AI Advanced 2mo ago
The Best SQL Live Corporate TRAINING starting in Just 2 Days
Manish Sharma Advanced 2mo ago
Luis Solis Navarro - Efficient 2D LiDAR Scene Understanding for Autonomous Driving via Multi Modal
Cohere Advanced 2mo ago
Death of the Token in AI: Multi-Parallel AI Reality, NVIDIA’s Silent Robots & Pre-GPT-6
Discover AI Advanced 2mo ago
Local VLM fine-tuning on the NVIDIA DGX Spark - Part 2.5 - Training the LLM part only
Daniel Bourke Advanced 2mo ago
Claude Cowork is Here! Full Breakdown + Testing
The AI Advantage Advanced 2mo ago
AI Models Are Falling Apart | CLAUDE 4.5 & KIMI K2
Discover AI Advanced 2mo ago
Why LLMs Shouldn’t Follow Instructions (But Do)
ML Guy Advanced 2mo ago
Advanced RAG Techniques with Arcee Trinity Mini (100% Local)
Julien Simon Advanced 2mo ago
Local LLM fine-tuning on the NVIDIA DGX Spark - Part 1
Daniel Bourke Advanced 2mo ago
10: Generative AI – Adapting LLMs with Parameter-Efficient Fine-Tuning
MIT OpenCourseWare Advanced 2mo ago
AI Kill Switch for Hallucinations (Anthropic)
Discover AI Advanced 2mo ago
#DeepSeek’s #mHC Breakthrough: Stabilizing Hyper-Connections for Large-Scale LLM Training
BazAI Advanced 2mo ago
Youtu-Agent: Scaling LLM Agent Productivity via Automated Generation and Hybrid RL
BazAI Advanced 2mo ago
MAI-UI: Alibaba’s New Foundation GUI Agents Outperforming Gemini & GPT-4o
BazAI Advanced 2mo ago
Molmo2: Open-Source Vision-Language Models with State-of-the-Art Video Grounding
BazAI Advanced 2mo ago
[State of AI Papers 2025] Fixing Research with Social Signals, OCR & Implementation — Team AlphaXiv
Latent Space Advanced 2mo ago
[State of Post-Training] From GPT-4.1 to 5.1: RLVR, Agent & Token Efficiency — Josh McGrath, OpenAI
Latent Space Advanced 2mo ago
Jack Morris: Stuffing Context is not Memory, Updating Weights is
AI Engineer Advanced 3mo ago
Base vs instruct models explained
What's AI by Louis-François Bouchard Advanced 3mo ago