📰 Microsoft Research
7 articles · Updated every 3 hours · View all reads
All
Articles 72,049Blog Posts 101,122Tech Tutorials 17,514Research Papers 15,348News 12,911
⚡ AI Lessons

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
3w ago
GridSFM: A new, small foundation model for the electric grid
Introducing GridSFM, a small foundation model that can predict AC optimal power flow in milliseconds, boosting efficiency and unlocking cost savings. Learn how

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
2mo ago
ADeLe: Predicting and explaining AI performance across tasks
AI benchmarks report how large language models (LLMs) perform on specific tasks but provide little insight into their underlying capabilities that drive their p

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
2mo ago
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. Most sys

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
3mo ago
Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model
We are pleased to announce Phi-4-reasoning-vision-15B, a 15 billion parameter open‑weight multimodal reasoning model, available through Microsoft Foundry (opens

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
3mo ago
CORPGEN advances AI agents for real work
By mid-morning, a typical knowledge worker is already juggling a client report, a budget spreadsheet, a slide deck, and an email backlog, all interdependent and

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
3mo ago
Rethinking imitation learning with Predictive Inverse Dynamics Models
This research looks at why Predictive Inverse Dynamics Models often outperform standard Behavior Cloning in imitation learning. By using simple predictions of w
Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
4mo ago
Argos: Multimodal reinforcement learning with agentic verifier for AI agents
Argos improves multimodal RL by evaluating whether an agent’s reasoning aligns with what it observes over time. The approach reduces visual hallucinations and p
DeepCamp AI