📰 Microsoft Research
Articles from Microsoft Research · 12 articles · Updated every 3 hours · View all news
All
⚡ AI Lessons (4905)
ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIWeaviate Blog

Microsoft Research
👁️ Computer Vision
⚡ AI Lesson
3d ago
AsgardBench: A benchmark for visually grounded interactive planning
Imagine a robot tasked with cleaning a kitchen. It needs to observe its environment, decide what to do, and adjust when things don’t go as expected, for example

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
3d ago
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. Most sys
Microsoft Research
⚡ AI Lesson
2w ago
Systematic debugging for AI agents: Introducing the AgentRx framework
As AI agents transition from simple chatbots to autonomous systems capable of managing cloud incidents, navigating complex web interfaces, and executing multi-s

Microsoft Research
⚡ AI Lesson
2w ago
PlugMem: Transforming raw agent interactions into reusable knowledge
It seems counterintuitive: giving AI agents more memory can make them less effective. As interaction logs accumulate, they grow large, fill with irrelevant cont

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
3w ago
Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model
We are pleased to announce Phi-4-reasoning-vision-15B, a 15 billion parameter open‑weight multimodal reasoning model, available through Microsoft Foundry (opens

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
1mo ago
CORPGEN advances AI agents for real work
By mid-morning, a typical knowledge worker is already juggling a client report, a budget spreadsheet, a slide deck, and an email backlog, all interdependent and

Microsoft Research
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Media Authenticity Methods in Practice: Capabilities, Limitations, and Directions
As synthetic media grows, verifying what’s real, and the origin of content, matters more than ever. Our latest report explores media integrity and authenticatio

Microsoft Research
⚡ AI Lesson
1mo ago
Project Silica’s advances in glass storage technology
Project Silica introduces new techniques for encoding data in borosilicate glass, as described in the journal Nature. These advances lower media cost and simpli

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
1mo ago
Rethinking imitation learning with Predictive Inverse Dynamics Models
This research looks at why Predictive Inverse Dynamics Models often outperform standard Behavior Cloning in imitation learning. By using simple predictions of w

Microsoft Research
⚡ AI Lesson
1mo ago
Paza: Introducing automatic speech recognition benchmarks and models for low resource languages
Microsoft Research unveils Paza, a human-centered speech pipeline, and PazaBench, the first leaderboard for low-resource languages. It covers 39 African languag

Microsoft Research
⚡ AI Lesson
2mo ago
UniRG: Scaling medical imaging report generation with multimodal reinforcement learning
AI can help generate medical image reports, but today’s models struggle with varying reporting schemes. Learn how UniRG uses reinforcement learning to boost per
Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
2mo ago
Argos: Multimodal reinforcement learning with agentic verifier for AI agents
Argos improves multimodal RL by evaluating whether an agent’s reasoning aligns with what it observes over time. The approach reduces visual hallucinations and p
DeepCamp AI