AI News — Latest Developments & Breakthroughs

Microsoft Research 👁️ Computer Vision ⚡ AI Lesson 3d ago

AsgardBench: A benchmark for visually grounded interactive planning

Imagine a robot tasked with cleaning a kitchen. It needs to observe its environment, decide what to do, and adjust when things don’t go as expected, for example

Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 3d ago

GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation

Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. Most sys

Microsoft Research ⚡ AI Lesson 2w ago

Systematic debugging for AI agents: Introducing the AgentRx framework

As AI agents transition from simple chatbots to autonomous systems capable of managing cloud incidents, navigating complex web interfaces, and executing multi-s

Microsoft Research ⚡ AI Lesson 2w ago

PlugMem: Transforming raw agent interactions into reusable knowledge

It seems counterintuitive: giving AI agents more memory can make them less effective. As interaction logs accumulate, they grow large, fill with irrelevant cont

Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 3w ago

Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model

We are pleased to announce Phi-4-reasoning-vision-15B, a 15 billion parameter open‑weight multimodal reasoning model, available through Microsoft Foundry (opens

Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 1mo ago

CORPGEN advances AI agents for real work

By mid-morning, a typical knowledge worker is already juggling a client report, a budget spreadsheet, a slide deck, and an email backlog, all interdependent and

Microsoft Research 👁️ Computer Vision ⚡ AI Lesson 1mo ago

Media Authenticity Methods in Practice: Capabilities, Limitations, and Directions

As synthetic media grows, verifying what’s real, and the origin of content, matters more than ever. Our latest report explores media integrity and authenticatio

Microsoft Research ⚡ AI Lesson 1mo ago

Project Silica’s advances in glass storage technology

Project Silica introduces new techniques for encoding data in borosilicate glass, as described in the journal Nature. These advances lower media cost and simpli

Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 1mo ago

Rethinking imitation learning with Predictive Inverse Dynamics Models

This research looks at why Predictive Inverse Dynamics Models often outperform standard Behavior Cloning in imitation learning. By using simple predictions of w

Microsoft Research ⚡ AI Lesson 1mo ago

Paza: Introducing automatic speech recognition benchmarks and models for low resource languages

Microsoft Research unveils Paza, a human-centered speech pipeline, and PazaBench, the first leaderboard for low-resource languages. It covers 39 African languag

Microsoft Research ⚡ AI Lesson 2mo ago

UniRG: Scaling medical imaging report generation with multimodal reinforcement learning

AI can help generate medical image reports, but today’s models struggle with varying reporting schemes. Learn how UniRG uses reinforcement learning to boost per

Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 2mo ago

Argos: Multimodal reinforcement learning with agentic verifier for AI agents

Argos improves multimodal RL by evaluating whether an agent’s reasoning aligns with what it observes over time. The approach reduces visual hallucinations and p

📰 Microsoft Research