12 articles

📰 Microsoft Research

Articles from Microsoft Research · 12 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (4905) ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIWeaviate Blog
AsgardBench: A benchmark for visually grounded interactive planning
Microsoft Research 👁️ Computer Vision ⚡ AI Lesson 3d ago
AsgardBench: A benchmark for visually grounded interactive planning
Imagine a robot tasked with cleaning a kitchen. It needs to observe its environment, decide what to do, and adjust when things don’t go as expected, for example
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 3d ago
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. Most sys
Systematic debugging for AI agents: Introducing the AgentRx framework
Microsoft Research ⚡ AI Lesson 2w ago
Systematic debugging for AI agents: Introducing the AgentRx framework
As AI agents transition from simple chatbots to autonomous systems capable of managing cloud incidents, navigating complex web interfaces, and executing multi-s
PlugMem: Transforming raw agent interactions into reusable knowledge
Microsoft Research ⚡ AI Lesson 2w ago
PlugMem: Transforming raw agent interactions into reusable knowledge
It seems counterintuitive: giving AI agents more memory can make them less effective. As interaction logs accumulate, they grow large, fill with irrelevant cont
Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 3w ago
Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model
We are pleased to announce Phi-4-reasoning-vision-15B, a 15 billion parameter open‑weight multimodal reasoning model, available through Microsoft Foundry (opens
CORPGEN advances AI agents for real work
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 1mo ago
CORPGEN advances AI agents for real work
By mid-morning, a typical knowledge worker is already juggling a client report, a budget spreadsheet, a slide deck, and an email backlog, all interdependent and
Media Authenticity Methods in Practice: Capabilities, Limitations, and Directions
Microsoft Research 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Media Authenticity Methods in Practice: Capabilities, Limitations, and Directions
As synthetic media grows, verifying what’s real, and the origin of content, matters more than ever. Our latest report explores media integrity and authenticatio
Project Silica’s advances in glass storage technology
Microsoft Research ⚡ AI Lesson 1mo ago
Project Silica’s advances in glass storage technology
Project Silica introduces new techniques for encoding data in borosilicate glass, as described in the journal Nature. These advances lower media cost and simpli
Rethinking imitation learning with Predictive Inverse Dynamics Models
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Rethinking imitation learning with Predictive Inverse Dynamics Models
This research looks at why Predictive Inverse Dynamics Models often outperform standard Behavior Cloning in imitation learning. By using simple predictions of w
Paza: Introducing automatic speech recognition benchmarks and models for low resource languages
Microsoft Research ⚡ AI Lesson 1mo ago
Paza: Introducing automatic speech recognition benchmarks and models for low resource languages
Microsoft Research unveils Paza, a human-centered speech pipeline, and PazaBench, the first leaderboard for low-resource languages. It covers 39 African languag
UniRG: Scaling medical imaging report generation with multimodal reinforcement learning
Microsoft Research ⚡ AI Lesson 2mo ago
UniRG: Scaling medical imaging report generation with multimodal reinforcement learning
AI can help generate medical image reports, but today’s models struggle with varying reporting schemes. Learn how UniRG uses reinforcement learning to boost per
Argos: Multimodal reinforcement learning with agentic verifier for AI agents
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 2mo ago
Argos: Multimodal reinforcement learning with agentic verifier for AI agents
Argos improves multimodal RL by evaluating whether an agent’s reasoning aligns with what it observes over time. The approach reduces visual hallucinations and p