✕ Clear all filters
27 articles

📰 Microsoft Research

27 articles · Updated every 3 hours · View all reads

All Articles 67,371Blog Posts 99,886Tech Tutorials 16,278Research Papers 13,813News 12,538 ⚡ AI Lessons
Further Notes on Our Recent Research on AI Delegation and Long-Horizon Reliability
Microsoft Research 2w ago
Further Notes on Our Recent Research on AI Delegation and Long-Horizon Reliability
Our recent paper, “LLMs Corrupt Your Documents When You Delegate”, has generated discussion about the reliability of AI systems in delegated workflows. We appre
mimalloc: A new, high-performance, scalable memory allocator for the modern era
Microsoft Research ⚡ AI Lesson 2w ago
mimalloc: A new, high-performance, scalable memory allocator for the modern era
mimalloc is an open-source, modern, scalable memory allocator that is a drop-in replacement for malloc and free. It is relatively small (~12K lines), with clear
GridSFM: A new, small foundation model for the electric grid
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 2w ago
GridSFM: A new, small foundation model for the electric grid
Introducing GridSFM, a small foundation model that can predict AC optimal power flow in milliseconds, boosting efficiency and unlocking cost savings. Learn how
Advancing AI for materials with MatterSim: experimental synthesis, faster simulation, and multi-task models
Microsoft Research 2w ago
Advancing AI for materials with MatterSim: experimental synthesis, faster simulation, and multi-task models
MatterSim is expanding what AI can do for materials science—from faster large-scale simulations to MatterSim-MT, a new multi-task model for simulating propertie
SocialReasoning-Bench: Measuring whether AI agents act in users’ best interests
Microsoft Research 2w ago
SocialReasoning-Bench: Measuring whether AI agents act in users’ best interests
Using SocialReasoning Bench, we observed a stable pattern across models—agents execute competently, but fail to consistently improve the user’s position, even w
Building realistic electric transmission grid dataset at scale: a pipeline from open dataset
Microsoft Research 📊 Data Analytics & Business Intelligence ⚡ AI Lesson 3w ago
Building realistic electric transmission grid dataset at scale: a pipeline from open dataset
Microsoft Research is excited to release an open dataset of approximate transmission topology of the U.S. power grid derived from publicly available data. The a
Microsoft at NSDI 2026: Advances in large-scale networked systems
Microsoft Research 3w ago
Microsoft at NSDI 2026: Advances in large-scale networked systems
Microsoft researchers share advances in building and operating large-scale distributed systems, spanning datacenters, networking, and the growing intersection w
Red-teaming a network of agents: Understanding what breaks when AI agents interact at scale
Microsoft Research 1mo ago
Red-teaming a network of agents: Understanding what breaks when AI agents interact at scale
Safe agents don’t guarantee a safe ecosystem of interconnected agents. Microsoft Research examines what breaks when AI agents interact and why network-level ris
AutoAdapt: Automated domain adaptation for large language models
Microsoft Research 1mo ago
AutoAdapt: Automated domain adaptation for large language models
Deploying large language models (LLMs) in real-world, high-stakes settings is harder than it should be. In high-stakes settings like law, medicine, and cloud in
New Future of Work: AI is driving rapid change, uneven benefits
Microsoft Research 1mo ago
New Future of Work: AI is driving rapid change, uneven benefits
For the past five years, the New Future of Work report has captured how work is changing. This year, the shift feels especially sharp. Previous editions have fo
ADeLe: Predicting and explaining AI performance across tasks
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 1mo ago
ADeLe: Predicting and explaining AI performance across tasks
AI benchmarks report how large language models (LLMs) perform on specific tasks but provide little insight into their underlying capabilities that drive their p
AsgardBench: A benchmark for visually grounded interactive planning
Microsoft Research 👁️ Computer Vision ⚡ AI Lesson 2mo ago
AsgardBench: A benchmark for visually grounded interactive planning
Imagine a robot tasked with cleaning a kitchen. It needs to observe its environment, decide what to do, and adjust when things don’t go as expected, for example
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 2mo ago
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. Most sys
Systematic debugging for AI agents: Introducing the AgentRx framework
Microsoft Research ⚡ AI Lesson 2mo ago
Systematic debugging for AI agents: Introducing the AgentRx framework
As AI agents transition from simple chatbots to autonomous systems capable of managing cloud incidents, navigating complex web interfaces, and executing multi-s
PlugMem: Transforming raw agent interactions into reusable knowledge
Microsoft Research ⚡ AI Lesson 2mo ago
PlugMem: Transforming raw agent interactions into reusable knowledge
It seems counterintuitive: giving AI agents more memory can make them less effective. As interaction logs accumulate, they grow large, fill with irrelevant cont
Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 2mo ago
Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model
We are pleased to announce Phi-4-reasoning-vision-15B, a 15 billion parameter open‑weight multimodal reasoning model, available through Microsoft Foundry (opens
CORPGEN advances AI agents for real work
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 3mo ago
CORPGEN advances AI agents for real work
By mid-morning, a typical knowledge worker is already juggling a client report, a budget spreadsheet, a slide deck, and an email backlog, all interdependent and
Media Authenticity Methods in Practice: Capabilities, Limitations, and Directions
Microsoft Research 👁️ Computer Vision ⚡ AI Lesson 3mo ago
Media Authenticity Methods in Practice: Capabilities, Limitations, and Directions
As synthetic media grows, verifying what’s real, and the origin of content, matters more than ever. Our latest report explores media integrity and authenticatio
Project Silica’s advances in glass storage technology
Microsoft Research ⚡ AI Lesson 3mo ago
Project Silica’s advances in glass storage technology
Project Silica introduces new techniques for encoding data in borosilicate glass, as described in the journal Nature. These advances lower media cost and simpli
Rethinking imitation learning with Predictive Inverse Dynamics Models
Microsoft Research 🧠 Large Language Models ⚡ AI Lesson 3mo ago
Rethinking imitation learning with Predictive Inverse Dynamics Models
This research looks at why Predictive Inverse Dynamics Models often outperform standard Behavior Cloning in imitation learning. By using simple predictions of w