Learning Coordinated Preference for Multi-Objective Multi-Agent Reinforcement Learning

📰 ArXiv cs.AI

arXiv:2606.14693v1 Announce Type: cross Abstract: Cooperative multi-objective multi-agent reinforcement learning (MOMARL) models team decision making under multiple, potentially conflicting objectives. In this setting, conflicts arise not only across objectives but also across agents with different observations, roles, and contributions. We propose Preference Coordinated Multi-agent Policy Optimization (PCMA), which learns coordinated agent-specific preferences to enable complementary trade-offs

Published 15 Jun 2026

Read full paper → ← Back to Reads