Heterogeneous Decentralized Diffusion Models

📰 ArXiv cs.AI

arXiv:2603.06741v2

Abstract: Training frontier-scale diffusion models often requires substantial computational resources concentrated in tightly-coupled clusters, limiting participation to well-resourced institutions. While Decentralized Diffusion Models (DDM) enable training multiple experts in isolation, existing approaches require 1176 GPU-days and homogeneous training objectives across all experts. We present an efficient framework that dramatically reduces resou

Published 2 Jun 2026