Model Merging: Foundations and Algorithms

📰 ArXiv cs.AI

arXiv:2605.01580v1 Announce Type: cross Abstract: Modern deep learning usually treats models as separate artifacts: trained independently, specialized for particular purposes, and replaced when improved versions appear. This thesis studies model merging as an alternative paradigm: combining independently trained neural networks directly in weight space, with little or no optimization and without requiring access to the original training data. The thesis considers two main regimes. In the single-

Published 5 May 2026
Read full paper → ← Back to Reads