Essential Subspace Merging for Multi-Task Learning
📰 ArXiv cs.AI
arXiv:2606.19164v2 Announce Type: replace-cross Abstract: Model merging aims to enable multi-task learning by integrating the capabilities of multiple models fine-tuned from the same pre-trained checkpoint into a single model. Its core challenge is inter-task interference among task-specific parameter updates. In this paper, we analyze the output shifts induced by task updates and observe that their energy is concentrated in a small number of principal directions. We call the subspace spanned by
DeepCamp AI