Deep Delta Learning
📰 ArXiv cs.AI
arXiv:2601.00417v3 Announce Type: replace-cross Abstract: Transformer residual streams evolve by additive accumulation: each layer appends a feature update to a shared hidden state, but has no direct mechanism for replacing content that has become obsolete or conflicting. We introduce Deep Delta Learning (DDL), a residual update rule that preserves the identity path while giving every layer the ability to selectively rewrite residual content. DDL reads the current state along a learned direction
DeepCamp AI