Apple Intelligence Foundation Language Models

📰 ArXiv cs.AI

arXiv:2407.21075v2 Announce Type: replace Abstract: We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how t

Published 28 May 2026
Read full paper → ← Back to Reads