Limited Linguistic Diversity in Embodied AI Datasets

📰 ArXiv cs.AI

arXiv:2601.03136v2

Abstract: Language plays a critical role in Vision-Language-Action (VLA) models, yet the linguistic characteristics of the datasets used to train and evaluate these systems remain poorly documented. In this work, we present a systematic dataset audit of several widely used VLA corpora, aiming to characterize what kinds of instructions these datasets actually contain and how much linguistic variety they provide. We quantify instruction language alon

Published 29 Apr 2026
