Structured Exploration and Exploitation of Label Functions for Automated Data Annotation
📰 ArXiv cs.AI
arXiv:2604.08578v1 Announce Type: cross Abstract: High-quality labeled data is critical for training reliable machine learning and deep learning models, yet manual annotation remains costly and error-prone. Programmatic labeling addresses this challenge by using label functions (LFs), i.e., heuristic rules that automatically generate weak labels for training datasets. However, existing automated LF generation methods either rely on large language models (LLMs) to synthesize surface-level heurist
DeepCamp AI