STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media
📰 ArXiv cs.AI
arXiv:2605.25162v1 Announce Type: cross Abstract: Large language models for vertical domains are bottlenecked by the scarcity of complex, domain-specific task-oriented dialogues. Existing data acquisition pipelines face a persistent trilemma: expert annotation is expensive, real-world service conversations are constrained by privacy and commercial restrictions, and static corpora quickly become temporally stale. We propose Stream, a data-centric framework that leverages publicly available stream
DeepCamp AI