AudioGuard: Toward Comprehensive Audio Safety Protection Across Diverse Threat Models
📰 ArXiv cs.AI
arXiv:2604.08867v1 Announce Type: cross Abstract: Audio has rapidly become a primary interface for foundation models, powering real-time voice assistants. Ensuring safety in audio systems is inherently more complex than just "unsafe text spoken aloud": real-world risks can hinge on audio-native harmful sound events, speaker attributes (e.g., child voice), impersonation/voice-cloning misuse, and voice-content compositional harms, such as child voice plus sexual content. The nature of audio makes
DeepCamp AI