Environmental Sound Deepfake Detection Using Deep-Learning Framework

📰 ArXiv cs.AI

arXiv:2604.19652v1 Announce Type: cross Abstract: In this paper, we propose a deep-learning framework for environmental sound deepfake detection (ESDD) -- the task of identifying whether the sound scene and sound event in an input audio recording is fake or not. To this end, we conducted extensive experiments to explore how individual spectrograms, a wide range of network architectures and pre-trained models, ensemble of spectrograms or network architectures affect the ESDD task performance. The

Published 22 Apr 2026

Read full paper → ← Back to Reads