DDSP-QbE++: Improving Speech Quality for Speech Anonymisation for Atypical Speech

📰 ArXiv cs.AI

arXiv:2604.09246v1 Announce Type: cross Abstract: Differentiable Digital Signal Processing (DDSP) pipelines for voice conversion rely on subtractive synthesis, where a periodic excitation signal is shaped by a learned spectral envelope to reconstruct the target voice. In DDSP-QbE, the excitation is generated via phase accumulation, producing a sawtooth-like waveform whose abrupt discontinuities introduce aliasing artefacts that manifest perceptually as buzziness and spectral distortion, particul

Published 13 Apr 2026

Read full paper → ← Back to Reads