EAB Meeting • 17 June 2022
Differentiable signal processing �for audio engineering
Christian J. Steinmetz
Centre for Digital Music, Queen Mary University of London
Huy Phan
Joshua D. Reiss
More people are creating audio content
Music
Podcasts
Short-form content
Sound for Video
Producing high quality audio requires expertise
Demand for high quality audio
Deep learning for audio processing
Neural network
Source separation
Speech enhancement
Audio effect modeling
Stöter et al., 2019, "Open-unmix-a reference implementation for music source separation." JOSS
Pascual et al., 2017 "SEGAN: Speech enhancement generative adversarial network." arXiv:1703.09452
Martínez Ramírez et al., 2020, "Deep learning for black-box modeling of audio effects." Applied Sciences
Audio engineers solve problems with DSP
Controlling audio effects
Modeling acoustic spaces
Creating a mix
Can we build models that learn to control DSP for audio engineering tasks?
Differentiable signal processing
Neural network
Signal processing
Control parameters
Techniques
Colonel and Steinmetz et al., 2022 "Direct design of biquad filter cascades with deep learning by sampling random polynomials." IEEE ICASSP
Steinmetz et al., 2021 "Filtered noise shaping for time domain room impulse response estimation from reverberant speech." IEEE WASPAA (Best Student Paper Award)
Steinmetz et al., 2022 "Style transfer of audio effects with differentiable signal processing." Journal of the Audio Engineering Society
Differentiable IIR filters
Differentiable reverberation
Generalized differentiable effects
What’s next?
Torchdiffx
Differentiable audio effects in PyTorch
Coming soon
Applications in...
Improved methods to facilitate backprop through DSP
EAB Meeting • 17 June 2022
Differentiable signal processing �for audio engineering
Christian J. Steinmetz
Centre for Digital Music, Queen Mary University of London
Huy Phan
Joshua D. Reiss
Differentiable signal processing
Deep neural network
Signal processing
Control parameters
Make this differentiable
Deep learning for audio processing
Deep neural network
Great for problems that can’t be solved with DSP
but...
Differentiable signal processing
Backprop through DSP requires special techniques