1 of 13

EAB Meeting • 17 June 2022

Differentiable signal processing �for audio engineering

Christian J. Steinmetz

c.j.steinmetz@qmul.ac.uk

Centre for Digital Music, Queen Mary University of London

Huy Phan

Joshua D. Reiss

2 of 13

3 of 13

Producing high quality audio requires expertise

Demand for high quality audio

4 of 13

Deep learning for audio processing

Neural network

Source separation

Speech enhancement

Audio effect modeling

Stöter et al., 2019, "Open-unmix-a reference implementation for music source separation." JOSS

Pascual et al., 2017 "SEGAN: Speech enhancement generative adversarial network." arXiv:1703.09452

Martínez Ramírez et al., 2020, "Deep learning for black-box modeling of audio effects." Applied Sciences

5 of 13

Audio engineers solve problems with DSP

Controlling audio effects

Modeling acoustic spaces

Creating a mix

Can we build models that learn to control DSP for audio engineering tasks?

6 of 13

Differentiable signal processing

Neural network

Signal processing

Leveraging existing DSP tools and knowledge
High quality audio processing with few artifacts
Human understandable outputs that can be adjusted
Efficient and can easily run in real-time on CPU

Control parameters

7 of 13

Techniques

Automatic differentiation (AD)�Engel et al. 2020

Neural proxies and hybrids (NP)�Steinmetz et al. 2020, Steinmetz et al. 2022

Numerical gradient approximation (NGA)�Martinez Ramirez et al. 2021

8 of 13

Colonel and Steinmetz et al., 2022 "Direct design of biquad filter cascades with deep learning by sampling random polynomials." IEEE ICASSP

Steinmetz et al., 2021 "Filtered noise shaping for time domain room impulse response estimation from reverberant speech." IEEE WASPAA (Best Student Paper Award)

Steinmetz et al., 2022 "Style transfer of audio effects with differentiable signal processing." Journal of the Audio Engineering Society

Differentiable IIR filters

Differentiable reverberation

Generalized differentiable effects

9 of 13

What’s next?

Torchdiffx

Differentiable audio effects in PyTorch

Gain
Panner
Lowpass/Highpass
Parametric EQ
Dynamic range compressor

Coming soon

Multiband compressor
Brickwall limiter
Denoiser
Artificial reverberation
Overdrive/Distortion

Applications in...

Automatic multitrack mixing
Deconstructing a mix
Audio quality enhancement

Improved methods to facilitate backprop through DSP

10 of 13

EAB Meeting • 17 June 2022

Differentiable signal processing �for audio engineering

Christian J. Steinmetz

c.j.steinmetz@qmul.ac.uk

Centre for Digital Music, Queen Mary University of London

Huy Phan

Joshua D. Reiss

11 of 13

Differentiable signal processing

Deep neural network

Signal processing

Control parameters

Make this differentiable

12 of 13

Deep learning for audio processing

Deep neural network

Great for problems that can’t be solved with DSP

but...

Prone to artifacts or adding signal distortions
Not easily controllable (how do we adjust the output?)
Requires significant computation (often not real-time on CPU)

13 of 13

Differentiable signal processing

Backprop through DSP requires special techniques