FULLY CONDITIONED AND LOW-LATENCY BLACK-BOX MODELING OF ANALOG COMPRESSION

University of Oslo
DAFx 2023

Abstract

Neural networks have been found suitable for virtual analog modeling applications. Several analog audio effects have been successfully modeled with deep learning techniques, using low-latency and conditioned architectures suitable for real-world applications. Challenges remain with effects presenting more complex responses, such as nonlinear and time-varying input-output relationships. This paper proposes a deep-learning model for the analog compression effect. The architecture we introduce is fully conditioned by the device control parameters and it works on small audio segments, allowing low-latency real-time implementations. The architecture is used to model the CL 1B analog optical compressor, showing an overall high accuracy and ability to capture the different attack and release compression profiles. The proposed architecture' ability to model audio compression behaviors is also verified using datasets from other compressors. Limitations remain with heavy compression scenarios determined by the conditioning parameters.

TubeTech CL 1B - ED (Light) Audio Examples

Attack Time: 150 ms, Release Time: 0,005 s, Ratio 6:1, Threshold -20dBu.

STFT time
Input Target Prediciton

Attack Time: 0.5 ms, Release Time: 0,005 s, Ratio 10:1, Threshold -20dBu.

STFT time
Input Target Prediciton

Attack Time: 0.5 ms, Release Time: 0,005 s, Ratio 10:1, Threshold -40dBu.

STFT time
Input Target Prediciton

Teletronix LA-2A - ED (Light) Audio Examples

Mode: Compression, Peak Reduction 80.

STFT time
Input Target Prediciton

Mode: Compression, Peak Reduction 80.

STFT time
Input Target Prediciton

Mode: Compression, Peak Reduction 80.

STFT time
Input Target Prediciton

TubeTech CL 1B - ED Audio Examples

Attack Time: 0.5 ms, Release Time: 0,005 s, Ratio 10:1, Threshold -20dBu.

STFT time
Input Target Prediciton

Attack Time: 150 ms, Release Time: 0,005 s, Ratio 6:1, Threshold -20dBu.

STFT time
Input Target Prediciton

Attack Time: 0.5 ms, Release Time: 0,005 s, Ratio 2:1, Threshold -40dBu.

STFT time
Input Target Prediciton

Teletronix LA-2A - ED Audio Examples

Mode: Compression, Peak Reduction 80.

STFT time
Input Target Prediciton

Mode: Compression, Peak Reduction 80.

STFT time
Input Target Prediciton

Mode: Compression, Peak Reduction 80.

STFT time
Input Target Prediciton

Softube FET - ED Audio Examples

Attack Time: 0.5 ms, Release Time: 0,005 s, Ratio 10:1, Threshold 20

STFT time
Input Target Prediciton

U-he Presswerk - ED Audio Examples

Attack Time: 0.5 ms, Release Time: 0,005 s, Ratio 10:1, Threshold 20

STFT time
Input Target Prediciton

PSP MicroComp - ED Audio Examples

Attack Time: 0.5 ms, Release Time: 0,005 s, Ratio 10:1, Threshold 20

STFT time
Input Target Prediciton

TubeTech CL 1B - TCN Audio Examples

Attack Time: 150 ms, Release Time: 0,005 s, Ratio 6:1, Threshold -20dBu.

STFT time
Input Target Prediciton

Attack Time: 150 ms, Release Time: 0,005 s, Ratio 6:1, Threshold -20dBu.

STFT time
Input Target Prediciton

BibTeX

@inproceedings{simionato2023fully,
  title={Fully conditioned and low-latency black-box modeling of analog compression},
  author={Simionato, Riccardo and Fasciani, Stefano},
  booktitle={Proceedings of the International Conference on Digital Audio Effects},
  year={2023},
  organization={DAFx Board}
}