Diff-MST^C: A Mixing Style Transfer Prototype for Cubase

Soumya Sai Vanka (QMUL)*, Lennart Hannink (Steinberg Media Technologies GmbH), Jean-Baptiste Rolland (Steinberg Media Technologies GmbH), George Fazekas (QMUL)

This paper will be presented in person

Abstract:

In our demo, participants are invited to explore the Diff-MST^C prototype, which integrates the Diff-MST model into Steinberg's digital audio workstation (DAW), Cubase. Diff-MST, a deep learning model for mixing style transfer, forecasts mixing console parameters for tracks using a reference song. The system processes up to 20 raw tracks along with a reference song to predict mixing console parameters that can be used to create an initial mix. Users have the option to manually adjust these parameters further for greater control. In contrast to earlier deep learning systems that are limited to research ideas, Diff-MST^C is a first-of-its-kind prototype integrated into a DAW. This integration facilitates mixing decisions on multitracks and lets users input context through a reference song, followed by fine-tuning of audio effects in a traditional manner.