SINES, TRANSIENT, NOISE NEURAL MODELING OF PIANO NOTES

University of Oslo

Abstract

This paper introduces a novel method for emulating piano sounds. We propose to exploit the sines, transient, and noise decomposition to design a differentiable spectral modeling synthesizer that replicates piano notes. Three sub-modules learn these components from piano recordings and generate the corresponding harmonic, transient, and noise signals. Splitting the emulation into three independently trainable models reduces the complexity of each modeling task. The quasi-harmonic content is produced by a differentiable sinusoidal model guided by physics-derived formulas, whose parameters are automatically estimated from audio recordings. The noise sub-module uses a learnable time-varying filter, and the transients are generated by a deep convolutional network. Starting from single notes, we emulate the coupling between different keys in trichords with a convolution-based network. Results show that the model matches the partial distribution of the target, while predicting the energy in the higher part of the spectrum is more challenging. The energy distribution in the spectra of the transient and noise components is accurate overall. While the model is more computationally and memory efficient, perceptual tests reveal limitations in accurately modeling the attack phase of notes. Despite this, it generally achieves perceptual accuracy in emulating single notes and trichords.
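To make the sines-plus-noise idea concrete, the following is a minimal, non-learned sketch in plain Python. It is not the paper's implementation. It assumes the textbook stiff-string inharmonicity formula f_k = k * f0 * sqrt(1 + B * k^2) as one example of a physics-derived formula, uses hand-picked exponential decays in place of learned parameters, and replaces the learnable time-varying filter with a one-pole low-pass whose coefficient changes over time. The transient sub-module is omitted. All function names and parameter values are hypothetical.

```python
import math
import random


def partial_frequencies(f0, inharmonicity, n_partials):
    """Stiff-string partials: f_k = k * f0 * sqrt(1 + B * k^2) (assumed formula)."""
    return [k * f0 * math.sqrt(1.0 + inharmonicity * k * k)
            for k in range(1, n_partials + 1)]


def synthesize_sines(f0, inharmonicity, n_partials, decay, duration, sample_rate):
    """Sum exponentially decaying quasi-harmonic partials below Nyquist."""
    freqs = [f for f in partial_frequencies(f0, inharmonicity, n_partials)
             if f < sample_rate / 2]
    signal = []
    for n in range(int(duration * sample_rate)):
        t = n / sample_rate
        sample = 0.0
        for k, f in enumerate(freqs, start=1):
            # Hand-picked envelope: higher partials are quieter and decay faster.
            amplitude = math.exp(-decay * k * t) / k
            sample += amplitude * math.sin(2.0 * math.pi * f * t)
        signal.append(sample)
    return signal


def synthesize_noise(n_samples, sample_rate, decay, seed=0):
    """White noise through a time-varying one-pole low-pass (stand-in filter)."""
    rng = random.Random(seed)
    duration = n_samples / sample_rate
    out, state = [], 0.0
    for n in range(n_samples):
        t = n / sample_rate
        # Coefficient ramps from 0.5 to 0.99, so the noise gets darker over time.
        a = 0.5 + 0.49 * (t / duration)
        state = (1.0 - a) * rng.uniform(-1.0, 1.0) + a * state
        out.append(math.exp(-decay * t) * state)
    return out


def synthesize_note(f0, inharmonicity=1e-4, n_partials=8, decay=3.0,
                    duration=0.5, sample_rate=8000, noise_gain=0.05):
    """Mix the sinusoidal and noise components (no transient component here)."""
    sines = synthesize_sines(f0, inharmonicity, n_partials, decay,
                             duration, sample_rate)
    noise = synthesize_noise(len(sines), sample_rate, decay)
    return [s + noise_gain * x for s, x in zip(sines, noise)]
```

For example, `synthesize_note(130.81)` would produce half a second of a rough C3-like tone. In the approach described above, quantities such as the inharmonicity coefficient, the partial amplitudes and decays, and the filter response would be estimated from recordings rather than fixed by hand.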

BiViB Dataset - Upright Piano

Parameters Real Model

Key: C3

Key: C#3

Key: D3

Key: D#3

Key: E3

Key: F3

Key: F#3

Key: G3

Key: G#3

Key: A3

Key: A#3

Key: B3

Key: C4

Key: C#4

Key: D4

Key: D#4

Key: E4

Key: F4

Key: G4

Key: G#4

Key: A4

Key: A#4

Key: B4

BiViB Dataset - Grand Piano

Parameters Real Model

Key: C3

Key: C#3

Key: D3

Key: D#3

Key: E3

Key: F3

Key: F#3

Key: G3

Key: G#3

Key: A3

Key: A#3

Key: B3

Key: C4

Key: C#4

Key: D4

Key: D#4

Key: E4

Key: F4

Key: G4

Key: G#4

Key: A4

Key: A#4

Key: B4

Chords (Minor) Dataset

Parameters Real Sum of the Notes Model

Chord: A#3, Velocity: 70

Chord: B3, Velocity: 110

Chord: C4, Velocity: 80

Chord: C#4, Velocity: 70

Chord: D3, Velocity: 60

Chord: D4, Velocity: 100

Chord: D#3, Velocity: 80

Chord: D#4, Velocity: 70

Chord: G3, Velocity: 80

BibTeX

@article{simionato2025sines,
  title={Sines, transient, noise neural modeling of piano notes},
  author={Simionato, Riccardo and Fasciani, Stefano},
  journal={Frontiers in Signal Processing},
  volume={4},
  pages={1494864},
  year={2025},
  publisher={Frontiers Media SA}
}