Blind estimation of audio effects using an auto-encoder approach and differentiable digital signal processing - Département Image, Données, Signal
Conference Papers Year : 2024

Blind estimation of audio effects using an auto-encoder approach and differentiable digital signal processing

Estimation aveugle d'effets audios avec une approche auto-encodeur utilisant du traitement du signal différentiable

Abstract

Blind Estimation of Audio Effects (BE-AFX) aims at estimating the audio effects (AFXs) applied to an original, unprocessed audio sample solely based on the processed audio sample. To train such a system traditional approaches optimize a loss between ground truth and estimated AFX parameters. This involves knowing the exact implementation of the AFXs used for the process. In this work, we propose an alternative solution that eliminates the requirement for knowing this implementation. Instead, we introduce an auto-encoder approach, which optimizes an audio quality metric. We explore, suggest, and compare various implementations of commonly used mastering AFXs, using differential signal processing or neural approximations. Our findings demonstrate that our auto-encoder approach yields superior estimates of the audio quality produced by a chain of AFXs, compared to the traditional parameter-based approach, even if the latter provides a more accurate parameter estimation.
Fichier principal
Vignette du fichier
Peladeau - ICASSP2024 - Hal Version.pdf (298.42 Ko) Télécharger le fichier
Origin Publisher files allowed on an open archive

Dates and versions

hal-04539329 , version 1 (09-04-2024)

Identifiers

Cite

Côme Peladeau, Geoffroy Peeters. Blind estimation of audio effects using an auto-encoder approach and differentiable digital signal processing. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul, South Korea. pp.856-860, ⟨10.1109/ICASSP48485.2024.10448301⟩. ⟨hal-04539329⟩
137 View
103 Download

Altmetric

Share

More