Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition

This paper introduces a new method for multichannel speech enhancement based on a versatile modeling of the residual noise spec-trogram. Such a model has already been presented before in the single channel case where the noise component is assumed to follow an alpha-stable distribution for each time-frequency bin, whereas the speech spec-trogram, supposed to be more regular, is modeled as Gaussian. In this paper, we describe a multichannel extension of this model, as well as a Monte Carlo Expectation-Maximisation algorithm for parameter estimation. In particular, a multichannel extension of the Itakura-Saito nonnegative matrix factorization is exploited to estimate the spectral parameters for speech, and a Metropolis-Hastings algorithm is proposed to estimate the noise contribution. We evaluate the proposed method in a challenging multichannel denoising application and compare it to other state-of-the-art algorithms.

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

LVA-ICA2018_046_original_v5.pdf (366.42 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Antoine Liutkus : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01766795

Soumis le : samedi 14 avril 2018-09:58:31

Dernière modification le : jeudi 5 décembre 2024-03:21:56

Dates et versions

lirmm-01766795 , version 1 (14-04-2018)

Identifiants

HAL Id : lirmm-01766795 , version 1
DOI : 10.1007/978-3-319-93764-9_2

Citer

Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Şimşekli, Romain Serizel, et al.. Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition. LVA/ICA 2018 - 14th International Conference on Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. pp.13-23, ⟨10.1007/978-3-319-93764-9_2⟩. ⟨lirmm-01766795⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA PARISTECH ZENITH LIRMM UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD MIPS UNIV-MONTPELLIER LTCI IDS S2A ANR INSTITUT-MINES-TELECOM

681 Consultations

675 Téléchargements