广义高斯分布的卷积传递函数多通道非负矩阵分解

Cong Zhang; Feiran Yang; Xianmei Chen; Jun Yang

doi:10.12395/0371-0025.2023009

广义高斯分布的卷积传递函数多通道非负矩阵分解

Translated title of the contribution: Convolution transfer function-based multi-channel non-negative matrix factorization using generalized Gaussian distributions

Cong Zhang, Feiran Yang^*, Xianmei Chen, Jun Yang

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

The convolution transfer function-based multi-channel non-negative matrix factorization (CTF-MNMF) has been shown to perform well in blind source separation in highly reverberant environments, but its effectiveness may be limited by the source model. An improved version of the CTF-MNMF is proposed, where the generalized Gaussian distribution (GGD) is used as the source model. The domain parameter is introduced into the NMF and the generalized NMF (GNMF) is utilized to model the nonnegative scale factors of the GGD, which enhances the robustness of the source model in capturing signal outliers, and thus improves the accuracy of source estimation. An auxiliary function-based method is used to derive an improved formula for updating the separated matrix and non-negative matrix parameters. Simulation results shows that the proposed algorithm achieves better separation performance than the GGD-ILRMA, WPE-ILRMA, CTF-MNMF algorithms for both speech and music input signals.

Translated title of the contribution	Convolution transfer function-based multi-channel non-negative matrix factorization using generalized Gaussian distributions
Original language	Chinese (Simplified)
Pages (from-to)	598-610
Number of pages	13
Journal	Shengxue Xuebao/Acta Acustica
Volume	49
Issue number	3
DOIs	https://doi.org/10.12395/0371-0025.2023009
Publication status	Published - May 2024
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 2024 Science Press. All rights reserved.

ASJC Scopus Subject Areas

Acoustics and Ultrasonics

Keywords

Blind source separation
Convolution transfer function
Generalized Gaussian distribution
Non-negative matrix factorization

Access to Document

10.12395/0371-0025.2023009

Cite this

@article{854b2cbd3e1140739fc0ec81869d3a05,

title = "广义高斯分布的卷积传递函数多通道非负矩阵分解",

abstract = "The convolution transfer function-based multi-channel non-negative matrix factorization (CTF-MNMF) has been shown to perform well in blind source separation in highly reverberant environments, but its effectiveness may be limited by the source model. An improved version of the CTF-MNMF is proposed, where the generalized Gaussian distribution (GGD) is used as the source model. The domain parameter is introduced into the NMF and the generalized NMF (GNMF) is utilized to model the nonnegative scale factors of the GGD, which enhances the robustness of the source model in capturing signal outliers, and thus improves the accuracy of source estimation. An auxiliary function-based method is used to derive an improved formula for updating the separated matrix and non-negative matrix parameters. Simulation results shows that the proposed algorithm achieves better separation performance than the GGD-ILRMA, WPE-ILRMA, CTF-MNMF algorithms for both speech and music input signals.",

keywords = "Blind source separation, Convolution transfer function, Generalized Gaussian distribution, Non-negative matrix factorization",

author = "Cong Zhang and Feiran Yang and Xianmei Chen and Jun Yang",

year = "2024",

month = may,

doi = "10.12395/0371-0025.2023009",

language = "Chinese (Simplified)",

volume = "49",

pages = "598--610",

journal = "Shengxue Xuebao/Acta Acustica",

issn = "0371-0025",

publisher = "Science Press",

number = "3",

}

TY - JOUR

T1 - 广义高斯分布的卷积传递函数多通道非负矩阵分解

AU - Zhang, Cong

AU - Yang, Feiran

AU - Chen, Xianmei

AU - Yang, Jun

PY - 2024/5

Y1 - 2024/5

N2 - The convolution transfer function-based multi-channel non-negative matrix factorization (CTF-MNMF) has been shown to perform well in blind source separation in highly reverberant environments, but its effectiveness may be limited by the source model. An improved version of the CTF-MNMF is proposed, where the generalized Gaussian distribution (GGD) is used as the source model. The domain parameter is introduced into the NMF and the generalized NMF (GNMF) is utilized to model the nonnegative scale factors of the GGD, which enhances the robustness of the source model in capturing signal outliers, and thus improves the accuracy of source estimation. An auxiliary function-based method is used to derive an improved formula for updating the separated matrix and non-negative matrix parameters. Simulation results shows that the proposed algorithm achieves better separation performance than the GGD-ILRMA, WPE-ILRMA, CTF-MNMF algorithms for both speech and music input signals.

AB - The convolution transfer function-based multi-channel non-negative matrix factorization (CTF-MNMF) has been shown to perform well in blind source separation in highly reverberant environments, but its effectiveness may be limited by the source model. An improved version of the CTF-MNMF is proposed, where the generalized Gaussian distribution (GGD) is used as the source model. The domain parameter is introduced into the NMF and the generalized NMF (GNMF) is utilized to model the nonnegative scale factors of the GGD, which enhances the robustness of the source model in capturing signal outliers, and thus improves the accuracy of source estimation. An auxiliary function-based method is used to derive an improved formula for updating the separated matrix and non-negative matrix parameters. Simulation results shows that the proposed algorithm achieves better separation performance than the GGD-ILRMA, WPE-ILRMA, CTF-MNMF algorithms for both speech and music input signals.

KW - Blind source separation

KW - Convolution transfer function

KW - Generalized Gaussian distribution

KW - Non-negative matrix factorization

UR - http://www.scopus.com/inward/record.url?scp=85192075351&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85192075351&partnerID=8YFLogxK

U2 - 10.12395/0371-0025.2023009

DO - 10.12395/0371-0025.2023009

M3 - Article

AN - SCOPUS:85192075351

SN - 0371-0025

VL - 49

SP - 598

EP - 610

JO - Shengxue Xuebao/Acta Acustica

JF - Shengxue Xuebao/Acta Acustica

IS - 3

ER -

广义高斯分布的卷积传递函数多通道非负矩阵分解

Abstract

Bibliographical note

ASJC Scopus Subject Areas

Keywords

Access to Document

Other files and links

Fingerprint

Cite this