A Deep Hybrid Model for Stereophonic Acoustic Echo Control

Yang Liu; Sichen Liu; Feiran Yang; Jun Yang

doi:10.1007/s00034-024-02807-x

A Deep Hybrid Model for Stereophonic Acoustic Echo Control

Yang Liu, Sichen Liu, Feiran Yang^*, Jun Yang^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

This paper proposes a deep hybrid model for stereophonic acoustic echo cancellation (SAEC). A two-stage model is considered, i.e., a deep-learning-based Kalman filter (DeepKalman) and a gated convolutional recurrent network (GCRN)-based postfilter, which are jointly trained in an end-to-end manner. The difference between the proposed DeepKalman filter and the conventional one is twofold. First, the input signal of the DeepKalman filter is a combination of the original two far-end signals and the nonlinear reference signal estimated from the microphone signal directly. Second, a low-complexity recurrent neural network is utilized to estimate the covariance of the process noise, which can enhance the tracking capability of the DeepKalman filter. In the second stage, we adopt GCRN to suppress residual echo and noise by estimating complex masks applied to the output signal of the first stage. Computer simulations confirm the performance advantage of the proposed method over existing SAEC algorithms.

Original language	English
Pages (from-to)	8046-8059
Number of pages	14
Journal	Circuits, Systems, and Signal Processing
Volume	43
Issue number	12
DOIs	https://doi.org/10.1007/s00034-024-02807-x
Publication status	Published - Dec 2024
Externally published	Yes

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

ASJC Scopus Subject Areas

Signal Processing
Applied Mathematics

Keywords

Deep learning
Kalman filter
Nonlinear distortion
Stereophonic acoustic echo cancellation

Access to Document

10.1007/s00034-024-02807-x

Cite this

@article{50dad894006e46098e7f6d89bc4e519d,

title = "A Deep Hybrid Model for Stereophonic Acoustic Echo Control",

abstract = "This paper proposes a deep hybrid model for stereophonic acoustic echo cancellation (SAEC). A two-stage model is considered, i.e., a deep-learning-based Kalman filter (DeepKalman) and a gated convolutional recurrent network (GCRN)-based postfilter, which are jointly trained in an end-to-end manner. The difference between the proposed DeepKalman filter and the conventional one is twofold. First, the input signal of the DeepKalman filter is a combination of the original two far-end signals and the nonlinear reference signal estimated from the microphone signal directly. Second, a low-complexity recurrent neural network is utilized to estimate the covariance of the process noise, which can enhance the tracking capability of the DeepKalman filter. In the second stage, we adopt GCRN to suppress residual echo and noise by estimating complex masks applied to the output signal of the first stage. Computer simulations confirm the performance advantage of the proposed method over existing SAEC algorithms.",

keywords = "Deep learning, Kalman filter, Nonlinear distortion, Stereophonic acoustic echo cancellation",

author = "Yang Liu and Sichen Liu and Feiran Yang and Jun Yang",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.",

year = "2024",

month = dec,

doi = "10.1007/s00034-024-02807-x",

language = "English",

volume = "43",

pages = "8046--8059",

journal = "Circuits, Systems, and Signal Processing",

issn = "0278-081X",

publisher = "Birkhause Boston",

number = "12",

}

TY - JOUR

T1 - A Deep Hybrid Model for Stereophonic Acoustic Echo Control

AU - Liu, Yang

AU - Liu, Sichen

AU - Yang, Feiran

AU - Yang, Jun

N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

PY - 2024/12

Y1 - 2024/12

N2 - This paper proposes a deep hybrid model for stereophonic acoustic echo cancellation (SAEC). A two-stage model is considered, i.e., a deep-learning-based Kalman filter (DeepKalman) and a gated convolutional recurrent network (GCRN)-based postfilter, which are jointly trained in an end-to-end manner. The difference between the proposed DeepKalman filter and the conventional one is twofold. First, the input signal of the DeepKalman filter is a combination of the original two far-end signals and the nonlinear reference signal estimated from the microphone signal directly. Second, a low-complexity recurrent neural network is utilized to estimate the covariance of the process noise, which can enhance the tracking capability of the DeepKalman filter. In the second stage, we adopt GCRN to suppress residual echo and noise by estimating complex masks applied to the output signal of the first stage. Computer simulations confirm the performance advantage of the proposed method over existing SAEC algorithms.

AB - This paper proposes a deep hybrid model for stereophonic acoustic echo cancellation (SAEC). A two-stage model is considered, i.e., a deep-learning-based Kalman filter (DeepKalman) and a gated convolutional recurrent network (GCRN)-based postfilter, which are jointly trained in an end-to-end manner. The difference between the proposed DeepKalman filter and the conventional one is twofold. First, the input signal of the DeepKalman filter is a combination of the original two far-end signals and the nonlinear reference signal estimated from the microphone signal directly. Second, a low-complexity recurrent neural network is utilized to estimate the covariance of the process noise, which can enhance the tracking capability of the DeepKalman filter. In the second stage, we adopt GCRN to suppress residual echo and noise by estimating complex masks applied to the output signal of the first stage. Computer simulations confirm the performance advantage of the proposed method over existing SAEC algorithms.

KW - Deep learning

KW - Kalman filter

KW - Nonlinear distortion

KW - Stereophonic acoustic echo cancellation

UR - http://www.scopus.com/inward/record.url?scp=85200596750&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85200596750&partnerID=8YFLogxK

U2 - 10.1007/s00034-024-02807-x

DO - 10.1007/s00034-024-02807-x

M3 - Article

AN - SCOPUS:85200596750

SN - 0278-081X

VL - 43

SP - 8046

EP - 8059

JO - Circuits, Systems, and Signal Processing

JF - Circuits, Systems, and Signal Processing

IS - 12

ER -

A Deep Hybrid Model for Stereophonic Acoustic Echo Control

Abstract

Bibliographical note

ASJC Scopus Subject Areas

Keywords

Access to Document

Other files and links

Fingerprint

Cite this