Abstract
In this paper, we present a real-time blind source separation (BSS) algorithm, which unifies the independent vector analysis (IVA) as a spatial model and a deep neural network (DNN) as a source model. The auxiliary-function based IVA (Aux-IVA) is utilized to update the demixing matrix, and the required time-varying variance of the speech source is estimated by a DNN. The DNN could provide a more accurate source model, which then helps to optimize the spatial model. In addition, because the DNN is used to estimate the source variance instead of the source power spectrogram, the size of DNN can be reduced significantly. Experiment results show that the joint utilization of the model-based approach and the data-driven approach provides a more efficient solution than just alone in terms of convergence rate and source separation performance.
Original language | English |
---|---|
Title of host publication | 2021 IEEE Spoken Language Technology Workshop, SLT 2021 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 665-669 |
Number of pages | 5 |
ISBN (Electronic) | 9781728170664 |
DOIs | |
Publication status | Published - Jan 19 2021 |
Externally published | Yes |
Event | 2021 IEEE Spoken Language Technology Workshop, SLT 2021 - Virtual, Shenzhen, China Duration: Jan 19 2021 → Jan 22 2021 |
Publication series
Name | 2021 IEEE Spoken Language Technology Workshop, SLT 2021 - Proceedings |
---|
Conference
Conference | 2021 IEEE Spoken Language Technology Workshop, SLT 2021 |
---|---|
Country/Territory | China |
City | Virtual, Shenzhen |
Period | 1/19/21 → 1/22/21 |
Bibliographical note
Publisher Copyright:© 2021 IEEE.
ASJC Scopus Subject Areas
- Linguistics and Language
- Language and Linguistics
- Artificial Intelligence
- Computer Science Applications
- Computer Vision and Pattern Recognition
- Hardware and Architecture
Keywords
- Blind source separation
- deep neural net-work
- real-time