Abstract
The need for dense spatial sampling can be alleviated by predicting room impulse responses (RIRs) at additional locations in the space from measurements obtained using limited microphones. In this paper, a data-driven approach for RIR reconstruction is proposed. The proposed method utilizes an improved time domain U-Net network, which enhances the ability of the network for feature learning through the optimization of the fundamental blocks. In the proposed method, the control region is divided into square blocks. The RIRs at the four vertices of each square are inputted into the proposed U-Net network, which aims to predict the RIR at the central position of the square. The converged U-Net network can reconstruct RIRs for supplementary positions. Experiments conducted on a recorded RIR dataset containing multiple room scenarios demonstrate that the proposed method outperforms the traditional singular value decomposition technique in terms of the RIR reconstruction performance. Additionally, the performance of the proposed U-Net network surpasses that of the original U-Net, CRN and DCCRN networks. In a practical scenario concerning sound zone reproduction, the utilization of the proposed method for predicting RIRs has resulted in higher acoustic contrast between the listening and quiet zones with fewer microphones. This further validates the merits of the proposed approach.
Translated title of the contribution | Room impulse response reconstruction with improved time domain U-Net network |
---|---|
Original language | Chinese (Simplified) |
Pages (from-to) | 323-331 |
Number of pages | 9 |
Journal | Shengxue Xuebao/Acta Acustica |
Volume | 50 |
Issue number | 2 |
DOIs | |
Publication status | Published - Mar 2025 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2025 Science Press. All rights reserved.
ASJC Scopus Subject Areas
- Acoustics and Ultrasonics
Keywords
- Array signal processing
- Deep learning
- Microphone array
- Room impulse response