Unambiguous speech DOA estimation under spatial aliasing conditions

Vinod Veera Reddy, Andy W.H. Khong, Boon Poh Ng

Research output: Contribution to journal › Article › peer-review

34 Citations (Scopus)

Abstract

With the bandwidth of speech signals extending over several octaves, the spatial Nyquist criterion constrains the microphone array design. Violating this criterion by increasing microphone spacing in order to achieve high resolution introduces ambiguity in identifying the source directions due to the aliasing components. In this work, we investigate the effect of spatial aliasing on the direction-of-arrival (DOA) spectrum due to wideband sources. Noting that the extent of aliasing is frequency dependent, we propose a multi-stage scheme for speech DOA estimation following a subband decomposition. To observe the advantage of this scheme, we verify it with the steered minimum variance distortionless response (STMV) and approximate kernel density estimators. The performance is evaluated with simulations and recorded room impulse responses.
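To illustrate why the extent of aliasing depends on frequency, the following minimal sketch uses the standard far-field model of a uniform linear array. It is not the multi-stage scheme proposed in the paper. The function names, the 343 m/s speed of sound, and the 10 cm spacing are assumptions chosen for illustration. Under this model, a source at angle θ from broadside is indistinguishable from directions θ' satisfying sin θ' = sin θ + k·c/(f·d) for nonzero integers k, and no such direction exists for any θ when f is below c/(2d).

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def spatial_nyquist_frequency(spacing_m, c=SPEED_OF_SOUND):
    """Highest frequency free of spatial aliasing for a given microphone spacing."""
    return c / (2.0 * spacing_m)


def ambiguous_directions(theta_deg, freq_hz, spacing_m, c=SPEED_OF_SOUND):
    """Aliased directions (degrees from broadside) for a far-field narrowband
    source on a uniform linear array (hypothetical helper for illustration)."""
    s = math.sin(math.radians(theta_deg))
    shift = c / (freq_hz * spacing_m)  # spacing of aliases in sin(theta)
    k_max = int(2.0 / shift)           # sin(theta) spans a range of width 2
    aliases = []
    for k in range(-k_max, k_max + 1):
        if k == 0:
            continue  # k = 0 is the true direction
        s_alias = s + k * shift
        if abs(s_alias) <= 1.0:
            aliases.append(math.degrees(math.asin(s_alias)))
    return sorted(aliases)


if __name__ == "__main__":
    d = 0.10  # assumed spacing in metres
    print("aliasing-free up to", spatial_nyquist_frequency(d), "Hz")
    for f in (1000.0, 4000.0):
        print(f, "Hz ->", ambiguous_directions(0.0, f, d))
```

With these assumed values, the low-frequency subband yields no ambiguous directions while the high-frequency subband does, which is the frequency dependence that motivates processing subbands separately.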

Original language: English
Pages (from-to): 2133-2145
Number of pages: 13
Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Volume: 22
Issue number: 12
Publication status: Published - Dec 1 2014
Externally published: Yes

Bibliographical note

Publisher Copyright: © 2014 IEEE.

ASJC Scopus Subject Areas

  • Computer Science (miscellaneous)
  • Acoustics and Ultrasonics
  • Computational Mathematics
  • Electrical and Electronic Engineering

Keywords

  • DOA estimation
  • Spatial aliasing
