Full azimuth multiple sound source localization with 3-channel microphone array

Suwon Shon, David K. Han, Jounghoon Beh, Hanseok Ko

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

This paper describes a method for estimating Direction Of Arrival (DOA) of multiple sound sources in full azimuth with three microphones. Estimating DOA with paired microphone arrays creates imaginary sound sources because of time delay of arrival (TDOA) being identical between real and imaginary sources. Imaginary sound sources can create chronic problems in multiple Sound Source Localization (SSL), because they can be localized as real sound sources. Our proposed approach is based on the observation that each microphone array creates imaginary sound sources, but the DOA of imaginary sources may be different depending on the orientation of the paired microphone array. With the fact that a real source would always be localized in the same direction regardless of the array orientation, we can suppress the imaginary sound sources by minimum filtering based on Steered Response Power\-Phase Transform (SRP-PHAT) method. A set of experiments conducted in a real noisy environment showed that the proposed method was accurate in localizing multiple sound sources.

Original languageEnglish
Pages (from-to)745-750
Number of pages6
JournalIEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
VolumeE95-A
Issue number4
DOIs
Publication statusPublished - 2012 Apr 1

Fingerprint

Microphone Array
Source Localization
Azimuth
Microphones
Acoustic waves
Direction of arrival
Direction of Arrival
Sound
Time delay
Mathematical transformations
Time Delay
Filtering
Transform

Keywords

  • Multichannel array processing
  • Sound source localization (SSL)
  • SRP-PHAT

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Computer Graphics and Computer-Aided Design
  • Applied Mathematics
  • Signal Processing

Cite this

Full azimuth multiple sound source localization with 3-channel microphone array. / Shon, Suwon; Han, David K.; Beh, Jounghoon; Ko, Hanseok.

In: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, Vol. E95-A, No. 4, 01.04.2012, p. 745-750.

Research output: Contribution to journalArticle

@article{629ac3ce250f4a9cb0bb123e91892d10,
title = "Full azimuth multiple sound source localization with 3-channel microphone array",
abstract = "This paper describes a method for estimating Direction Of Arrival (DOA) of multiple sound sources in full azimuth with three microphones. Estimating DOA with paired microphone arrays creates imaginary sound sources because of time delay of arrival (TDOA) being identical between real and imaginary sources. Imaginary sound sources can create chronic problems in multiple Sound Source Localization (SSL), because they can be localized as real sound sources. Our proposed approach is based on the observation that each microphone array creates imaginary sound sources, but the DOA of imaginary sources may be different depending on the orientation of the paired microphone array. With the fact that a real source would always be localized in the same direction regardless of the array orientation, we can suppress the imaginary sound sources by minimum filtering based on Steered Response Power\-Phase Transform (SRP-PHAT) method. A set of experiments conducted in a real noisy environment showed that the proposed method was accurate in localizing multiple sound sources.",
keywords = "Multichannel array processing, Sound source localization (SSL), SRP-PHAT",
author = "Suwon Shon and Han, {David K.} and Jounghoon Beh and Hanseok Ko",
year = "2012",
month = "4",
day = "1",
doi = "10.1587/transfun.E95.A.745",
language = "English",
volume = "E95-A",
pages = "745--750",
journal = "IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences",
issn = "0916-8508",
publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",
number = "4",

}

TY - JOUR

T1 - Full azimuth multiple sound source localization with 3-channel microphone array

AU - Shon, Suwon

AU - Han, David K.

AU - Beh, Jounghoon

AU - Ko, Hanseok

PY - 2012/4/1

Y1 - 2012/4/1

N2 - This paper describes a method for estimating Direction Of Arrival (DOA) of multiple sound sources in full azimuth with three microphones. Estimating DOA with paired microphone arrays creates imaginary sound sources because of time delay of arrival (TDOA) being identical between real and imaginary sources. Imaginary sound sources can create chronic problems in multiple Sound Source Localization (SSL), because they can be localized as real sound sources. Our proposed approach is based on the observation that each microphone array creates imaginary sound sources, but the DOA of imaginary sources may be different depending on the orientation of the paired microphone array. With the fact that a real source would always be localized in the same direction regardless of the array orientation, we can suppress the imaginary sound sources by minimum filtering based on Steered Response Power\-Phase Transform (SRP-PHAT) method. A set of experiments conducted in a real noisy environment showed that the proposed method was accurate in localizing multiple sound sources.

AB - This paper describes a method for estimating Direction Of Arrival (DOA) of multiple sound sources in full azimuth with three microphones. Estimating DOA with paired microphone arrays creates imaginary sound sources because of time delay of arrival (TDOA) being identical between real and imaginary sources. Imaginary sound sources can create chronic problems in multiple Sound Source Localization (SSL), because they can be localized as real sound sources. Our proposed approach is based on the observation that each microphone array creates imaginary sound sources, but the DOA of imaginary sources may be different depending on the orientation of the paired microphone array. With the fact that a real source would always be localized in the same direction regardless of the array orientation, we can suppress the imaginary sound sources by minimum filtering based on Steered Response Power\-Phase Transform (SRP-PHAT) method. A set of experiments conducted in a real noisy environment showed that the proposed method was accurate in localizing multiple sound sources.

KW - Multichannel array processing

KW - Sound source localization (SSL)

KW - SRP-PHAT

UR - http://www.scopus.com/inward/record.url?scp=84859395405&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84859395405&partnerID=8YFLogxK

U2 - 10.1587/transfun.E95.A.745

DO - 10.1587/transfun.E95.A.745

M3 - Article

VL - E95-A

SP - 745

EP - 750

JO - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

JF - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

SN - 0916-8508

IS - 4

ER -