Acoustic event recognition using dominant spectral basis vectors

Woohyun Choi, Sangwook Park, David K. Han, Hanseok Ko

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

This paper proposes a novel filter bank composed of dominant Spectral Basis Vectors (SBVs) in a spectrogram. Spectral envelopes represented by the SBVs have shown to be excellent characteristic features for discriminating different acoustic events in noisy environment. Non-negative Matrix Factorization (NMF) and non-negative K-SVD (NKSVD) for part-based and holistic representations extract dominant SBVs from a spectrogram. The effectiveness of the proposed method is demonstrated on a database of real life recordings via experiments, and its robust performance is compared to conventional methods.

Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
PublisherInternational Speech and Communication Association
Pages2002-2006
Number of pages5
Volume2015-January
Publication statusPublished - 2015
Event16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany
Duration: 2015 Sep 62015 Sep 10

Other

Other16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015
CountryGermany
CityDresden
Period15/9/615/9/10

Fingerprint

Acoustics
Spectrogram
Filter banks
Singular value decomposition
Factorization
Non-negative Matrix Factorization
Robust Performance
Filter Banks
Envelope
Non-negative
Spectrality
Experiments
Experiment

Keywords

  • Acoustic event recognition
  • Dictionary learning
  • Dominant spectral basis vector
  • K-SVD
  • Non-negative matrix factorization
  • Robust feature extraction
  • Spectral envelope

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation

Cite this

Choi, W., Park, S., Han, D. K., & Ko, H. (2015). Acoustic event recognition using dominant spectral basis vectors. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Vol. 2015-January, pp. 2002-2006). International Speech and Communication Association.

Acoustic event recognition using dominant spectral basis vectors. / Choi, Woohyun; Park, Sangwook; Han, David K.; Ko, Hanseok.

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Vol. 2015-January International Speech and Communication Association, 2015. p. 2002-2006.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Choi, W, Park, S, Han, DK & Ko, H 2015, Acoustic event recognition using dominant spectral basis vectors. in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. vol. 2015-January, International Speech and Communication Association, pp. 2002-2006, 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, Dresden, Germany, 15/9/6.
Choi W, Park S, Han DK, Ko H. Acoustic event recognition using dominant spectral basis vectors. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Vol. 2015-January. International Speech and Communication Association. 2015. p. 2002-2006
Choi, Woohyun ; Park, Sangwook ; Han, David K. ; Ko, Hanseok. / Acoustic event recognition using dominant spectral basis vectors. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Vol. 2015-January International Speech and Communication Association, 2015. pp. 2002-2006
@inproceedings{6d8d00eccefb4cd78ff53d7e41d133b9,
title = "Acoustic event recognition using dominant spectral basis vectors",
abstract = "This paper proposes a novel filter bank composed of dominant Spectral Basis Vectors (SBVs) in a spectrogram. Spectral envelopes represented by the SBVs have shown to be excellent characteristic features for discriminating different acoustic events in noisy environment. Non-negative Matrix Factorization (NMF) and non-negative K-SVD (NKSVD) for part-based and holistic representations extract dominant SBVs from a spectrogram. The effectiveness of the proposed method is demonstrated on a database of real life recordings via experiments, and its robust performance is compared to conventional methods.",
keywords = "Acoustic event recognition, Dictionary learning, Dominant spectral basis vector, K-SVD, Non-negative matrix factorization, Robust feature extraction, Spectral envelope",
author = "Woohyun Choi and Sangwook Park and Han, {David K.} and Hanseok Ko",
year = "2015",
language = "English",
volume = "2015-January",
pages = "2002--2006",
booktitle = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",
publisher = "International Speech and Communication Association",

}

TY - GEN

T1 - Acoustic event recognition using dominant spectral basis vectors

AU - Choi, Woohyun

AU - Park, Sangwook

AU - Han, David K.

AU - Ko, Hanseok

PY - 2015

Y1 - 2015

N2 - This paper proposes a novel filter bank composed of dominant Spectral Basis Vectors (SBVs) in a spectrogram. Spectral envelopes represented by the SBVs have shown to be excellent characteristic features for discriminating different acoustic events in noisy environment. Non-negative Matrix Factorization (NMF) and non-negative K-SVD (NKSVD) for part-based and holistic representations extract dominant SBVs from a spectrogram. The effectiveness of the proposed method is demonstrated on a database of real life recordings via experiments, and its robust performance is compared to conventional methods.

AB - This paper proposes a novel filter bank composed of dominant Spectral Basis Vectors (SBVs) in a spectrogram. Spectral envelopes represented by the SBVs have shown to be excellent characteristic features for discriminating different acoustic events in noisy environment. Non-negative Matrix Factorization (NMF) and non-negative K-SVD (NKSVD) for part-based and holistic representations extract dominant SBVs from a spectrogram. The effectiveness of the proposed method is demonstrated on a database of real life recordings via experiments, and its robust performance is compared to conventional methods.

KW - Acoustic event recognition

KW - Dictionary learning

KW - Dominant spectral basis vector

KW - K-SVD

KW - Non-negative matrix factorization

KW - Robust feature extraction

KW - Spectral envelope

UR - http://www.scopus.com/inward/record.url?scp=84946584270&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84946584270&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84946584270

VL - 2015-January

SP - 2002

EP - 2006

BT - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

PB - International Speech and Communication Association

ER -