Abstract
This paper proposes an effective method to remedy the acoustic modeling problem inherent in the usual log-normal PMC intended for achieving robust speech recognition. In particular, the Gaussian kernels under the prescribed log-normal PMC cannot sufficiently express the corrupted speech distributions. The proposed scheme corrects this deficiency by judicially selecting the "fairly" corrupted component and by re-estimating it as a mixture of two distributions using data-driven PMC. As a result, some components become merged while equal number of components split. The determination for splitting or merging is achieved by means of measuring the similarity of corrupted speech model to those of clean model and noise model. The experimental results indicate that the suggested algorithm is effective in representing the corrupted speech distributions and attains consistent improvement over various SNR and noise cases.
Original language | English |
---|---|
Pages (from-to) | 4176 |
Number of pages | 1 |
Journal | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
Volume | 4 |
DOIs | |
Publication status | Published - 2002 |
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering