Fast speech adaptation in linear spectral domain for additive and convolutional noise

Donghyun Kim, Dongsuk Yook

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we propose a transform-based adaptation technique for robust speech recognition in unknown environments. It uses maximum likelihood spectral transform (MLST) algorithm with additive and convolutional noise parameters. Previously many adaptation algorithms have been proposed in the cepstral domain. Though the cepstral domain may be appropriate for the speech recognition, it is difficult to handle environmental noise directly in the cepstral domain. Therefore our approach deals with such noise in the linear spectral domain in which speech is directly affected by the noise. As a result, we can use a small number of noise parameters for fast adaptation. The experiments evaluated on the FFMTIMIT corpus shows promising result with only a small number of adaptation data.

Original languageEnglish
Title of host publication8th International Conference on Spoken Language Processing, ICSLP 2004
PublisherInternational Speech Communication Association
Pages2557-2560
Number of pages4
Publication statusPublished - 2004
Event8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
Duration: 2004 Oct 42004 Oct 8

Other

Other8th International Conference on Spoken Language Processing, ICSLP 2004
CountryKorea, Republic of
CityJeju, Jeju Island
Period04/10/404/10/8

Fingerprint

Spectrality
experiment
Speech Recognition
Maximum Likelihood
Experiment

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Kim, D., & Yook, D. (2004). Fast speech adaptation in linear spectral domain for additive and convolutional noise. In 8th International Conference on Spoken Language Processing, ICSLP 2004 (pp. 2557-2560). International Speech Communication Association.

Fast speech adaptation in linear spectral domain for additive and convolutional noise. / Kim, Donghyun; Yook, Dongsuk.

8th International Conference on Spoken Language Processing, ICSLP 2004. International Speech Communication Association, 2004. p. 2557-2560.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kim, D & Yook, D 2004, Fast speech adaptation in linear spectral domain for additive and convolutional noise. in 8th International Conference on Spoken Language Processing, ICSLP 2004. International Speech Communication Association, pp. 2557-2560, 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of, 04/10/4.
Kim D, Yook D. Fast speech adaptation in linear spectral domain for additive and convolutional noise. In 8th International Conference on Spoken Language Processing, ICSLP 2004. International Speech Communication Association. 2004. p. 2557-2560
Kim, Donghyun ; Yook, Dongsuk. / Fast speech adaptation in linear spectral domain for additive and convolutional noise. 8th International Conference on Spoken Language Processing, ICSLP 2004. International Speech Communication Association, 2004. pp. 2557-2560
@inproceedings{0969e73679dd47e685d84b32dfe0dbcb,
title = "Fast speech adaptation in linear spectral domain for additive and convolutional noise",
abstract = "In this paper, we propose a transform-based adaptation technique for robust speech recognition in unknown environments. It uses maximum likelihood spectral transform (MLST) algorithm with additive and convolutional noise parameters. Previously many adaptation algorithms have been proposed in the cepstral domain. Though the cepstral domain may be appropriate for the speech recognition, it is difficult to handle environmental noise directly in the cepstral domain. Therefore our approach deals with such noise in the linear spectral domain in which speech is directly affected by the noise. As a result, we can use a small number of noise parameters for fast adaptation. The experiments evaluated on the FFMTIMIT corpus shows promising result with only a small number of adaptation data.",
author = "Donghyun Kim and Dongsuk Yook",
year = "2004",
language = "English",
pages = "2557--2560",
booktitle = "8th International Conference on Spoken Language Processing, ICSLP 2004",
publisher = "International Speech Communication Association",

}

TY - GEN

T1 - Fast speech adaptation in linear spectral domain for additive and convolutional noise

AU - Kim, Donghyun

AU - Yook, Dongsuk

PY - 2004

Y1 - 2004

N2 - In this paper, we propose a transform-based adaptation technique for robust speech recognition in unknown environments. It uses maximum likelihood spectral transform (MLST) algorithm with additive and convolutional noise parameters. Previously many adaptation algorithms have been proposed in the cepstral domain. Though the cepstral domain may be appropriate for the speech recognition, it is difficult to handle environmental noise directly in the cepstral domain. Therefore our approach deals with such noise in the linear spectral domain in which speech is directly affected by the noise. As a result, we can use a small number of noise parameters for fast adaptation. The experiments evaluated on the FFMTIMIT corpus shows promising result with only a small number of adaptation data.

AB - In this paper, we propose a transform-based adaptation technique for robust speech recognition in unknown environments. It uses maximum likelihood spectral transform (MLST) algorithm with additive and convolutional noise parameters. Previously many adaptation algorithms have been proposed in the cepstral domain. Though the cepstral domain may be appropriate for the speech recognition, it is difficult to handle environmental noise directly in the cepstral domain. Therefore our approach deals with such noise in the linear spectral domain in which speech is directly affected by the noise. As a result, we can use a small number of noise parameters for fast adaptation. The experiments evaluated on the FFMTIMIT corpus shows promising result with only a small number of adaptation data.

UR - http://www.scopus.com/inward/record.url?scp=85009065170&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85009065170&partnerID=8YFLogxK

M3 - Conference contribution

SP - 2557

EP - 2560

BT - 8th International Conference on Spoken Language Processing, ICSLP 2004

PB - International Speech Communication Association

ER -