Pronunciation similarity estimation for spoken language learning

Donghyun Kim, Dongsuk Yook

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents an approach for estimating pronunciation similarity between two speakers using the cepstral distance. General speech recognition systems have been used to find the matched words of a speaker, using the acoustical score of a speech signal and the grammatical score of a word sequence. In the case of learning a language, for a speaker with impaired hearing, it is not easy to estimate the pronunciation similarity using automatic speech recognition systems, as this requires more information of pronouncing characteristics, than information on word matching. This is a new challenge for computer aided pronunciation learning. The dynamic time warping algorithm is used for cepstral distance computation between two speech data with codebook distance subtracted to consider the characteristics of each speaker. The experiments evaluated on the Korean fundamental vowel set show that the similarity of two speaker's pronunciation can be efficiently computed using computers.

Original languageEnglish
Title of host publicationComputer Processing of Oriental Languages - Beyond the Orient
Subtitle of host publicationThe Research Challenges Ahead - 21st International Conference, ICCPOL 2006, Proceedings
Pages442-449
Number of pages8
DOIs
Publication statusPublished - 2006
Event21st International Conference on Computer Processing of Oriental Languages: Beyond the Orient: The Research Challenges Ahead, ICCPOL 2006 - Singapore, Singapore
Duration: 2006 Dec 172006 Dec 19

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4285 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other21st International Conference on Computer Processing of Oriental Languages: Beyond the Orient: The Research Challenges Ahead, ICCPOL 2006
CountrySingapore
CitySingapore
Period06/12/1706/12/19

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'Pronunciation similarity estimation for spoken language learning'. Together they form a unique fingerprint.

  • Cite this

    Kim, D., & Yook, D. (2006). Pronunciation similarity estimation for spoken language learning. In Computer Processing of Oriental Languages - Beyond the Orient: The Research Challenges Ahead - 21st International Conference, ICCPOL 2006, Proceedings (pp. 442-449). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4285 LNAI). https://doi.org/10.1007/11940098_46