Abstract
In the acoustic modeling for large vocabulary speech recognition, context-dependent (CD) modeling is essential for realizing both improved recognition performance and rapid search. However, sparse data problem caused by huge number of CD models usually leads the estimated models unreliable. To cope with that, two major context-clustering methods, datadriven and rule-based, have been investigated vigorously. In this paper, we briefly review the two methods and develop a new clustering method based on ID3 decision tree learning algorithm that effectively captures the CD modeling. The proposed scheme essentially constructs a decision rule of preclustered triphones using ID3 algorithm. In particular, the datadriven method is used as a clustering algorithm while its result is used as the learning target of ID3 algorithm. The proposed scheme is shown effective over the database of low unknowncontext ratio in terms of recognition performance. For speakerindependent, task-independent continuous speech recognition task, the proposed method reduced percent accuracy WER by 1.16% comparing to that of the existing rule-based method alone.
Original language | English |
---|---|
Title of host publication | 7th International Conference on Spoken Language Processing, ICSLP 2002 |
Publisher | International Speech Communication Association |
Pages | 2657-2660 |
Number of pages | 4 |
Publication status | Published - 2002 |
Externally published | Yes |
Event | 7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States Duration: 2002 Sept 16 → 2002 Sept 20 |
Other
Other | 7th International Conference on Spoken Language Processing, ICSLP 2002 |
---|---|
Country/Territory | United States |
City | Denver |
Period | 02/9/16 → 02/9/20 |
ASJC Scopus subject areas
- Language and Linguistics
- Linguistics and Language