TY - GEN
T1 - Decision tree based clustering
AU - Yook, Dongsuk
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2002.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2002
Y1 - 2002
N2 - Adecision tree can be used not only as a classifier but also as a clustering method. One of such applications can be found in automatic speech recognition using hidden Markov models (HMMs). Due to the insufficient amount of training data, similar states of triphone HMMs are grouped together using a decision tree to share a common probability distribution. At the same time, in order to predict the statistics of unseen triphones, the decision tree is used as a classifier as well. In this paper, we study several cluster split criteria in decision tree building algorithms for the case where the instances to be clustered are probability density functions. Especially, when Gaussian probability distributions are to be clustered, we have found that the Bhattacharyya distance based measures are more consistent than the conventional log likelihood based measure.
AB - Adecision tree can be used not only as a classifier but also as a clustering method. One of such applications can be found in automatic speech recognition using hidden Markov models (HMMs). Due to the insufficient amount of training data, similar states of triphone HMMs are grouped together using a decision tree to share a common probability distribution. At the same time, in order to predict the statistics of unseen triphones, the decision tree is used as a classifier as well. In this paper, we study several cluster split criteria in decision tree building algorithms for the case where the instances to be clustered are probability density functions. Especially, when Gaussian probability distributions are to be clustered, we have found that the Bhattacharyya distance based measures are more consistent than the conventional log likelihood based measure.
UR - http://www.scopus.com/inward/record.url?scp=84947976088&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84947976088&partnerID=8YFLogxK
U2 - 10.1007/3-540-45675-9_73
DO - 10.1007/3-540-45675-9_73
M3 - Conference contribution
AN - SCOPUS:84947976088
SN - 9783540440253
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 487
EP - 492
BT - Intelligent Data Engineering and Automated Learning - IDEAL 2002 - 3rd International Conference, Proceedings
A2 - Yin, Hujun
A2 - Allinson, Nigel
A2 - Freeman, Richard
A2 - Keane, John
A2 - Hubbard, Simon
PB - Springer Verlag
T2 - 3rd International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2002
Y2 - 12 August 2002 through 14 August 2002
ER -