TY - GEN
T1 - Denoising and dimension reduction in feature space
AU - Braun, Mikio L.
AU - Buhmann, Joachim
AU - Müller, Klaus Robert
PY - 2007
Y1 - 2007
N2 - We show that the relevant information about a classification problem in feature space is contained up to negligible error in a finite number of leading kernel PCA components if the kernel matches the underlying learning problem. Thus, kernels not only transform data sets such that good generalization can be achieved even by linear discriminant functions, but this transformation is also performed in a manner which makes economic use of feature space dimensions. In the best case, kernels provide efficient implicit representations of the data to perform classification. Practically, we propose an algorithm which enables us to recover the subspace and dimensionality relevant for good classification. Our algorithm can therefore be applied (1) to analyze the interplay of data set and kernel in a geometric fashion, (2) to help in model selection, and to (3) de-noise in feature space in order to yield better classification results.
AB - We show that the relevant information about a classification problem in feature space is contained up to negligible error in a finite number of leading kernel PCA components if the kernel matches the underlying learning problem. Thus, kernels not only transform data sets such that good generalization can be achieved even by linear discriminant functions, but this transformation is also performed in a manner which makes economic use of feature space dimensions. In the best case, kernels provide efficient implicit representations of the data to perform classification. Practically, we propose an algorithm which enables us to recover the subspace and dimensionality relevant for good classification. Our algorithm can therefore be applied (1) to analyze the interplay of data set and kernel in a geometric fashion, (2) to help in model selection, and to (3) de-noise in feature space in order to yield better classification results.
UR - http://www.scopus.com/inward/record.url?scp=84862272240&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862272240&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84862272240
SN - 9780262195683
T3 - Advances in Neural Information Processing Systems
SP - 185
EP - 192
BT - Advances in Neural Information Processing Systems 19 - Proceedings of the 2006 Conference
T2 - 20th Annual Conference on Neural Information Processing Systems, NIPS 2006
Y2 - 4 December 2006 through 7 December 2006
ER -